LLM A/B Dashboard
Dashboard
Experiments
Models
Experiments
Browse and analyze all A/B test experiments
All
Running
Completed
Running
2 variants
GPT-3.5 vs GPT-4 Cost-Quality Tradeoff
3245
Users
28405
Interactions
response_…
Metric
Running
2 variants
Gemini-1.5 vs GPT-4 Summarization
2156
Users
17248
Interactions
converted
Metric