LLM A/B Dashboard
Dashboard
Experiments
Models
Experiments
Browse and analyze all A/B test experiments
All
Running
Completed
Running
2 variants
Gemini-1.5 vs GPT-4 Summarization
3945
Users
171244
Interactions
converted
Metric
Running
2 variants
GPT-3.5 vs GPT-4 Cost-Quality Tradeoff
4009
Users
177188
Interactions
response_…
Metric