LLM A/B Dashboard
Dashboard
Experiments
Models
Experiments
Browse and analyze all A/B test experiments
All
Running
Completed
Running
2 variants
Gemini-1.5 vs GPT-4 Summarization
3945
Users
171244
Interactions
converted
Metric
Running
2 variants
GPT-3.5 vs GPT-4 Cost-Quality Tradeoff
4009
Users
177188
Interactions
response_…
Metric
Completed
2 variants
Llama-3-70B vs Mixtral-8x7B Q&A
4018
Users
175048
Interactions
converted
Metric
Completed
2 variants
GPT-4 vs Claude-3 Code Generation
4029
Users
178130
Interactions
response_…
Metric