LLM A/B Dashboard
Dashboard
Experiments
Models
Experiments
Browse and analyze all A/B test experiments
All
Running
Completed
Completed
2 variants
GPT-4 vs Claude-3 Code Generation
4521
Users
45210
Interactions
response_…
Metric
Completed
2 variants
Llama-3 vs Mixtral Q&A
7834
Users
62672
Interactions
converted
Metric
Running
2 variants
GPT-3.5 vs GPT-4 Cost-Quality Tradeoff
3245
Users
28405
Interactions
response_…
Metric
Running
2 variants
Gemini-1.5 vs GPT-4 Summarization
2156
Users
17248
Interactions
converted
Metric