Real-time insights into your LLM A/B testing experiments
| Experiment Name | Status | Users | Primary Metric | Action |
|---|---|---|---|---|
| Gemini-1.5 vs GPT-4 Summarization | Running | 3945 |
converted
|
View |
| GPT-3.5 vs GPT-4 Cost-Quality Tradeoff | Running | 4009 |
response_quality_score
|
View |
| Llama-3-70B vs Mixtral-8x7B Q&A | Completed | 4018 |
converted
|
View |
| GPT-4 vs Claude-3 Code Generation | Completed | 4029 |
response_quality_score
|
View |