LLM A/B Dashboard
Dashboard
Experiments
Models
Dashboard
Experiments
Gemini-1.5 vs GPT-4 Summariza…
Gemini-1.5 vs GPT-4 Summarization
Running
2024-02-15
2156
Total Users
17248
Total Interactions
2
Variants
converted
Primary Metric
Conversion Rate Over Time
Metric Comparison
Data Quality Checks
Temporal Leakage
Passed
Duplicate Records
Passed
Cross-Group Contamination
Passed
Sample Ratio Mismatch
Passed
Recommendations
No recommendations available