2024-01-01 - 2024-02-15
response_quality_score| Metric | Control | Treatment | Lift | CI (95%) | P-value | Significant | Effect Size |
|---|---|---|---|---|---|---|---|
response_quality_score |
7.82 | 8.15 | +4.22% | [0.21, 0.45] | 0.0023 | Yes | 0.24 |
converted |
0.72 | 0.75 | +4.17% | [0.01, 0.05] | 0.0156 | Yes | 0.07 |
latency_ms |
1245.0 | 1420.0 | +14.06% | [120.0, 230.0] | 0.0001 | Yes | 0.31 |
| Segment | Control Rate | Treatment Rate | Lift | Sample Size | Significant |
|---|---|---|---|---|---|
free |
0.68 | 0.7 | +2.94% | 2800 | No |
pro |
0.75 | 0.8 | +6.67% | 1400 | Yes |
enterprise |
0.82 | 0.88 | +7.32% | 321 | Yes |
| Segment | Control Rate | Treatment Rate | Lift | Sample Size | Significant |
|---|---|---|---|---|---|
code_generation |
0.78 | 0.84 | +7.69% | 1500 | Yes |
text_summarization |
0.7 | 0.72 | +2.86% | 1200 | No |
qa |
0.68 | 0.71 | +4.41% | 1100 | No |
creative_writing |
0.72 | 0.75 | +4.17% | 721 | No |