Task Success
--
Relevance
--
Hallucination
--
Consistency
--
Gate Pass Rate
--
Overall Score
--
Last Run Summary
Loading...
Metrics Over Time
Provider Comparison
Live Evaluations
Evaluation Runs
0 runs| Provider / Model | Dataset | Commit | Score | Gate | Regression | Timestamp |
|---|
Average Scores by Provider
Metric Breakdown
Provider Comparison Table
| Provider / Model | Runs | Overall | Task Success | Relevance | Hallucination | Gate Rate | Trend |
|---|
Quality Gate History
Gate Events
| Run ID | Result | Overall Score | Failed Metrics | Commit | Timestamp |
|---|