Gaia Evaluator

Advanced AI Model Quality Analysis & Comparison

🔬 Data Generation · 💰 Cost Tracking · 📊 Quality Scoring · ⚡ One-Click Loading · 🔍 Detailed Analysis · 📈 Model Comparison

Available Reports

Test Data

Groundtruth

Experiments

Evaluations

Agent Outputs

No reports loaded

Select experiment and/or evaluation files to visualize results.