Upgrade to Pro

Unlock premium features and boost productivity!

Learn More

Evaluation

Evaluate and benchmark your AI models and agents.

Accuracy
94.8%

+2.4% from last month

Response Time
124ms

-15ms from last month

Relevance Score
8.7/10

+0.5 from last month

Error Rate
0.8%

-0.3% from last month

Performance by Model

Performance chart would appear here

Metrics Over Time

Trends chart would appear here

Model Performance
ModelAccuracyResponse TimeRelevanceError RateRAGAS ScoreLast Evaluated
GPT-4o96.2%145ms9.1/100.5%0.872 days ago
Claude 395.8%120ms8.9/100.7%0.853 days ago
Llama 393.5%95ms8.5/101.2%0.821 week ago
Mistral Large94.2%110ms8.7/100.9%0.835 days ago
Custom Fine-tuned Model97.1%130ms9.3/100.4%0.891 day ago