Prompt performance, not guesswork
Every prompt tested across the same models. Scored by independent AI judges.
Evaluating across GPT-4, Claude 3.5, and Gemini 1.5
All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
5 prompts found
Summarization
medical-docs-summarizer
Best Model: claude-3-5-haiku
Overall: 3.9
Winner: 4.2
Extraction
healthcare-provider-prompt
Best Model: claude-3-5-haiku
Overall: 3.1
Winner: 4.9
Extraction
drug_interaction_checker
Best Model: claude-3-5-haiku
Overall: 3.0
Winner: 4.7
Extraction
cardiology_risk_treatment_guide
Best Model: claude-3-5-haiku
Overall: 2.8
Winner: 3.8
Extraction
diabetes_risk_assessment
Best Model: claude-3-5-haiku
Overall: 2.7
Winner: 4.0
