Prompt performance,
not guesswork
Every prompt tested across the same models. Scored by independent AI judges.
Evaluating across GPT-4, Claude 3.5, and Gemini 1.5
Sort By
All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
24 prompts+ found
Classification
kold
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
get_multiple_choice_answer_fewshot_en
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
gen_tasks
Best Modelgpt-4o-mini
5.0
0
View details →
Summarization
pre-top-3-summarization
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
qa-react
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
sport-routine-to-program-short
Best Modelgemini-1.5-flash
5.0
0
View details →
Summarization
sport-routine-to-program
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
prompt-vi-06
Best Modelclaude-3-haiku
5.0
0
View details →
Summarization
pre-next-5-summarization
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
sciscigpt-tool-eval
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
drug_interaction_checker
Best Modelgemini-1.5-flash
5.0
0
View details →
Classification
tnt-llm-classify
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
rag-answer-hallucination
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
evaluator-rag-precision
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
more-crafted-rag-prompt
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
generate_questions_by_knowledge_tags
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
text-to-sql
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
hatedetection
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
aza-hr-workflow-prompt-v4
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
rag-answer-helpfulness
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
qa-react-dev
Best Modelgemini-1.5-flash
5.0
0
View details →
Extraction
russian_react_chat
Best Modelclaude-3-haiku
5.0
0
View details →
Extraction
test-question-making
Best Modelgpt-4o-mini
5.0
0
View details →
Extraction
tab2_widget2_openai
Best Modelgemini-1.5-flash
5.0
0
View details →
Scroll for more
