Prompt performance, not guesswork
Every prompt is tested across the same models and scored by independent AI judges.
Evaluating across GPT-4, Claude 3.5, and Gemini 1.5
All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
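The page does not say how the two judges' scores are combined. As a minimal sketch, assuming "consensus" means a plain average of the per-judge scores on a 1-5 scale (both are assumptions, and the function name and sample values are hypothetical), the aggregation could look like this:

```python
from statistics import mean


def consensus_score(judge_scores: dict[str, float]) -> float:
    """Combine per-judge scores into one number.

    Assumption: consensus is a simple mean, rounded to one decimal
    to match the precision shown on the leaderboard.
    """
    if not judge_scores:
        raise ValueError("at least one judge score is required")
    return round(mean(judge_scores.values()), 1)


# Hypothetical scores for one model response, keyed by judge name.
example = {"gpt-4o-mini": 4.0, "claude-3-haiku": 3.0}
print(consensus_score(example))
```

Other rules (median, weighted mean, or discarding scores when the judges disagree strongly) would fit the same interface; the average is used here only because it is the simplest choice.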
24+ prompts found
Category        Prompt                          Best model             Overall  Winner
Extraction      rag-qa-with-history             gpt-5-mini             3.9      4.2
Extraction      rag-prompt-llama3               gemini-2.5-flash-lite  3.7      3.9
Extraction      rag-prompt-med                  gpt-5-mini             3.6      3.8
Extraction      more-crafted-rag-prompt         gemini-2.5-flash-lite  3.4      4.0
Extraction      rag-prompt                      gemini-2.5-flash-lite  3.4      3.8
Summarization   simple-rag                      claude-3-5-haiku       3.4      3.8
Extraction      rag-prompt                      gpt-5-mini             3.4      3.5
Extraction      sport-routine-to-program-short  gpt-5-mini             3.2      5.0
Extraction      rag-prompt                      gpt-5-mini             3.2      3.4
Extraction      rag-answer-helpfulness          gpt-5-mini             3.1      3.8
Extraction      rag-with-history-guidance       gemini-2.5-flash-lite  3.1      3.9
Extraction      rag-prompt-chat-history         gpt-5-mini             3.0      3.5
Extraction      rag-answer-hallucination        claude-3-5-haiku       2.9      3.9
Summarization   sport-routine-to-program        gpt-5-mini             2.9      3.7
Classification  tnt-llm-taxonomy-generation     claude-3-5-haiku       2.8      3.0
Extraction      rag-context-precision           gpt-5-mini             2.8      3.1
Summarization   pre-next-5-summarization        claude-3-5-haiku       2.8      3.1
Extraction      assumption-checker              claude-3-5-haiku       2.8      4.7
Extraction      rag-answer-hallucination        gemini-2.5-flash-lite  2.7      3.7
Extraction      my-first-prompt                 gpt-5-mini             2.7      3.2
Extraction      pairwise-evaluation-2           claude-3-5-haiku       2.7      3.1
Summarization   pre-reflection-summary          claude-3-5-haiku       2.7      4.1
Extraction      chat-langchain-rephrase         gemini-2.5-flash-lite  2.6      2.9
Extraction      youtube-transcript-to-article   gpt-5-mini             2.6      2.7
