Prompt performance, not guesswork

Every prompt tested across the same models. Scored by independent AI judges.

Evaluating across GPT-4, Claude 3.5, and Gemini 1.5

All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
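The page does not say how the two judges' scores are combined, so the following is only a minimal sketch of one plausible reading. It assumes the consensus is an unweighted mean of the judges' scores, that "Winner" is the best model's consensus score, and that "Overall" is the mean across models. The model names, the function names, and the numbers are hypothetical.

```python
from statistics import mean

# Hypothetical judge scores on a 1-5 scale: model -> judge -> score.
# The judge names mirror the page; the models and numbers are made up.
judge_scores = {
    "model-a": {"gpt-4o-mini": 4.0, "claude-3-haiku": 4.4},
    "model-b": {"gpt-4o-mini": 3.5, "claude-3-haiku": 3.7},
}

def consensus(scores_by_judge: dict[str, float]) -> float:
    """Assumed consensus rule: unweighted mean of the judges' scores."""
    return mean(scores_by_judge.values())

def summarize(scores: dict[str, dict[str, float]]) -> dict[str, object]:
    """Collapse per-judge scores into the three fields shown on each card."""
    per_model = {model: consensus(judges) for model, judges in scores.items()}
    best = max(per_model, key=per_model.get)
    return {
        "best_model": best,                              # highest consensus score
        "winner": round(per_model[best], 1),             # that model's score
        "overall": round(mean(per_model.values()), 1),   # mean across all models
    }

summary = summarize(judge_scores)
print(summary)
```

A weighted mean or a median would be equally reasonable choices when there are more judges; the mean is used here only because it is the simplest rule consistent with the word "aggregated".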


24+ prompts found

Extraction

rag-qa-with-history

Best Model: gpt-5-mini
Overall: 3.9
Winner: 4.2

Extraction

rag-prompt-llama3

Best Model: gemini-2.5-flash-lite
Overall: 3.7
Winner: 3.9

Extraction

rag-prompt-med

Best Model: gpt-5-mini
Overall: 3.6
Winner: 3.8

Extraction

more-crafted-rag-prompt

Best Model: gemini-2.5-flash-lite
Overall: 3.4
Winner: 4.0

Extraction

rag-prompt

Best Model: gemini-2.5-flash-lite
Overall: 3.4
Winner: 3.8

Summarization

simple-rag

Best Model: claude-3-5-haiku
Overall: 3.4
Winner: 3.8

Extraction

rag-prompt

Best Model: gpt-5-mini
Overall: 3.4
Winner: 3.5

Extraction

sport-routine-to-program-short

Best Model: gpt-5-mini
Overall: 3.2
Winner: 5.0

Extraction

rag-prompt

Best Model: gpt-5-mini
Overall: 3.2
Winner: 3.4

Extraction

rag-answer-helpfulness

Best Model: gpt-5-mini
Overall: 3.1
Winner: 3.8

Extraction

rag-with-history-guidance

Best Model: gemini-2.5-flash-lite
Overall: 3.1
Winner: 3.9

Extraction

rag-prompt-chat-history

Best Model: gpt-5-mini
Overall: 3.0
Winner: 3.5

Extraction

rag-answer-hallucination

Best Model: claude-3-5-haiku
Overall: 2.9
Winner: 3.9

Summarization

sport-routine-to-program

Best Model: gpt-5-mini
Overall: 2.9
Winner: 3.7

Classification

tnt-llm-taxonomy-generation

Best Model: claude-3-5-haiku
Overall: 2.8
Winner: 3.0

Extraction

rag-context-precision

Best Model: gpt-5-mini
Overall: 2.8
Winner: 3.1

Summarization

pre-next-5-summarization

Best Model: claude-3-5-haiku
Overall: 2.8
Winner: 3.1

Extraction

assumption-checker

Best Model: claude-3-5-haiku
Overall: 2.8
Winner: 4.7

Extraction

rag-answer-hallucination

Best Model: gemini-2.5-flash-lite
Overall: 2.7
Winner: 3.7

Extraction

my-first-prompt

Best Model: gpt-5-mini
Overall: 2.7
Winner: 3.2

Extraction

pairwise-evaluation-2

Best Model: claude-3-5-haiku
Overall: 2.7
Winner: 3.1

Summarization

pre-reflection-summary

Best Model: claude-3-5-haiku
Overall: 2.7
Winner: 4.1

Extraction

chat-langchain-rephrase

Best Model: gemini-2.5-flash-lite
Overall: 2.6
Winner: 2.9

Extraction

youtube-transcript-to-article

Best Model: gpt-5-mini
Overall: 2.6
Winner: 2.7