Prompt performance, not guesswork
Every prompt tested across the same models. Scored by independent AI judges.
Evaluating across GPT-4, Claude 3.5, and Gemini 1.5
All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
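The page does not say how the judge scores are combined, so the following is only a minimal sketch of one plausible scheme. It assumes consensus is the plain mean of the judges' scores, that "Winner" is the best model's consensus score, and that "Overall" is the mean across all evaluated models. The function names, model names, and judge names are hypothetical.

```python
from statistics import mean


def consensus_score(judge_scores):
    """Average the scores that independent judges gave one model's output.

    Assumption: consensus is a plain mean; the real aggregation may differ.
    """
    if not judge_scores:
        raise ValueError("at least one judge score is required")
    return round(mean(list(judge_scores.values())), 1)


def summarize_prompt(scores_by_model):
    """Reduce per-model, per-judge scores to the fields shown on a result card."""
    per_model = {
        model: consensus_score(judge_scores)
        for model, judge_scores in scores_by_model.items()
    }
    best_model = max(per_model, key=per_model.get)
    return {
        "best_model": best_model,
        "winner": per_model[best_model],  # assumed: score of the best model
        "overall": round(mean(list(per_model.values())), 1),  # assumed: mean over models
    }


# Hypothetical scores on a 1-5 scale, two judges per model.
example = {
    "model-a": {"judge-1": 5.0, "judge-2": 4.0},
    "model-b": {"judge-1": 4.0, "judge-2": 4.0},
    "model-c": {"judge-1": 3.0, "judge-2": 4.0},
}
summary = summarize_prompt(example)
print(summary)
```

Under these assumptions a prompt can have a high Winner score and a much lower Overall score when one model handles it well and the others do not.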
24+ prompts found
Classification
Sentiment Analysis
Best Model: claude-3-5-haiku
Overall: 4.9
Winner: 5.0
Extraction
City Extractor (Few-Shot)
Best Model: gemini-2.5-flash-lite
Overall: 4.9
Winner: 4.9
Classification
Language Detector
Best Model: gpt-5-mini
Overall: 4.7
Winner: 4.8
Extraction
Capital City Extractor
Best Model: gemini-2.5-flash-lite
Overall: 4.7
Winner: 4.8
Extraction
chat-langchain-general-prompt
Best Model: gpt-5-mini
Overall: 4.6
Winner: 4.8
Summarization
Legal Document Summarizer
Best Model: gemini-2.5-flash-lite
Overall: 4.6
Winner: 4.9
Extraction
Table Data Extractor
Best Model: gpt-5-mini
Overall: 4.5
Winner: 5.0
Classification
Topic Classifier
Best Model: gemini-2.5-flash-lite
Overall: 4.5
Winner: 4.9
Extraction
chat-langchain-response-prompt
Best Model: claude-3-5-haiku
Overall: 4.4
Winner: 4.7
Extraction
Quote and Citation Extractor
Best Model: claude-3-5-haiku
Overall: 4.3
Winner: 4.8
Extraction
bytes_to_megabytes
Best Model: claude-3-5-haiku
Overall: 4.1
Winner: 4.3
Extraction
RAG Query Answering
Best Model: gpt-5-mini
Overall: 4.0
Winner: 4.1
Summarization
medical-docs-summarizer
Best Model: claude-3-5-haiku
Overall: 3.9
Winner: 4.2
Extraction
rag-qa-with-history
Best Model: gpt-5-mini
Overall: 3.9
Winner: 4.2
Extraction
retrieval-qa-chat
Best Model: gpt-5-mini
Overall: 3.9
Winner: 4.1
Extraction
chat-langchain-more-info-prompt
Best Model: claude-3-5-haiku
Overall: 3.8
Winner: 4.2
Extraction
python_repl
Best Model: gemini-2.5-flash-lite
Overall: 3.8
Winner: 4.3
Extraction
self-rag-answer-grader
Best Model: gemini-2.5-flash-lite
Overall: 3.8
Winner: 4.8
Extraction
tweet-critic-fewshot
Best Model: claude-3-5-haiku
Overall: 3.8
Winner: 4.1
Extraction
rag-prompt-llama3
Best Model: gemini-2.5-flash-lite
Overall: 3.7
Winner: 3.9
Extraction
sciscigpt-tool-eval
Best Model: gemini-2.5-flash-lite
Overall: 3.7
Winner: 4.9
Extraction
rag-prompt-med
Best Model: gpt-5-mini
Overall: 3.6
Winner: 3.8
Extraction
tbot20_rag
Best Model: claude-3-5-haiku
Overall: 3.6
Winner: 4.0
Extraction
aza-hr-workflow-prompt-v4
Best Model: gemini-2.5-flash-lite
Overall: 3.5
Winner: 5.0
