Prompt performance,
not guesswork

Every prompt tested across the same models. Scored by independent AI judges.

Evaluating across GPT-4, Claude 3.5, and Gemini 1.5
Sort By

All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).

How it works →

24 prompts+ found

Classification

Sentiment Analysis

Best Modelclaude-3-5-haiku
Overall4.9
Winner5.0
0
View details →
Extraction

City Extractor (Few-Shot)

Best Modelgemini-2.5-flash-lite
Overall4.9
Winner4.9
0
View details →
Classification

Language Detector

Best Modelgpt-5-mini
Overall4.7
Winner4.8
0
View details →
Extraction

Capital City Extractor

Best Modelgemini-2.5-flash-lite
Overall4.7
Winner4.8
0
View details →
Extraction

chat-langchain-general-prompt

Best Modelgpt-5-mini
Overall4.6
Winner4.8
0
View details →
Summarization

Legal Document Summarizer

Best Modelgemini-2.5-flash-lite
Overall4.6
Winner4.9
0
View details →
Extraction

Table Data Extractor

Best Modelgpt-5-mini
Overall4.5
Winner5.0
0
View details →
Classification

Topic Classifier

Best Modelgemini-2.5-flash-lite
Overall4.5
Winner4.9
0
View details →
Extraction

chat-langchain-response-prompt

Best Modelclaude-3-5-haiku
Overall4.4
Winner4.7
0
View details →
Extraction

Quote and Citation Extractor

Best Modelclaude-3-5-haiku
Overall4.3
Winner4.8
0
View details →
Extraction

bytes_to_megabytes

Best Modelclaude-3-5-haiku
Overall4.1
Winner4.3
0
View details →
Extraction

RAG Query Answering

Best Modelgpt-5-mini
Overall4.0
Winner4.1
0
View details →
Summarization

medical-docs-summarizer

Best Modelclaude-3-5-haiku
Overall3.9
Winner4.2
0
View details →
Extraction

rag-qa-with-history

Best Modelgpt-5-mini
Overall3.9
Winner4.2
0
View details →
Extraction

retrieval-qa-chat

Best Modelgpt-5-mini
Overall3.9
Winner4.1
0
View details →
Extraction

chat-langchain-more-info-prompt

Best Modelclaude-3-5-haiku
Overall3.8
Winner4.2
0
View details →
Extraction

python_repl

Best Modelgemini-2.5-flash-lite
Overall3.8
Winner4.3
0
View details →
Extraction

self-rag-answer-grader

Best Modelgemini-2.5-flash-lite
Overall3.8
Winner4.8
0
View details →
Extraction

tweet-critic-fewshot

Best Modelclaude-3-5-haiku
Overall3.8
Winner4.1
0
View details →
Extraction

rag-prompt-llama3

Best Modelgemini-2.5-flash-lite
Overall3.7
Winner3.9
0
View details →
Extraction

sciscigpt-tool-eval

Best Modelgemini-2.5-flash-lite
Overall3.7
Winner4.9
0
View details →
Extraction

rag-prompt-med

Best Modelgpt-5-mini
Overall3.6
Winner3.8
0
View details →
Extraction

tbot20_rag

Best Modelclaude-3-5-haiku
Overall3.6
Winner4.0
0
View details →
Extraction

aza-hr-workflow-prompt-v4

Best Modelgemini-2.5-flash-lite
Overall3.5
Winner5.0
0
View details →
Scroll for more