Prompt performance, not guesswork

Every prompt is tested across the same models and scored by independent AI judges.

Evaluating across GPT-4, Claude 3.5, and Gemini 1.5

All scores are aggregated using multi-judge consensus (GPT-4o Mini + Claude 3 Haiku).
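
The page does not say how the two judges' scores are combined or how "Best Model" is chosen, so the following is only a minimal sketch under stated assumptions: each judge returns a numeric score per model, the consensus is the plain mean of the judges' scores rounded to one decimal, and the best model is the one with the highest consensus. The function names and the example scores are hypothetical.

```python
from statistics import mean

# Judge identifiers as named on the page; using them as dict keys is an assumption.
JUDGES = ("gpt-4o-mini", "claude-3-haiku")


def consensus_score(judge_scores: dict[str, float]) -> float:
    """Combine per-judge scores into one value (assumed rule: arithmetic mean)."""
    missing = [j for j in JUDGES if j not in judge_scores]
    if missing:
        raise ValueError(f"missing scores from judges: {missing}")
    # One decimal place mirrors how scores are displayed; this is an assumption.
    return round(mean(judge_scores[j] for j in JUDGES), 1)


def best_model(results: dict[str, dict[str, float]]) -> tuple[str, float]:
    """Return (model, consensus score) for the model with the highest consensus."""
    scored = {model: consensus_score(scores) for model, scores in results.items()}
    winner = max(scored, key=scored.get)
    return winner, scored[winner]


# Made-up scores for one prompt: outer keys are evaluated models,
# inner keys are the judges that scored each model's output.
example = {
    "gpt-4o-mini": {"gpt-4o-mini": 4.0, "claude-3-haiku": 5.0},
    "gemini-1.5-flash": {"gpt-4o-mini": 5.0, "claude-3-haiku": 5.0},
}
print(best_model(example))
```

A mean is only one possible consensus rule; a median or a minimum across judges would penalize disagreement more strongly.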


24+ prompts found

Classification

kold

Best Model: gemini-1.5-flash
5.0
0
Extraction

get_multiple_choice_answer_fewshot_en

Best Model: claude-3-haiku
5.0
0
Extraction

gen_tasks

Best Model: gpt-4o-mini
5.0
0
Summarization

pre-top-3-summarization

Best Model: claude-3-haiku
5.0
0
Extraction

qa-react

Best Model: gpt-4o-mini
5.0
0
Extraction

sport-routine-to-program-short

Best Model: gemini-1.5-flash
5.0
0
Summarization

sport-routine-to-program

Best Model: claude-3-haiku
5.0
0
Extraction

prompt-vi-06

Best Model: claude-3-haiku
5.0
0
Summarization

pre-next-5-summarization

Best Model: gpt-4o-mini
5.0
0
Extraction

sciscigpt-tool-eval

Best Model: gemini-1.5-flash
5.0
0
Extraction

drug_interaction_checker

Best Model: gemini-1.5-flash
5.0
0
Classification

tnt-llm-classify

Best Model: gpt-4o-mini
5.0
0
Extraction

rag-answer-hallucination

Best Model: gemini-1.5-flash
5.0
0
Extraction

evaluator-rag-precision

Best Model: claude-3-haiku
5.0
0
Extraction

more-crafted-rag-prompt

Best Model: gemini-1.5-flash
5.0
0
Extraction

generate_questions_by_knowledge_tags

Best Model: gpt-4o-mini
5.0
0
Extraction

text-to-sql

Best Model: gemini-1.5-flash
5.0
0
Extraction

hatedetection

Best Model: claude-3-haiku
5.0
0
Extraction

aza-hr-workflow-prompt-v4

Best Model: gpt-4o-mini
5.0
0
Extraction

rag-answer-helpfulness

Best Model: claude-3-haiku
5.0
0
Extraction

qa-react-dev

Best Model: gemini-1.5-flash
5.0
0
Extraction

russian_react_chat

Best Model: claude-3-haiku
5.0
0
Extraction

test-question-making

Best Model: gpt-4o-mini
5.0
0
Extraction

tab2_widget2_openai

Best Model: gemini-1.5-flash
5.0
0