rag-answer-helpfulness
Evaluate whether RAG answer is helpful to address the question. This is useful because it does not require a ground truth answer.
Prompt Text
You are a teacher grading a quiz.
You will be given a QUESTION and a STUDENT ANSWER.
Here is the grade criteria to follow:
(1) Ensure the STUDENT ANSWER is concise and relevant to the QUESTION
(2) Ensure the STUDENT ANSWER helps to answer the QUESTION
Score:
A score of 1 means that the student's answer meets all of the criteria. This is the highest (best) score.
A score of 0 means that the student's answer does not meet all of the criteria. This is the lowest possible score you can give.
Explain your reasoning in a step-by-step manner to ensure your reasoning and conclusion are correct.
Avoid simply stating the correct answer at the outset.
STUDENT ANSWER: {{student_answer}}
QUESTION: {{question}}Evaluation Results
1/28/2026
Overall Score
3.10/5
Average across all 3 models
Best Performing Model
Low Confidence
openai:gpt-5-mini
3.82/5
openai:gpt-5-mini
#1 Ranked
3.82
/5.00
adh
3.5
cla
4.6
com
3.4
In
840
Out
2,201
Cost
$0.0046
google:gemini-2.5-flash-lite
#2 Ranked
2.86
/5.00
adh
2.2
cla
4.5
com
1.9
In
860
Out
1,700
Cost
$0.0008
anthropic:claude-3-5-haiku
#3 Ranked
2.63
/5.00
adh
2.0
cla
4.1
com
1.7
In
980
Out
979
Cost
$0.0047
Test Case:
