Skip to content

请教,ColossalEval inference 阶段的 output 没有遵循指令回答问题 #4973

Closed Answered by chengeharrison
xyxxxxx asked this question in Community | Q&A
Discussion options

You must be logged in to vote

这其实是正常的,有的7B模型可以直接先给出选项,有的不行。但在评估的时候我们可以根据模型预测的第一个token在对应A, B, C, D上的概率来判断模型选择了哪个选项。哪个概率大就代表模型选了哪个。上面第二个问题可以看到模型预测A的概率最大,然后与target一样。可以参考MMLU的repo,他们评测时就是用的这个方法。

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by xyxxxxx
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants