We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
python run.py --models hf_llama2_7b --custom-dataset-path xxx/test_qa.jsonl --custom-dataset-data-type qa --custom-dataset-infer-method gen
使用这个命令得到的结果得分默认是accuracy。这意味着要完全相同才能算对么?如何替换成别的评估指标呢? 通过新增配置文件,学习成本有点高。。。
The text was updated successfully, but these errors were encountered:
tonysy
No branches or pull requests
Describe the feature
python run.py
--models hf_llama2_7b
--custom-dataset-path xxx/test_qa.jsonl
--custom-dataset-data-type qa
--custom-dataset-infer-method gen
使用这个命令得到的结果得分默认是accuracy。这意味着要完全相同才能算对么?如何替换成别的评估指标呢?
通过新增配置文件,学习成本有点高。。。
Will you implement it?
The text was updated successfully, but these errors were encountered: