Question about the ACC metric dropping in training results #788

Open
xjDUAN184 opened this issue May 14, 2024 · 1 comment
Comments

@xjDUAN184

xjDUAN184 commented May 14, 2024

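The model is bge-m3.
I used 450 training examples. Each example contains 1 query sentence, 1 pos sentence, and 7 neg sentences.
The 7 neg sentences come in two variants:
1. One neg sentence is manually annotated, and the remaining 6 are randomly matched (each satisfying bgemodel.compute_score below 0.7).
2. All 7 are randomly generated.
The 1 pos sentence also comes in two variants: generated by an LLM or manually annotated.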
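For readers who want to reproduce this setup, the first negative-sampling variant described above (annotated negatives first, then random candidates kept only when their score is below 0.7) could be sketched roughly as follows. This is a minimal illustration, not the poster's actual script: `build_negatives`, `score_fn`, and `toy_score` are hypothetical names, `score_fn` stands in for the `bgemodel.compute_score` call mentioned in the post, and scoring the query against each candidate is an assumption.

```python
import random
from typing import Callable, List, Sequence


def build_negatives(
    query: str,
    positive: str,
    candidate_pool: Sequence[str],
    score_fn: Callable[[str, str], float],  # hypothetical stand-in for the model's scorer
    annotated_negatives: Sequence[str] = (),
    num_negatives: int = 7,
    max_score: float = 0.7,
    seed: int = 0,
) -> List[str]:
    """Annotated negatives first, then random candidates scoring below max_score.

    May return fewer than num_negatives if the pool runs out.
    """
    rng = random.Random(seed)
    negatives = list(annotated_negatives)[:num_negatives]
    # Never reuse the positive or an already chosen negative.
    pool = [c for c in candidate_pool if c != positive and c not in negatives]
    rng.shuffle(pool)
    for candidate in pool:
        if len(negatives) >= num_negatives:
            break
        # Assumption: the threshold applies to the (query, candidate) score.
        if score_fn(query, candidate) < max_score:
            negatives.append(candidate)
    return negatives


def toy_score(a: str, b: str) -> float:
    """Toy word-overlap score, used only so the sketch runs without a model."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / max(len(wa | wb), 1)


pool = [f"unrelated passage {i}" for i in range(20)]
negs = build_negatives(
    "what is bge-m3",
    "bge-m3 is an embedding model",
    pool,
    toy_score,
    annotated_negatives=["a labelled hard negative"],
)
```

Passing an empty `annotated_negatives` gives the second variant, where all 7 negatives are random.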
[image]
The experimental results show that whenever the sparse weight in my weight ratio is non-zero, ACC drops. Why does this happen?

@staoxiao
Collaborator

Based on these results, sparse retrieval may not be well suited to your data. You can choose whichever retrieval mode, or combination of modes, works best for you.
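One way to act on this advice is to sweep the mode weights on a held-out set and keep the setting with the best accuracy. The sketch below assumes the final score is a weighted average of per-mode scores (dense, sparse, colbert) and that per-candidate scores have already been computed. The helper names and the toy numbers are made up for illustration and are not results from this issue.

```python
from itertools import product
from typing import Dict, List, Sequence, Tuple

Scores = Dict[str, List[float]]   # mode name -> one score per candidate
Example = Tuple[Scores, int]      # (scores, index of the positive candidate)


def combine(scores: Scores, weights: Sequence[float]) -> List[float]:
    """Weighted average of dense, sparse and colbert scores for each candidate."""
    w_dense, w_sparse, w_colbert = weights
    total = w_dense + w_sparse + w_colbert
    return [
        (w_dense * d + w_sparse * s + w_colbert * c) / total
        for d, s, c in zip(scores["dense"], scores["sparse"], scores["colbert"])
    ]


def top1_accuracy(examples: Sequence[Example], weights: Sequence[float]) -> float:
    """Fraction of examples whose top-scored candidate is the positive."""
    hits = 0
    for scores, pos_index in examples:
        combined = combine(scores, weights)
        best = max(range(len(combined)), key=combined.__getitem__)
        hits += int(best == pos_index)
    return hits / len(examples)


def sweep(examples: Sequence[Example], grid: Sequence[float] = (0.0, 0.5, 1.0)):
    """Accuracy for every weight triple on the grid, skipping the all-zero triple."""
    return {
        w: top1_accuracy(examples, w)
        for w in product(grid, repeat=3)
        if sum(w) > 0
    }


# Toy data (invented): the sparse scores rank the wrong candidate first.
examples = [
    ({"dense": [0.9, 0.4], "sparse": [0.1, 0.6], "colbert": [0.8, 0.5]}, 0),
    ({"dense": [0.3, 0.7], "sparse": [0.5, 0.2], "colbert": [0.4, 0.6]}, 1),
]
results = sweep(examples)
best_weights = max(results, key=results.get)
```

If every setting with a non-zero sparse weight scores lower on your own validation data, setting that weight to 0 is a reasonable choice.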
