Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于Two-Models Ensemble的问题 #2

Open
xixiareone opened this issue Nov 17, 2020 · 2 comments
Open

关于Two-Models Ensemble的问题 #2

xixiareone opened this issue Nov 17, 2020 · 2 comments

Comments

@xixiareone
Copy link

你好,我想请教论文表1中的Two-Models Ensemble这一实验,不是很能理解,想请教一下作者,是哪两种model进行ensemble呢,以及对于bert和gru又如何做不同的ensemble?

非常感谢!

@kywen1119
Copy link
Owner

kywen1119 commented Nov 17, 2020

感谢您的关注!
Two-models ensemble是把同一个网络训练两次,分别保存为两个模型(训练时由于随机种子训练结果也不同),分别推理得到image与text的相似度矩阵(在evaluation.py文件中会保存为.npy文件),将两个矩阵求平均得到最终测试的相似度矩阵。
简单地说,基于bert的模型训练两次进行ensemble;基于gru的模型训练两次进行ensemble。这两个是分开的。

Thank you for your attention!
For two-models ensemble, we train the same network twice and save them as two models (due to different random seed training results during training), and infers the similarity matrix between image and text (which will be saved in the evaluation.py file as .npy file), average the two matrices to get the final test similarity matrix.
Generally, the bert-based model is trained twice for ensemble; the gru-based model is trained twice for ensemble. These two are separate.

@xixiareone
Copy link
Author

非常感谢~~~,你解决了我的困惑~~~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants