Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于悟空大模型 #210

Open
Soulscb opened this issue Sep 2, 2022 · 1 comment
Open

关于悟空大模型 #210

Soulscb opened this issue Sep 2, 2022 · 1 comment

Comments

@Soulscb
Copy link

Soulscb commented Sep 2, 2022

悟空大模型Vit_l_G,模型效果似乎不是很好,贵方有没有试过呢?

@mengxj08
Copy link
Contributor

您指的是Wukong-ViT-L吗?我们在paper有验证过性能。您看一下加载的config是否则正确?
另外,Wukong-ViT-L采用的是细粒度对齐的训练,inference的时候每个patch和token都会参与计算,不是像CLIP一样只使用[CLS]作为图像和文本的global表征。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants