Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Zero-shot retrieval reproduction issue #112

Open
jqsun98 opened this issue Apr 25, 2024 · 1 comment
Open

Zero-shot retrieval reproduction issue #112

jqsun98 opened this issue Apr 25, 2024 · 1 comment

Comments

@jqsun98
Copy link

jqsun98 commented Apr 25, 2024

According to the ReadMe at https://github.com/OpenGVLab/InternVideo/tree/main/InternVideo1/Downstream/Video-Text-Retrieval, the zero-shot retrieval results will be obtained after running the command ./zeroshot_scripts/eval_msrvtt.sh. This command will execute the main_task_retrieval.py. But in "main_task_retrieval.py", I find that the model is CLIP4CLIP, instead of ViCLIP. I'd like to know how to conduct zero-shot video-text retrieval experiments with pretrained ViCLIP.

@leexinhao
Copy link
Collaborator

Maybe you need to use the code of Internvideo2.mulitidality and add a model defintion of ViCLIP.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants