Json files of pretraining dataset #173

Open
qtli opened this issue May 6, 2024 · 2 comments

qtli commented May 6, 2024

I was following DATA.md to download the pretraining dataset.

However, I cannot find webvid_10m_train.json, cc12m_train.json, and the other annotation files in the OpenGVLab/VideoChat2-IT repository. How can I download these annotation files so that I can place them under the anno_pretrain/ directory?

qtli commented May 6, 2024

Could anyone help me out? Thanks in advance!

qtli changed the title from "pretrain dataset" to "Json files of pretraining dataset" on May 6, 2024

bexxnaz commented May 14, 2024

You can download these datasets from the following links:

webvid-10M: TempoFunk/webvid-10M
cc12m: GitHub Repository
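
Not from the thread, but as a rough sketch: once the data is downloaded, a small check like the one below could confirm which of the annotation files named in the question are still missing from anno_pretrain/. The helper name `missing_annotations` and the `EXPECTED` list are hypothetical; only the two file names mentioned above are included, and the thread does not say whether the linked sources ship files under these exact names.

```python
from pathlib import Path

# Assumption: only the two file names quoted in the question are listed here.
# The complete set of expected annotation files is defined in DATA.md.
EXPECTED = ["webvid_10m_train.json", "cc12m_train.json"]


def missing_annotations(anno_dir, expected=EXPECTED):
    """Return the expected annotation file names that are not present in anno_dir."""
    root = Path(anno_dir)
    return [name for name in expected if not (root / name).is_file()]


if __name__ == "__main__":
    # Report each file that still needs to be placed under anno_pretrain/.
    for name in missing_annotations("anno_pretrain"):
        print(f"missing: anno_pretrain/{name}")
```

Running the script from the directory that contains anno_pretrain/ prints one line per file that has not been placed there yet.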
