Simple question: What are the public datasets included in InternVid-200M? #100

jong980812 · 2024-04-15T16:38:47Z

I would like to use ViCLIP-B-16 on InternVid-200M, from "InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation." Does this dataset (or InternVid-FLT) contain videos from Kinetics400, SSV2, and UCF101? The paper does not make clear whether only the labels of these datasets were referenced or whether their videos were also included.

shepnerd · 2024-04-16T02:47:26Z

It does not contain videos from the datasets you mentioned. We clarified this in Sec. 3.1 (data curation) as follows: "We ensure the uniqueness of our dataset by creating a database of YouTube video IDs and excluding any videos already present in publicly available datasets (released prior to April 2023)."
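
For readers who want to picture the ID-based exclusion described in the quoted passage, here is a minimal sketch. It is not the authors' actual pipeline: the function name `exclude_known_videos` and the example IDs are hypothetical, and it assumes each video is identified by a plain YouTube ID string.

```python
def exclude_known_videos(candidate_ids, known_ids):
    """Return candidate IDs not found in existing public datasets.

    Hypothetical helper: keeps first occurrences only and preserves order.
    """
    known = set(known_ids)   # IDs already used by public datasets
    seen = set()             # IDs already kept, to drop repeats
    kept = []
    for vid in candidate_ids:
        if vid in known or vid in seen:
            continue
        seen.add(vid)
        kept.append(vid)
    return kept


# Made-up IDs, for illustration only.
known_ids = {"abc123XYZ_0", "def456UVW_1"}
candidates = ["abc123XYZ_0", "new000AAA_2", "new000AAA_2", "new111BBB_3"]
filtered = exclude_known_videos(candidates, known_ids)
print(filtered)
```

A check like this only catches overlap for datasets whose videos carry YouTube IDs, so it says nothing about videos collected from other sources.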
