Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add ocr vqa images #1458

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

add ocr vqa images #1458

wants to merge 1 commit into from

Conversation

Victorwz
Copy link

@Victorwz Victorwz commented Apr 26, 2024

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

@SamuelSchmidgall
Copy link

You are a legend

@hellangleZ
Copy link

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

@Victorwz
Copy link
Author

Victorwz commented May 1, 2024

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

@hellangleZ
Copy link

Most of downloading urls of images in ocr_vqa dataset are no longer available. Everyone has to rerun the downloading script to get a small portion of ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I zip all images from original release from https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip to their path of ./ocr_vqa/images

It's parquet, not jpg, can not use to train directly

You misunderstand my pull request. Please check the changed readme file. The downloading link for the ocr_vqa images are https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The mentioned link is the original release.

Dude, You are a Legend

@yanghu819
Copy link

hero!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants