add ocr vqa images #1458

Victorwz · 2024-04-26T03:04:54Z

Most of the download URLs for images in the ocr_vqa dataset are no longer available. Everyone has to rerun the download script, and even then gets only a small portion of the ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I packed all images from the original release into a zip file. Everyone can easily download it and unzip it to their ./ocr_vqa/images path.

SamuelSchmidgall · 2024-04-27T00:11:04Z

You are a legend

hellangleZ · 2024-05-01T04:46:44Z

Most of the download URLs for images in the ocr_vqa dataset are no longer available. Everyone has to rerun the download script, and even then gets only a small portion of the ocr_vqa images in the LLaVA-v1.5-665k instruction dataset. I packed all images from the original release at https://huggingface.co/datasets/howard-hou/OCR-VQA into a zip file. Everyone can easily download it and unzip it to their ./ocr_vqa/images path.

It's in parquet format, not jpg, so it cannot be used for training directly.

Victorwz · 2024-05-01T04:48:27Z

You misunderstand my pull request. Please check the changed readme file. The download link for the ocr_vqa images is https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The link you mentioned is the original release.
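For anyone who prefers to script the download and unzip step, here is a minimal sketch using only the Python standard library. The URL is the one given in this thread; the function names `download_zip` and `extract_images` and the local zip filename are hypothetical, and the folder layout inside the archive is not stated here, so check where the files land after extraction.

```python
import shutil
import urllib.request
import zipfile
from pathlib import Path

# URL quoted in this thread; the archive's internal folder layout is an unknown.
ZIP_URL = (
    "https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images"
    "/resolve/main/ocr_vqa_images_llava_v15.zip?download=true"
)


def download_zip(url: str, zip_path: Path) -> Path:
    """Stream the archive to disk without loading it all into memory."""
    zip_path.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response, open(zip_path, "wb") as out:
        shutil.copyfileobj(response, out)
    return zip_path


def extract_images(zip_path: Path, dest_dir: Path) -> int:
    """Unpack every file in the archive under dest_dir and return the file count."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        return sum(1 for info in archive.infolist() if not info.is_dir())


# Example usage (hypothetical local filename, performs a large download):
# archive = download_zip(ZIP_URL, Path("ocr_vqa_images_llava_v15.zip"))
# print(extract_images(archive, Path("./ocr_vqa/images")), "files extracted")
```

If the archive turns out to contain its own top-level folder, the images will end up one level deeper than ./ocr_vqa/images and may need to be moved up.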

hellangleZ · 2024-05-01T07:26:21Z

Dude, You are a Legend

yanghu819 · 2024-05-05T09:40:59Z

hero!

add ocr vqa images

7299f71

Victorwz force-pushed the main branch from 1dbed95 to 7299f71 on April 27, 2024 00:24
