How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos, providing diverse, geometrically grounded, unbiased, and surgical-action-oriented queries generated using scene graphs.
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
Visual Question Answering in the Medical Domain VQA-Med 2019
Part of our final year project work involving complex NLP tasks along with experimentation on various datasets and different LLMs
Counterfactual Reasoning VQA Dataset
B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.
The Easy Visual Question Answering dataset.
SciGraphQA
A lightweight deep learning model with a web application for answering image-based questions using a non-generative approach, built for the VizWiz Grand Challenge 2023 by carefully curating the answer vocabulary and adding a linear layer on top of OpenAI's CLIP model as the image and text encoder
Visual Question Answering (VQA) software! Powered by Flask, this project seamlessly combines images and questions to generate accurate responses. Explore the world of interactive visual understanding with ease.
Multi-page document understanding and VQA using OCR-free method
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
CloudCV Visual Question Answering Demo
Streamlit app for demonstrating multi-modal (vision + language) modelling in PyTorch.
VQA-Med 2021
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Egunean Behin Visual Question Answering Dataset