This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
-
Updated
May 31, 2024 - Python
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Pheye - a family of efficient small vision-language models
Towards Explainable Metrics for Conditional Image Synthesis Evaluation (ACL 2024)
A study on Knowledge-based question generation from images. Undergraduate Thesis for 2023-2024.
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
LLM projects
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper
a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。
Official repository for the A-OKVQA dataset
Perform visual question answering on your images
Visual Question Answering Using CLIP + LSTM
A collection of resources on applications of multi-modal learning in medical imaging.
An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Multimodal Instruction Tuning for Llama 3
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Add a description, image, and links to the visual-question-answering topic page so that developers can more easily learn about it.
To associate your repository with the visual-question-answering topic, visit your repo's landing page and select "manage topics."