✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Updated May 10, 2024
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
[CVPR 2024] 🎬💭 Chat with over 10K frames of video!
Research Trends in LLM-guided Multimodal Learning.
A collection of resources on applications of multi-modal learning in medical imaging.
A Gradio demo of MGIE
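For readers who have not built one of these demos, a minimal Gradio sketch of an instruction-based image-editing UI follows; the `edit_image` stub is a hypothetical placeholder, not MGIE's actual inference code.

```python
import gradio as gr


def edit_image(image, instruction):
    # Placeholder: a real demo would run the editing model here and
    # return the image modified according to the instruction.
    return image


demo = gr.Interface(
    fn=edit_image,
    inputs=[gr.Image(type="pil"), gr.Textbox(label="Edit instruction")],
    outputs=gr.Image(type="pil"),
    title="Instruction-based image editing (sketch)",
)

if __name__ == "__main__":
    demo.launch()
```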
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Personal project: MPP-Qwen14B (Multimodal Pipeline Parallel Qwen-14B). Don't let poverty limit your imagination! Train your own 14B LLaVA-like MLLM on a 24GB RTX 3090/4090.
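As a rough illustration of the idea behind splitting a model across GPUs, here is a naive two-GPU sharding sketch in plain PyTorch. It is not the repo's implementation: true pipeline parallelism also splits each batch into micro-batches so both GPUs stay busy, and `TwoStageModel` with its layer sizes is invented for illustration.

```python
import torch
import torch.nn as nn


class TwoStageModel(nn.Module):
    """Naive two-GPU model sharding: the first half of the layers lives on
    cuda:0 and the second half on cuda:1, so a model too large for one 24GB
    card can still run forward/backward. Assumes two visible GPUs."""

    def __init__(self, d=4096, n_layers=8):
        super().__init__()
        half = n_layers // 2
        self.stage0 = nn.Sequential(*[nn.Linear(d, d) for _ in range(half)]).to("cuda:0")
        self.stage1 = nn.Sequential(*[nn.Linear(d, d) for _ in range(n_layers - half)]).to("cuda:1")

    def forward(self, x):
        x = self.stage0(x.to("cuda:0"))
        return self.stage1(x.to("cuda:1"))  # activations hop across devices


model = TwoStageModel()
loss = model(torch.randn(4, 4096)).sum()
loss.backward()  # autograd routes gradients back across both devices
```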
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
Curated papers on Large Language Models in the healthcare and medical domain
A from-scratch implementation of a vision-language model in pure PyTorch
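The core recipe such implementations follow is small: encode the image, project the visual features into the LLM's embedding space, and prepend them to the text tokens. A minimal sketch of that recipe follows; all sizes are illustrative, and the `TinyVLM` name and linear "vision encoder" are stand-ins, not taken from any particular repo.

```python
import torch
import torch.nn as nn


class TinyVLM(nn.Module):
    """Minimal sketch: vision features -> linear projector -> LM over the
    concatenated [image tokens; text tokens] sequence."""

    def __init__(self, vocab_size=32000, d_model=512, n_heads=8, n_layers=4,
                 img_dim=768):
        super().__init__()
        # stand-in for a real vision encoder (e.g. a ViT): a linear projector
        self.vision_proj = nn.Linear(img_dim, d_model)
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.lm = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, image_feats, input_ids):
        # image_feats: (B, num_patches, img_dim); input_ids: (B, T)
        img_tok = self.vision_proj(image_feats)     # project to LM width
        txt_tok = self.tok_emb(input_ids)
        seq = torch.cat([img_tok, txt_tok], dim=1)  # prepend image tokens
        return self.head(self.lm(seq))


model = TinyVLM()
logits = model(torch.randn(2, 64, 768), torch.randint(0, 32000, (2, 16)))
print(logits.shape)  # torch.Size([2, 80, 32000])
```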
[Paper][Preprint 2023] Making Large Language Models Perform Better in Knowledge Graph Completion
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation