Reading list for Multimodal Large Language Models
Updated Aug 17, 2023
Research Trends in LLM-guided Multimodal Learning.
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
Deploy Chinese CLIP with OpenCV + onnxruntime for text-to-image search: describe the desired picture in one sentence and retrieve matching images from the gallery. Includes both C++ and Python versions (a minimal retrieval sketch follows this list).
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
[Paper][Preprint 2023] Making Large Language Models Perform Better in Knowledge Graph Completion
A Gradio demo of MGIE
A Video Chat Agent with Temporal Prior
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
A PyTorch-based system for highly accurate drug-target interaction predictions utilizing multi-modal large language models to discern structural affinities in drug-target pairs.
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
A curated list of awesome image captioning studies, aimed at annotating and reporting CT / MRI scans
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
An Easy-to-use Hallucination Detection Framework for LLMs.
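The Chinese-CLIP search entry above follows the standard CLIP retrieval recipe: encode the query text and every gallery image into a shared embedding space, then rank images by cosine similarity. Below is a minimal ONNX Runtime sketch of that pattern; the model file names, input names, and preprocessing are assumptions for illustration, not that repo's actual API.

```python
# Minimal sketch of CLIP-style text-to-image retrieval with ONNX Runtime.
# Model file names and input/output names are hypothetical placeholders,
# not the actual files shipped by the repo above.
import numpy as np
import onnxruntime as ort

text_encoder = ort.InferenceSession("clip_text.onnx")    # assumed export
image_encoder = ort.InferenceSession("clip_image.onnx")  # assumed export

def embed(session: ort.InferenceSession, inputs: dict) -> np.ndarray:
    # Run the encoder and L2-normalize, so a dot product equals cosine similarity.
    vec = session.run(None, inputs)[0]
    return vec / np.linalg.norm(vec, axis=-1, keepdims=True)

def build_gallery(image_tensors: list) -> np.ndarray:
    # Precompute normalized embeddings for every image in the gallery.
    feats = [embed(image_encoder, {"pixel_values": t}) for t in image_tensors]
    return np.concatenate(feats, axis=0)

def search(query_tokens: np.ndarray, gallery_feats: np.ndarray, k: int = 5) -> np.ndarray:
    # Embed the one-sentence query and return indices of the top-k images.
    q = embed(text_encoder, {"input_ids": query_tokens})
    scores = gallery_feats @ q.ravel()  # cosine similarity per gallery image
    return np.argsort(-scores)[:k]
```

At query time only the text encoder runs; the gallery embeddings are computed once and cached, which is what makes sentence-in, images-out search fast enough for interactive use.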