FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀
-
Updated
May 23, 2024 - Jupyter Notebook
FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥
A codebase dedicated to exploring multimodal learning approaches by integrating images of host galaxies of supernovae and their corresponding light-curves and spectra.
Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances (ACL 2024)
Deep learning based content moderation from text, audio, video & image input modalities.
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
Pure C 3D Hybrid GAN using Cross attention, attention and convolution
Fine-tuning BLIP for pathological visual question answering.
A curated list of awesome Multimodal studies.
Multimodal Pretraining for Unsupervised Protein Representation Learning
Corpus of resources for multimodal machine learning with physiological signals
Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
LAVIS - A One-stop Library for Language-Vision Intelligence
Multimodal Computer Vision application leveraging object detections, gesture recognition and speech to text, in order to help user ask questions about their environment.
Demo for Binding Text, Images, Graphs, and Audio for Music Representation Learning
Code and Models for Binding Text, Images, Graphs, and Audio for Music Representation Learning
Code and models for the ICML 2024 paper "Tell, Don`t Show!: Language Guidance Eases Transfer Across Domains in Images and Videos"
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Add a description, image, and links to the multimodal-deep-learning topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-deep-learning topic, visit your repo's landing page and select "manage topics."