Raster graphics package for Fōrmulæ, in JavaScript
-
Updated
May 31, 2024 - JavaScript
Raster graphics package for Fōrmulæ, in JavaScript
Data release for the ImageInWords (IIW) paper.
Image Captioning With MobileNet-LLaMA 3
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
11000-Image-Video-caption-data-of-human-action
20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes
10000-Image-caption-data-of-gestures
10100-Image-caption-data-of-human-face
10000-Image-caption-data-of-vehicles
10000-Image-caption-data-of-diverse-scenes
Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Download flickr8k, flickr30k image caption datasets
lmmtoolkit is a toolkit for Multi-Modal Learning
The first public Vietnamese visual linguistic foundation model(s)
Character Recognition system using CNN and Streamlit
The largest multilingual image-text classification dataset. It contains fashion products.
MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation
FCLL: A Fine-grained Contrastive Language-Image Learning Model
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Add a description, image, and links to the image-text topic page so that developers can more easily learn about it.
To associate your repository with the image-text topic, visit your repo's landing page and select "manage topics."