image-text

Here are 37 public repositories matching this topic...

formulae-org / package-graphic-raster-js

Raster graphics package for Fōrmulæ, in JavaScript

javascript formulae graphics graphics-programming turtle-graphics rotating image-transformations image-colors image-text raster-graphics image-coordinates graphic-primitives stroke-imaging xor-mode

Updated May 31, 2024
JavaScript

google / imageinwords

Star

Data release for the ImageInWords (IIW) paper.

evaluation dataset image-captioning dataset-generation image-to-text image-descriptions image-text human-annotation t2i i2t detailed-descriptions detailed-annotations

Updated May 25, 2024
JavaScript

reshalfahsi / image-captioning-mobilenet-llama3

Star

Image Captioning With MobileNet-LLaMA 3

nlp cnn pytorch transformer image-captioning image-text flickr8k-dataset mobilenetv3 pytorch-lightning kv-cache rotary-position-embedding grouped-query-attention rms-norm llama3

Updated May 5, 2024
Jupyter Notebook

DarkKnightSgh / Text-Image-Text

Star

Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.

python information-retrieval transformers image-text flickr8k-dataset text-image streamlit semantic-embedding huggingface-transformers

Updated Apr 27, 2024
Python

Nexdata-AI / 11000-Image-Video-caption-data-of-human-action

Star

11000-Image-Video-caption-data-of-human-action

computer-vision human-action-recognition image-text text-image caption-data aigc generative-ai

Updated Apr 18, 2024

Nexdata-AI / 20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

Star

20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

ocr natural-scenes image-text text-image caption-data generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10000-Image-caption-data-of-gestures

Star

10000-Image-caption-data-of-gestures

gesture-recognition asian image-text caption-data generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10100-Image-caption-data-of-human-face

Star

10100-Image-caption-data-of-human-face

image-recognition image-text caption-data human-face-recognition generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10000-Image-caption-data-of-vehicles

Star

10000-Image-caption-data-of-vehicles

image-recognition vehicle-detection image-text caption-data generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10000-Image-caption-data-of-diverse-scenes

Star

10000-Image-caption-data-of-diverse-scenes

image-recognition scene-recognition image-text caption-data generative-ai

Updated Apr 18, 2024

antonlukin / poster-editor

Star

Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.

php composer php-library image-processing php-gd intervention php-image image-text php-class poster-editor

Updated Mar 31, 2024
PHP

miccunifi / QualiCLIP

Star

Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

computer-vision deep-learning image-processing image-quality clip iqa image-text image-quality-assessment blind-image-quality-assessment low-level-vision image-degradation self-supervised-learning ranking-loss biqa vision-language nr-iqa no-reference-image-quality-assessment opinion-unaware opinion-unaware-nr-iqa

Updated Mar 19, 2024

awsaf49 / flickr-dataset

Star

Download flickr8k, flickr30k image caption datasets

image flickr dataset clip captioning-images image-text flickr8k flickr30k siglip

Updated Feb 6, 2024

jianzhnie / MultimodalTransformers

Star

lmmtoolkit is a toolkit for Multi-Modal Learning

image-text text-image multi-modal-learning text-to-video

Updated Nov 21, 2023
Python

dinhanhx / VisualRoBERTa

Sponsor

Star

The first public Vietnamese visual linguistic foundation model(s)

python python3 image-captioning python-3 vietnamese-nlp visual-question-answering image-text visual-linguistic

Updated Oct 29, 2023
Python

AkshayBura / Character-Recognition

Star

Character Recognition system using CNN and Streamlit

python deep-neural-networks tensorflow image-processing cnn preprocessing image-text streamlit recognizing-characters

Updated Aug 22, 2023
Jupyter Notebook

glami / glami-1m

Star

The largest multilingual image-text classification dataset. It contains fashion products.

multilingual natural-language-processing computer-vision deep-learning fashion text-classification dataset classification image-classification image-to-text image-text multimodal text-to-image-generation multi-modal-deep-learning image-text-classification multilingual-image-text-classification

Updated Jun 8, 2023
Jupyter Notebook

CharlesYang030 / MTA

Star

MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation

multilingual image-text multimodal language-vision visualwsd

Updated May 31, 2023
Jupyter Notebook

CharlesYang030 / FCLL

Star

FCLL: A Fine-grained Contrastive Language-Image Learning Model

multilingual pytorch fine-grained image-text multimodal language-vision contrastive-learning visualwsd sense-autocomplement

Updated May 31, 2023
Jupyter Notebook

X-PLUG / mPLUG

Star

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

pytorch transformer vqa image-captioning visual-language image-text multimodal pretraining image-text-retrieval

Updated May 8, 2023
Python

Improve this page

Add a description, image, and links to the image-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the image-text topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image-text

Here are 37 public repositories matching this topic...

formulae-org / package-graphic-raster-js

google / imageinwords

reshalfahsi / image-captioning-mobilenet-llama3

DarkKnightSgh / Text-Image-Text

Nexdata-AI / 11000-Image-Video-caption-data-of-human-action

Nexdata-AI / 20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

Nexdata-AI / 10000-Image-caption-data-of-gestures

Nexdata-AI / 10100-Image-caption-data-of-human-face

Nexdata-AI / 10000-Image-caption-data-of-vehicles

Nexdata-AI / 10000-Image-caption-data-of-diverse-scenes

antonlukin / poster-editor

miccunifi / QualiCLIP

awsaf49 / flickr-dataset

jianzhnie / MultimodalTransformers

dinhanhx / VisualRoBERTa

AkshayBura / Character-Recognition

glami / glami-1m

CharlesYang030 / MTA

CharlesYang030 / FCLL

X-PLUG / mPLUG

Improve this page

Add this topic to your repo