#

image-text

Here are 37 public repositories matching this topic...

Nexdata-AI / 11000-Image-Video-caption-data-of-human-action

11000-Image-Video-caption-data-of-human-action

computer-vision human-action-recognition image-text text-image caption-data aigc generative-ai

Updated Apr 18, 2024

ppraneeth270 / img2text

textrecognition image-text image2text

Updated May 23, 2021
Python

AkshayBura / Character-Recognition

Character Recognition system using CNN and Streamlit

python deep-neural-networks tensorflow image-processing cnn preprocessing image-text streamlit recognizing-characters

Updated Aug 22, 2023
Jupyter Notebook

yomnaFathy / Text-Detection-and-Recognition

opencv ocr computer-vision deep-learning text-recognition transfer-learning pretrained-models text-detection pytesseract east image-text text-detection-recognition

Updated Oct 20, 2020
Python

Nexdata-AI / 20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

ocr natural-scenes image-text text-image caption-data generative-ai

Updated Apr 18, 2024

ask0ne / ocrator

Scan text from an image and convert into speech/audio of desired language.

natural-language-processing text-to-speech image-recognition pytesseract image-text

Updated Dec 8, 2022
Python

Nexdata-AI / 10100-Image-caption-data-of-human-face

10100-Image-caption-data-of-human-face

image-recognition image-text caption-data human-face-recognition generative-ai

Updated Apr 18, 2024

DarkKnightSgh / Text-Image-Text

Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.

python information-retrieval transformers image-text flickr8k-dataset text-image streamlit semantic-embedding huggingface-transformers

Updated Apr 27, 2024
Python

makefile / text_extraction

Windows version of text_extraction(VS2013). This code is the implementation of the method proposed in the paper “Multi-script text extraction from natural scenes” (Gomez & Karatzas) to appear in ICDAR2013 conference.

Updated Aug 19, 2017
C++

Nexdata-AI / 10000-Image-caption-data-of-gestures

10000-Image-caption-data-of-gestures

gesture-recognition asian image-text caption-data generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10000-Image-caption-data-of-vehicles

10000-Image-caption-data-of-vehicles

image-recognition vehicle-detection image-text caption-data generative-ai

Updated Apr 18, 2024

xiongshufeng / MTFN-RR-PyTorch-Code

The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral

fusion image-text

Updated Sep 28, 2019
Python

Nexdata-AI / 10000-Image-caption-data-of-diverse-scenes

10000-Image-caption-data-of-diverse-scenes

image-recognition scene-recognition image-text caption-data generative-ai

Updated Apr 18, 2024

dinhanhx / VL-datasets

Some Python scripts to load Vietnamese visual linguistic data

python vietnamese python3 image-captioning python-3 vietnamese-nlp visual-question-answering image-text visual-linguistic

Updated Aug 23, 2022
Python

reshalfahsi / image-captioning-mobilenet-llama3

Image Captioning With MobileNet-LLaMA 3

nlp cnn pytorch transformer image-captioning image-text flickr8k-dataset mobilenetv3 pytorch-lightning kv-cache rotary-position-embedding grouped-query-attention rms-norm llama3

Updated May 5, 2024
Jupyter Notebook

CharlesYang030 / MTA

MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation

multilingual image-text multimodal language-vision visualwsd

Updated May 31, 2023
Jupyter Notebook

jianzhnie / MultimodalTransformers

lmmtoolkit is a toolkit for Multi-Modal Learning

image-text text-image multi-modal-learning text-to-video

Updated Nov 21, 2023
Python

formulae-org / package-graphic-raster-js

Raster graphics package for Fōrmulæ, in JavaScript

javascript formulae graphics graphics-programming turtle-graphics rotating image-transformations image-colors image-text raster-graphics image-coordinates graphic-primitives stroke-imaging xor-mode

Updated May 31, 2024
JavaScript

CharlesYang030 / FCLL

FCLL: A Fine-grained Contrastive Language-Image Learning Model

multilingual pytorch fine-grained image-text multimodal language-vision contrastive-learning visualwsd sense-autocomplement

Updated May 31, 2023
Jupyter Notebook

dinhanhx / VisualRoBERTa

The first public Vietnamese visual linguistic foundation model(s)

python python3 image-captioning python-3 vietnamese-nlp visual-question-answering image-text visual-linguistic

Updated Oct 29, 2023
Python

Improve this page

Add a description, image, and links to the image-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the image-text topic, visit your repo's landing page and select "manage topics."