Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
run_predict.py		run_predict.py

README.md

ImageBind

1. 模型简介

Paddle implementation of ImageBind.

To appear at CVPR 2023 (Highlighted paper)

ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.

2. Demo

example: Extract and compare features across modalities (e.g. Image, Text and Audio).

cd paddlemix/examples/imagebind/

python run_predict.py \
--model_name_or_path imagebind-1.2b/ \
--input_text "A dog." \
--input_image https://paddlenlp.bj.bcebos.com/models/community/paddlemix/audio-files/dog_image.jpg \
--input_audio https://paddlenlp.bj.bcebos.com/models/community/paddlemix/audio-files/wave.wav \

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imagebind

imagebind

README.md

README.md

run_predict.py

run_predict.py

README.md

ImageBind

1. 模型简介

2. Demo

Files

imagebind

Directory actions

More options

Directory actions

More options

Latest commit

History

imagebind

Folders and files

parent directory

README.md

README.md

run_predict.py

run_predict.py

README.md

ImageBind

1. 模型简介

2. Demo