v3: custom pipelines #747

Th3G33k · 2024-05-09T14:05:58Z

Add get_text_embeddings and get_similarities methods to ImageFeatureExtractionPipeline

xenova · 2024-05-09T14:10:54Z

Thanks! Are these methods present in the python version of the image feature extraction pipeline? If so, can you please link to the original documentation? If not, please note that we're trying to match the python API as closely as possible, and may make it difficult to merge the PR.

Th3G33k · 2024-05-09T14:49:48Z

I don't think there in an equivalent in Python transformers.

transformers.js/src/models.js

Lines 5816 to 5821 in 880cd3e

    
           // NOTE: This is custom to Transformers.js, and is necessary because certain models 
        
           // (e.g., CLIP) are split into vision and text components 
        
           const MODEL_FOR_IMAGE_FEATURE_EXTRACTION_MAPPING_NAMES = new Map([ 
        
               ['clip', ['CLIPVisionModelWithProjection', CLIPVisionModelWithProjection]], 
        
               ['siglip', ['SiglipVisionModel', SiglipVisionModel]], 
        
           ])

However, I think it would be really handful to have those fonctions directly into the library transformers.js image feature extraction pipeline.

It's used in hugging face spaces and in examples model code

Th3G33k · 2024-05-10T20:26:23Z

To not differ from the Python library, I will add the custom pipeline separetely, with register_pipeline. #684

* Add RawAudio class and 'save to wav' * Apply suggestions from code review Co-authored-by: Joshua Lochner <admin@xenova.com> * RawAudio toBlob() + save() rewrite * Add saveBlob in utils/core.js * RawAudio : Add support 2 channels + interleave * Fix * Fix * simplify type check * RawAudio : improve interleave + change env -> apis * image.js: change env -> apis * env.js remove changes --------- Co-authored-by: Joshua Lochner <admin@xenova.com>

* Add custom task `register_pipeline` * Change model_name in register_pipeline * add custom tasks to SUPPORTED_TASKS * models.js getModelClassFromName + refactor model mapping * models.js fix model mapping * Beautify code * Allow updating existing supported_tasks

* v3: wasmPaths relative path * Onnx InferenceSession logLevel * remove logLevel * set logLevel * Beautify code

* Add `ignore_merges` option to BPE tokenizers (xenova#716) * [version] Update to 2.17.1 * Use ungated version of mistral tokenizer (xenova#718) * Add mobilevitv2 (xenova#721) * Add support for MobileViTV2 * Update supported_models.py * Add support for `do_flip_channel_order` * Add unit test for `do_flip_channel_order=true` * docs: update vanilla-js.md (xenova#738) minor fix * Support reading data from blob URI (xenova#645) * Make blob as valid URL * Create function to detect the blob URI * Change to `isValidUrl` * Remove comment Co-authored-by: Joshua Lochner <admin@xenova.com> * Merge `isValidHttpUrl` into `isValidUrl` * Correct implement * Update docs * Add test * Remove export for `isValidUrl` * Test read blob via `getFile` * Use `res.text()` instead `res.body` --------- Co-authored-by: Joshua Lochner <admin@xenova.com> * Add aggregation_strategy + start end tokens * Beautify code * tokenizers return_offsets_mapping * QuestionAnswering start end char --------- Co-authored-by: Joshua Lochner <admin@xenova.com> Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com> Co-authored-by: Hans <me@hans00.me>

Th3G33k added 2 commits May 9, 2024 03:54

Improve image-feature-extraction

c4907ef

add semicolon

770ea95

Beautify code

aa979fd

Th3G33k and others added 5 commits May 10, 2024 23:48

Merge V3 fix wasm relpath -> v3 fork build (#4)

1f9ed4f

* v3: wasmPaths relative path * Onnx InferenceSession logLevel * remove logLevel * set logLevel * Beautify code

Merge v3testing -> v3 fork build (#5)

129709f

Create custom-pipelines.js

d019f7b

Th3G33k changed the title ~~v3: Improve image-feature-extraction~~ v3: custom pipelines May 11, 2024

Th3G33k and others added 2 commits May 11, 2024 07:10

Merge branch 'v3-fork-build' into v3-image-feature-extraction

63ee5a0

Th3G33k closed this May 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v3: custom pipelines #747

v3: custom pipelines #747

Th3G33k commented May 9, 2024

xenova commented May 9, 2024

Th3G33k commented May 9, 2024

Th3G33k commented May 10, 2024

v3: custom pipelines #747

v3: custom pipelines #747

Conversation

Th3G33k commented May 9, 2024

xenova commented May 9, 2024

Th3G33k commented May 9, 2024

Th3G33k commented May 10, 2024