
Experimental StableLM support #1719

Draft · kazssym wants to merge 15 commits into main

Conversation


@kazssym commented Feb 25, 2024

What does this PR do?

This pull request adds experimental support for the StableLM model. It is still incomplete.

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

This commit introduces two improvements:

 1. Support StableLM-Epoch for ONNX export:

    - A new class StableLMEpochOnnxConfig is added in optimum/exporters/onnx/model_configs.py to define the configuration for exporting StableLM-Epoch models to ONNX format.
    - This aligns with existing configs for other supported models.

 2. Enable StableLM-Epoch tasks in optimum/exporters/tasks.py:

    - The supported_tasks_mapping function now includes "stablelm-epoch" with supported tasks like "text-generation" and "text-classification".
    - This allows exporting StableLM-Epoch models for various tasks using the appropriate ONNX configuration.

These changes expand the capabilities of optimum by enabling ONNX export and task handling for StableLM-Epoch models.
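The two steps above can be sketched in miniature. Everything in this snippet (the class body, the registry dict, the axis names) is an illustrative stand-in for the real definitions in model_configs.py and tasks.py, not optimum's actual code:

```python
# Illustrative sketch only: mirrors the shape of an exporter ONNX config
# plus a task registration; names and bodies are simplified stand-ins.

class StableLMEpochOnnxConfigSketch:
    """Minimal stand-in for a decoder-style ONNX export config."""

    DEFAULT_ONNX_OPSET = 13  # matches the opset chosen later in this PR

    def __init__(self, task):
        self.task = task

    @property
    def inputs(self):
        # Decoder exports typically declare dynamic batch/sequence axes.
        return {
            "input_ids": {0: "batch_size", 1: "sequence_length"},
            "attention_mask": {0: "batch_size", 1: "sequence_length"},
        }


# Registry in the spirit of supported_tasks_mapping in tasks.py:
SUPPORTED_TASKS = {
    "stablelm-epoch": {
        "text-generation": StableLMEpochOnnxConfigSketch,
        "text-classification": StableLMEpochOnnxConfigSketch,
    },
}

config_cls = SUPPORTED_TASKS["stablelm-epoch"]["text-generation"]
print(sorted(config_cls("text-generation").inputs))
# ['attention_mask', 'input_ids']
```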
This commit adds NormalizedTextConfig as the normalized configuration class for StableLM-Epoch models in optimum/utils/normalized_config.py.

This ensures consistency with other supported models and improves the overall compatibility and functionality of optimum when working with StableLM-Epoch models.

This commit updates MODEL_TYPES_REQUIRING_POSITION_IDS in optimum/exporters/onnx/utils.py to include "stablelm-epoch".

This ensures that position IDs are properly handled during ONNX export for StableLM-Epoch models, aligning with other models requiring this information.
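A minimal sketch of what such a set-membership check looks like at export time. The helper function and the other model types listed are assumptions for illustration; only the "stablelm-epoch" entry comes from the commit above:

```python
# Sketch of the MODEL_TYPES_REQUIRING_POSITION_IDS idea: a plain set of
# model-type strings consulted when building export inputs. Entries other
# than "stablelm-epoch" are examples, not a claim about optimum's list.
MODEL_TYPES_REQUIRING_POSITION_IDS = {"gpt2", "llama", "stablelm-epoch"}


def export_inputs(model_type):
    """Return the ONNX input names for a model type (hypothetical helper)."""
    inputs = ["input_ids", "attention_mask"]
    if model_type in MODEL_TYPES_REQUIRING_POSITION_IDS:
        inputs.append("position_ids")
    return inputs


print(export_inputs("stablelm-epoch"))
# ['input_ids', 'attention_mask', 'position_ids']
```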

  - Adds StableLmOnnxConfig for exporting StableLM models to ONNX.
  - Updates TasksManager to use StableLmOnnxConfig for "stablelm" task.
  - Updates NormalizedConfigManager to recognize "stablelm".
  - Requires Transformers version >= 4.37.99 for StableLM export.

This commit partially enables exporting StableLM models to ONNX, allowing them to be used in various inference pipelines.
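The version floor can be sketched with a toy comparison. The helper name and the naive dotted-version parser are hypothetical (real code would use a proper version library, and this parser would reject suffixes like ".dev0"); only the ">= 4.37.99" requirement is taken from the bullet above:

```python
# Sketch of gating StableLM export on a minimum Transformers version.

MIN_TRANSFORMERS_VERSION = "4.37.99"  # floor stated in this PR


def _parse(v):
    # Naive parser: handles plain dotted releases only (assumption).
    return tuple(int(part) for part in v.split("."))


def supports_stablelm(installed_version):
    """Hypothetical check: is the installed Transformers recent enough?"""
    return _parse(installed_version) >= _parse(MIN_TRANSFORMERS_VERSION)


print(supports_stablelm("4.38.0"))  # True
print(supports_stablelm("4.36.2"))  # False
```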

  - Replaces "stablelm-epoch" with "stablelm" in MODEL_TYPES_REQUIRING_POSITION_IDS.

This commit reflects the updated model name "stablelm" for consistency with other parts of the codebase.

  - Adds "stablelm": "gpt2" mapping to ORTConfigManager.

This commit specifies that StableLM models should be treated similarly to GPT-2 models during ONNX Runtime inference for consistency and potentially improved optimization.
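A rough sketch of the alias idea. ORTConfigManager itself is not reproduced here; the dict and function names are invented for illustration, and only the "stablelm" → "gpt2" mapping comes from the commit:

```python
# Sketch: route "stablelm" through the GPT-2 optimization settings during
# ONNX Runtime inference, as the commit describes.

ORT_MODEL_TYPE_ALIASES = {
    "gpt2": "gpt2",
    "stablelm": "gpt2",  # from this PR: reuse GPT-2 settings for StableLM
}


def optimization_model_type(model_type):
    """Hypothetical lookup: which optimization profile to apply."""
    return ORT_MODEL_TYPE_ALIASES.get(model_type, model_type)


print(optimization_model_type("stablelm"))  # gpt2
```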
@fxmarty
Collaborator

fxmarty commented Feb 29, 2024

@kazssym let me know if you'd like a review.

@xenova
Contributor

xenova commented Mar 2, 2024

Can confirm that this PR works for transformers.js models 👍

@kazssym Just a reminder: you can add https://huggingface.co/hf-internal-testing/tiny-random-StableLmForCausalLM as a unit test.

This commit sets the default ONNX opset version to 13 for StableLM models exported with `optimum.onnx.export`.

This change ensures compatibility with a wider range of ONNX runtimes that might not support the latest opsets.
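The default-opset behavior might be modeled like this. The function, the override parameter, and the generic fallback of 14 are assumptions for illustration; only the StableLM default of 13 comes from the commit:

```python
# Sketch of a per-model default opset with a caller override.

DEFAULT_OPSETS = {"stablelm": 13}  # default set by this PR


def resolve_opset(model_type, requested=None):
    """Hypothetical resolver: explicit request wins, else per-model default."""
    if requested is not None:
        return requested
    return DEFAULT_OPSETS.get(model_type, 14)  # 14 as a fallback (assumption)


print(resolve_opset("stablelm"))      # 13
print(resolve_opset("stablelm", 17))  # 17
```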
This commit adds the StableLM model `"hf-internal-testing/tiny-random-StableLmForCausalLM"` to the dictionaries of models for testing PyTorch ONNX export:

  * `PYTORCH_EXPORT_MODELS_TINY`
  * `PYTORCH_EXPORT_MODELS_LARGE`

This allows for including StableLM in regression tests to ensure compatibility with the updated ONNX exporter.
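The registration can be sketched using the dictionary names and checkpoint quoted above; entries for other model types are omitted, and the dictionaries here are stand-ins for the ones in the test suite:

```python
# Sketch: register the tiny StableLM checkpoint in both export-test dicts,
# as the commit describes (the same checkpoint is added to each).

PYTORCH_EXPORT_MODELS_TINY = {
    "stablelm": "hf-internal-testing/tiny-random-StableLmForCausalLM",
}
PYTORCH_EXPORT_MODELS_LARGE = {
    "stablelm": "hf-internal-testing/tiny-random-StableLmForCausalLM",
}

for name, repo in PYTORCH_EXPORT_MODELS_TINY.items():
    print(f"{name}: {repo}")
```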