fix falcon dummy input generator #1825

eaidova · 2024-04-23T15:37:33Z

What does this PR do?

during enabling falcon40b export for openvino, I found that this model is not covered by dummy input generator configuration and failed with shape mismatch. This PR fixes that.

I propose these changed into optimum-intel, but the proper place for it here.

Who can review?

ONNX / ONNX Runtime : @fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun

optimum/utils/input_generators.py

Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

HuggingFaceDocBuilderDev · 2024-04-23T16:31:11Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

echarlaix

Thanks for the addition @eaidova

echarlaix · 2024-04-25T12:49:33Z

optimum/utils/input_generators.py

+        if normalized_config.new_decoder_architecture and normalized_config.multi_query:
+            self.num_kv_heads = normalized_config.num_attention_heads
+        elif normalized_config.new_decoder_architecture:
+            self.num_kv_heads = normalized_config.num_kv_heads
+        else:
+            self.num_kv_heads = 1


shouldn't it be :

Suggested change

if normalized_config.new_decoder_architecture and normalized_config.multi_query:

self.num_kv_heads = normalized_config.num_attention_heads

elif normalized_config.new_decoder_architecture:

self.num_kv_heads = normalized_config.num_kv_heads

else:

self.num_kv_heads = 1

if normalized_config.new_decoder_architecture:

self.num_kv_heads = normalized_config.num_attention_heads

else:

self.num_kv_heads = normalized_config.num_kv_heads if not normalized_config.multi_query else 1

?

echarlaix · 2024-04-25T13:03:01Z

optimum/utils/input_generators.py

+        elif normalized_config.new_decoder_architecture:
+            self.num_kv_heads = normalized_config.num_kv_heads
+        else:
+            self.num_kv_heads = 1
        self.head_dim = self.hidden_size // self.num_attention_heads



also shouldn't it be udpated here and here as well?

also why not have this value set in the normalized_config directly and use it in both FalconDummyPastKeyValuesGenerator and ORTFalconForCausalLM @fxmarty ? (removing the need to have this check in both places)

fix falcon dummy input generator

6120452

eaidova mentioned this pull request Apr 23, 2024

fix input generator for falcon40b huggingface/optimum-intel#685

Merged

michaelbenayoun reviewed Apr 23, 2024

View reviewed changes

optimum/utils/input_generators.py Outdated Show resolved Hide resolved

Update optimum/utils/input_generators.py

958eeaf

Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

echarlaix reviewed Apr 25, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix falcon dummy input generator #1825

fix falcon dummy input generator #1825

eaidova commented Apr 23, 2024

HuggingFaceDocBuilderDev commented Apr 23, 2024

echarlaix left a comment

echarlaix Apr 25, 2024

echarlaix Apr 25, 2024 •

edited

echarlaix Apr 25, 2024

fix falcon dummy input generator #1825

Are you sure you want to change the base?

fix falcon dummy input generator #1825

Conversation

eaidova commented Apr 23, 2024

What does this PR do?

Who can review?

HuggingFaceDocBuilderDev commented Apr 23, 2024

echarlaix left a comment

Choose a reason for hiding this comment

echarlaix Apr 25, 2024

Choose a reason for hiding this comment

echarlaix Apr 25, 2024 • edited

Choose a reason for hiding this comment

echarlaix Apr 25, 2024

Choose a reason for hiding this comment

echarlaix Apr 25, 2024 •

edited