System Info
Who can help?
@michaelbenayoun
Information
Tasks
An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)
Reproduction (minimal, reproducible, runnable)
Hi. I tried to convert gemma-2b to ONNX format and then quantize it to 8-bit. However, the quantized model doesn't generate any useful text, just random characters. I'm not sure what's causing this.
The procedure is as follows:
Use this command to convert gemma-2b to ONNX, without KV cache:
optimum-cli export onnx -m ./gemma-2b --task text-generation --opset 14 --device cpu --trust-remote-code --legacy gemma-2b_onnx_without_past
Quantize the ONNX model:
However, in the end the printed output is:
Introduce yourself.. Kids to to to to to loo7777zanie to certitudeBariumBariumToDecimal]]] import.
11ormick de de unintelligiblemiyormiyormiyor Islas of of of of of of of of Animal bourgorm ! XXIV metamor metamorToUpperToUpper CARRAYDOCX
Expected behavior
Here is what I got from the gemma-2b ONNX model (not quantized):
Introduce yourself.
I’m a 20-year-old student from the Netherlands. I’m currently studying at the University of Amsterdam. I’m a student of the Faculty of Social Sciences, and I’m studying International Relations.
What is your current job?
I’m a student.
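To make the 8-bit step more concrete, here is a minimal pure-Python sketch of symmetric per-tensor int8 quantization. This is an illustration only: it is not the script used in this report, it is not how optimum or onnxruntime implement quantization internally, and the weight values are made up. It just shows one way 8-bit rounding can destroy information: a single large outlier stretches the scale so far that all the small weights round to zero. Whether anything like this is happening with gemma-2b is an unconfirmed guess.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: one scale shared by all values."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize(q, scale):
    """Map the integer codes back to floats."""
    return [x * scale for x in q]


def max_abs_error(values):
    """Largest absolute difference after a quantize/dequantize round trip."""
    q, scale = quantize_int8(values)
    restored = dequantize(q, scale)
    return max(abs(a - b) for a, b in zip(values, restored))


# Hypothetical weights, chosen only for illustration.
well_behaved = [0.02, -0.05, 0.11, -0.08, 0.04]
with_outlier = well_behaved + [40.0]

print("codes without outlier:", quantize_int8(well_behaved)[0])
print("codes with outlier:   ", quantize_int8(with_outlier)[0])
print("max error without outlier:", max_abs_error(well_behaved))
print("max error with outlier:   ", max_abs_error(with_outlier))
```

In the second case the five small weights all get the integer code 0, so they are lost entirely after dequantization. Comparing per-tensor against per-channel quantization, or excluding sensitive nodes from quantization, would be one way to test whether this kind of effect is involved.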