How to get a QOperator format ONNX model after quantization? #2825

Open · JiliangNi opened this issue Mar 18, 2024 · 3 comments

JiliangNi commented Mar 18, 2024

How to get a QOperator format ONNX model after quantization?

e-said commented Mar 18, 2024

Hi @JiliangNi

I'm not aware of QOperator support in AIMET. However, you can obtain the QDQ format in your ONNX model by setting use_embedded_encodings=True in AIMET's ONNX export feature. If you're unfamiliar with the QDQ format, you can find more information in this link.

PS: Please note that AIMET's QDQ format is supported only for int8 quantization (W8A8), due to a limitation of the ONNX opset available with the Torch version (1.13) that AIMET uses.
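
A minimal sketch of that export path, assuming aimet_torch's QuantizationSimModel API. The toy model, shapes, and output paths below are placeholders, and the use_embedded_encodings argument may only be available in recent AIMET versions:

```python
import torch
from aimet_torch.quantsim import QuantizationSimModel

# A toy model stands in for a real trained network (placeholder for illustration).
model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

# W8A8: 8-bit parameters and 8-bit activations, per the limitation noted above.
sim = QuantizationSimModel(model, dummy_input=dummy_input,
                           default_param_bw=8, default_output_bw=8)

# Calibrate quantizer encodings with a representative forward pass
# (random data here; use real calibration data in practice).
sim.compute_encodings(lambda m, _: m(dummy_input), None)

# use_embedded_encodings=True embeds the quantization parameters as
# QuantizeLinear/DequantizeLinear (QDQ) nodes in the exported ONNX graph.
sim.export(path='./export', filename_prefix='model_qdq',
           dummy_input=dummy_input, use_embedded_encodings=True)
```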

JiliangNi (Author) commented

So AIMET does not support the QOperator format in ONNX, right?

Is there any method that could convert QDQ to QOperator in ONNX?

quic-mangal (Contributor) commented

@JiliangNi, we don't have support for that conversion currently.
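
If a QOperator-format model is the hard requirement, one possible workaround outside AIMET is ONNX Runtime's post-training quantizer, which can emit QOperator models directly from a float ONNX model via quant_format=QuantFormat.QOperator. Note this re-quantizes from scratch, so AIMET's calibrated encodings are not reused; a hedged sketch with placeholder file names and shapes:

```python
import numpy as np
from onnxruntime.quantization import (CalibrationDataReader, QuantFormat,
                                      QuantType, quantize_static)

class RandomCalibrationReader(CalibrationDataReader):
    """Feeds a few random batches for calibration; replace with real data."""
    def __init__(self, input_name='input', n_batches=8):
        self._data = iter(
            [{input_name: np.random.rand(1, 3, 32, 32).astype(np.float32)}
             for _ in range(n_batches)]
        )

    def get_next(self):
        return next(self._data, None)

quantize_static(
    model_input='model_fp32.onnx',        # float model, not the AIMET QDQ export
    model_output='model_qoperator.onnx',
    calibration_data_reader=RandomCalibrationReader(),
    quant_format=QuantFormat.QOperator,   # emit QLinearConv/QLinearMatMul etc.
    activation_type=QuantType.QUInt8,
    weight_type=QuantType.QInt8,
)
```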
