
quantsim.export(path, filename_prefix) could not generate int8 QNN ONNX model #2773

Open
JiliangNi opened this issue Feb 26, 2024 · 2 comments

Comments

@JiliangNi

After calling quantsim.export(path, filename_prefix), I could not get an int8 QNN ONNX model. My objective is to obtain an int8 ONNX model through the AIMET quantization toolkit, like the one shown in the attached image below.
[Image: int8_ONNX_model]

However, calling quantsim.export(path, filename_prefix) only gives me .pth files, encoding files, and one FP32 ONNX model. Did I use the export functionality incorrectly? Or is there a way to convert the encoding files and the FP32 ONNX model into a single int8 QNN model?
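
For context, here is a minimal sketch of the workflow I am following (the toy model, shapes, file names, and paths are placeholders, and the exact constructor arguments may differ across AIMET versions):

```python
import os
import torch
from aimet_torch.quantsim import QuantizationSimModel

# Placeholder model and input; substitute your own network and shapes
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 8, kernel_size=3, padding=1),
    torch.nn.ReLU(),
).eval()
dummy_input = torch.randn(1, 3, 224, 224)

# 8-bit simulation for both parameters and activations
sim = QuantizationSimModel(model, dummy_input=dummy_input,
                           default_param_bw=8, default_output_bw=8)

# Calibration forward pass so AIMET can compute quantization encodings
def forward_pass(m, _):
    with torch.no_grad():
        m(dummy_input)

sim.compute_encodings(forward_pass, forward_pass_callback_args=None)

# Export produces an FP32 ONNX graph plus .encodings files (and a .pth);
# the int8 model itself is built later by the target toolchain
os.makedirs("./export", exist_ok=True)
sim.export(path="./export", filename_prefix="my_model", dummy_input=dummy_input)
```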

@quic-mangal
Contributor

You used it correctly. You can take the encodings and the FP32 model to a quantized target to get a quantized model; AIMET only simulates hardware performance.

@quic-akinlawo
Contributor

@JiliangNi please use the --keep_quant_nodes option with the QNN converters to get a QNN model with activation quant/dequant nodes. Without this option, the quant nodes are stripped from the graph.
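
In case it helps, a rough sketch of what such an invocation could look like, driven from Python. The file names are placeholders, and every flag except --keep_quant_nodes is an assumption to verify against your QNN SDK version's converter documentation:

```python
import subprocess

# Run the QNN ONNX converter on the FP32 model exported by quantsim.export,
# feeding it the AIMET encodings and keeping the quant/dequant nodes.
subprocess.run(
    [
        "qnn-onnx-converter",
        "--input_network", "export/my_model.onnx",                # FP32 ONNX from quantsim.export
        "--quantization_overrides", "export/my_model.encodings",  # AIMET encodings (assumed flag)
        "--keep_quant_nodes",                                     # keep activation quant/dequant nodes
        "--output_path", "export/my_model_qnn.cpp",               # converter output (assumed flag)
    ],
    check=True,
)
```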
