Implement transformers

Framework / Tool	Source code
pytorch	pytorch
tensorflow	tf

Inference ONNX Model with ONNX Runtime

link refer

speed up 2x
For CPU, optimized graph is slightly different: FastGelu is replaced by BiasGelu.
Note that ONNX Runtime is compatible with Python versions 3.5 to 3.7.

What is ONNX Runtime? (vnese)

link refer

tackled optimizing một model cho các môi trường (cloud GPU, desktop CPU,..) tốn nhiều thời gian

Export the loaded model

python convert_onnx.py

Inference ONNX Model across multiple platforms

python bert_onnxruntime.py

Offline optimization

sometime OnnxRuntime cannot be fully optimized:
- new subgraph generated by new export tool and not covered by older version of OnnxRuntime
- exported model uses dynamic axis, make harder for shape inference
- some optimization is better to done offline. Like change input tensor type from float32 to float16 avoid Cast nodes to achieve better performance in V100 and T4 GPU

python experiment.py

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
phobert_onnx_convert		phobert_onnx_convert
transformer_pytorch		transformer_pytorch
transformer_tf_translation		transformer_tf_translation
README.md		README.md
bert_onnxruntime.py		bert_onnxruntime.py
convert_onnx.py		convert_onnx.py
gpt2_export.py		gpt2_export.py
gpt2_generator.py		gpt2_generator.py
optimize_offline.py		optimize_offline.py
requirements.txt		requirements.txt
utils.py		utils.py
verify_onnx_optimize.py		verify_onnx_optimize.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

phobert_onnx_convert

phobert_onnx_convert

transformer_pytorch

transformer_pytorch

transformer_tf_translation

transformer_tf_translation

README.md

README.md

bert_onnxruntime.py

bert_onnxruntime.py

convert_onnx.py

convert_onnx.py

gpt2_export.py

gpt2_export.py

gpt2_generator.py

gpt2_generator.py

optimize_offline.py

optimize_offline.py

requirements.txt

requirements.txt

utils.py

utils.py

verify_onnx_optimize.py

verify_onnx_optimize.py

visualize.py

visualize.py

Repository files navigation

Implement transformers

Inference ONNX Model with ONNX Runtime

What is ONNX Runtime? (vnese)

Export the loaded model

Inference ONNX Model across multiple platforms

Offline optimization

About

Releases

Packages

Languages

BinhMinhs10/transformers_onnx

Folders and files

Latest commit

History

Repository files navigation

Implement transformers

Inference ONNX Model with ONNX Runtime

What is ONNX Runtime? (vnese)

Export the loaded model

Inference ONNX Model across multiple platforms

Offline optimization

About

Topics

Resources

Stars

Watchers

Forks

Languages