Learning Efficient Convolutional Networks through Network Slimming, In ICCV 2017.
Updated May 13, 2019 - Python
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
Fast Forward-Only Deep Neural Network Library for the Nao Robots
Implementation of the paper "Fast Inference from Transformers via Speculative Decoding", Leviathan et al. 2023.
Verification of the effect of speculative decoding in Japanese.