Qualization library based on 1R
-
Updated
Apr 13, 2020 - Rust
Qualization library based on 1R
Pytorch Model Quantization, Layer Fusion and Optimization
Project assignment for course Introduction to Telecommunications at ECE NTUA
Uniform quantizer that uses mexCallMATLAB to call different MATLAB commands and plot the results
Quantizing LLMs using GPTQ
Models made for Edge Devices and NN Optimizations
Matlab code for "Tree-structured quantization on Grassmann and Stiefel manifold", S. Schwarz et al., DCC 2021
DynamicQuantization_Bert from pytorch tutorials
ZeQLoRA: Efficient Finetuning of Quantized LLMs with ZeRO and LoRA
A toy example of OCTAV algorithm for finding the optimal clipping scalar in the quantization error problem
Regularized Classification-Aware Quantization
A compilation of various ML and DL models and ways to optimize the their inferences.
Optimized CPU Implementation of Llama2-LLM
Efficient Inference techniques implemented in PyTorch for computer vision.
Add a description, image, and links to the quantization topic page so that developers can more easily learn about it.
To associate your repository with the quantization topic, visit your repo's landing page and select "manage topics."