Neural Magic
Neural Magic helps developers in accelerating machine learning performance using automated model sparsification techniques and inference technologies.
Pinned
Repositories
Showing 10 of 39 repositories
- compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
-
-
-
- alpaca_eval Public Forked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
- nm-AutoGPTQ Public Forked from AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.