Skip to content

v0.7.1: patch release

Latest
Compare
Choose a tag to compare
@fxmarty fxmarty released this 01 Mar 13:14
· 21 commits to main since this release

Support loading sharded quantized checkpoints

Sharded checkpoints can now be loaded in the from_quantized method.

  • Support loading sharded quantized checkpoints. by @LaaZa in #425

Gemma GPTQ quantization

Gemma model can be quantized with AutoGPTQ.

Other changes and fixes

Full Changelog: v0.7.0...v0.7.1