Pull requests: AutoGPTQ/AutoGPTQ
Pull requests list
[BUG/DEPRECATION] Remove fused attention/mlp
#659
opened Apr 28, 2024 by Qubitium
4 tasks done
[USABILITY] Warn users if quantization using insufficient nsamples
#656
opened Apr 28, 2024 by Qubitium
[FEATURE] Allow loading of quantized lm_head
#648
opened Apr 25, 2024 by Qubitium
1 task done
[PERFORMANCE] Fix Packing thread regression in code
#642
opened Apr 16, 2024 by Qubitium
2 tasks done
[BUG/FEATURE] Fix Sym=False, new checkpoint_format = gptq_v2
#640
opened Apr 12, 2024 by Qubitium
29 tasks done
[Minor] peft bug fix: HF peft version and tokenizer path in peft scripts
#493
opened Dec 24, 2023 by realAsma
Allow specifying GPU used for quantisation, overriding hardcoded cuda:0
#405
opened Nov 5, 2023 by TheBloke