Skip to content

Pull requests: neuralmagic/nm-vllm

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

updates for automation (and release)
#265 opened May 23, 2024 by andy-neuma Loading…
update install commands
#264 opened May 23, 2024 by dhuangnm Loading…
Fix quantization rounding
#263 opened May 22, 2024 by varun-sundar-rabindranath Loading…
Update Dockerfile for the release
#262 opened May 22, 2024 by dhuangnm Loading…
Address py38/39 incompatibilities
#261 opened May 22, 2024 by dbarbuzzi Loading…
Upstream sync 2024 05 19
#249 opened May 19, 2024 by robertgshaw2-neuralmagic Loading…
Reasonable int8 configs
#238 opened May 13, 2024 by varun-sundar-rabindranath Loading…
Skipping refactor
#234 opened May 13, 2024 by robertgshaw2-neuralmagic Loading…
Create test_optional_libraries.py
#230 opened May 9, 2024 by mgoin Loading…
[Activation Quantization] Dynamic Per Token Support
#225 opened May 6, 2024 by dsikka Loading…
[WIP] FLAN-T5 integration
#194 opened Apr 17, 2024 by afeldman-nm Loading…
WIP: basic correctness test
#192 opened Apr 17, 2024 by derekk-nm Draft
whl centric
#191 opened Apr 17, 2024 by andy-neuma Loading…
Added Docker Compose Example
#182 opened Apr 12, 2024 by robertgshaw2-neuralmagic Loading…
vllm - quantization : DO NOT MERGE
#180 opened Apr 11, 2024 by varun-sundar-rabindranath Loading…
Pypi and updates
#177 opened Apr 9, 2024 by andy-neuma Loading…
Support for compressed-tensors
#159 opened Apr 2, 2024 by dbogunowicz Loading…
ProTip! Add no:assignee to see everything that’s not assigned.