Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Bugfix] Fix CLI arguments in OpenAI server docs
#4729 opened May 10, 2024 by AllenDou Loading…
[Core]fix type annotation for swap_blocks
#4726 opened May 10, 2024 by jikunshang Loading…
[Speculative decoding] Improve n-gram efficiency
#4724 opened May 9, 2024 by comaniac Loading…
[Misc] Added devcontainer to help vscode dev setup
#4720 opened May 9, 2024 by ElefHead Loading…
[Misc] Apply a couple g++ cleanups
#4719 opened May 9, 2024 by stevegrubb Loading…
[CORE] Improvement in ranks code
#4718 opened May 9, 2024 by SwapnilDreams100 Loading…
add TypeLogitsProcessor
#4712 opened May 9, 2024 by eitanturok Draft
Remove Ray health check
#4693 opened May 8, 2024 by Yard1 Loading…
[Core] Implement sharded state loader
#4690 opened May 8, 2024 by aurickq Loading…
[Misc] Add OpenTelemetry support
#4687 opened May 8, 2024 by ronensc Loading…
Support Deepseek-V2 new model Requests to new models
#4650 opened May 7, 2024 by zwd003 Loading…
ProTip! Updated in the last three days: updated:>2024-05-07.