Skip to content

Pull requests: microsoft/DeepSpeed-MII

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Reuse KV cache of prefixes
#484 opened May 27, 2024 by tohtana Draft
Enable streaming option in the OpenAI API server
#480 opened May 16, 2024 by adk9 Loading…
Add Kubernetes health check route to REST server
#445 opened Mar 20, 2024 by richiejp Loading…
Update model support
#429 opened Mar 5, 2024 by mrwyattii Loading…
Pydantic v2 migration
#423 opened Feb 27, 2024 by mrwyattii Draft
add stable diffusion CI workflow
#412 opened Feb 14, 2024 by mrwyattii Loading…
use deploy_rank to allocate gpus
#234 opened Sep 14, 2023 by tulika612 Loading…
Multi model refactor
#223 opened Aug 9, 2023 by TosinSeg Draft
Lazy loading
#220 opened Aug 7, 2023 by TosinSeg Draft
Add Pydantic v2 support
#213 opened Jul 25, 2023 by mrwyattii Draft
Multi model deployment
#208 opened Jun 27, 2023 by TosinSeg Draft
Fix for Stable Diffusion deployments
#172 opened Apr 25, 2023 by mrwyattii Loading…
Non-blocking client API
#158 opened Mar 12, 2023 by tohtana Loading…
Add AML local deployment type
#143 opened Feb 1, 2023 by mrwyattii Loading…
1 of 3 tasks
add ds inject policies
#46 opened Aug 2, 2022 by jeffra Draft
ProTip! Add no:assignee to see everything that’s not assigned.