Skip to content

Latest commit

 

History

History
19 lines (17 loc) · 700 Bytes

ROADMAP.md

File metadata and controls

19 lines (17 loc) · 700 Bytes

Roadmap

Functionality

  • Batched inference
  • Fine-grained KV cache management
  • Explore tree sparsity
  • Fine-tune Medusa heads together with LM head from scratch
  • Distill from any model without access to the original training data

Integration

Local Deployment

Serving