FasterDecoding

Think deeper, decode faster

Pinned

  1. Medusa (Public)

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook · 1.9k stars · 120 forks

Repositories

Showing 4 of 4 repositories
