Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial C-style interface for whisper.cpp #19

Merged
merged 3 commits into from
Oct 4, 2022
Merged

Initial C-style interface for whisper.cpp #19

merged 3 commits into from
Oct 4, 2022

Conversation

ggerganov
Copy link
Owner

ref #9

First iteration of the C-style interface.
Already, a lot of the duplicated code in main.cpp and stream.cpp is now gone.

This is a high-level API that allows separate encode/decode calls + token sampling.
The user will be able to implement their own Mel computation, as well as their own token sampling algorithms using the computed token logits and probabilities.

For convenience, the greedy sampling (and in the future - the beam search sampling) are also available.

Additionally, there will be convenience functions that perform the entire interference (e.g. whisper_full) - simply provide RAW PCM audio and get the result.

@ggerganov ggerganov merged commit 6814cc9 into master Oct 4, 2022
@ggerganov ggerganov deleted the c-api branch October 4, 2022 20:18
kultivator-consulting pushed a commit to KultivatorConsulting/whisper.cpp that referenced this pull request Feb 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant