Replies: 2 comments
16 kHz mono, 2 bytes (16-bit) per sample, also known as pcm_s16le.
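For anyone wiring this up by hand: as far as I know, `whisper_full()` itself takes its samples as 32-bit floats (`const float * samples`), and the 16-bit pcm_s16le format is what the input file usually contains, so the samples are typically scaled to the range [-1, 1) before the call. Below is a minimal sketch of that scaling step in plain Python, assuming the raw bytes are already 16 kHz mono. The helper name `pcm_s16le_to_float32` is made up for illustration and is not part of whisper.cpp.

```python
import struct


def pcm_s16le_to_float32(raw: bytes) -> list:
    """Convert raw pcm_s16le bytes to floats in [-1.0, 1.0).

    Hypothetical helper for illustration; assumes the data is already
    16 kHz mono, so no resampling or channel mixing is done here.
    """
    if len(raw) % 2 != 0:
        raise ValueError("pcm_s16le data must consist of whole 2-byte samples")
    count = len(raw) // 2
    # "<" = little-endian, "h" = signed 16-bit integer
    ints = struct.unpack("<%dh" % count, raw)
    # Divide by 32768 so the int16 range maps onto [-1.0, 1.0)
    return [s / 32768.0 for s in ints]


if __name__ == "__main__":
    # Four samples: silence, half scale, most negative, most positive
    raw = struct.pack("<4h", 0, 16384, -32768, 32767)
    print(pcm_s16le_to_float32(raw))
```

If the source audio is stereo or uses a different sample rate, it would need to be downmixed and resampled to 16 kHz mono first; that step is not shown here.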
I am not an author here. Please don't ping other users.
Hello @cjheath, thanks for the great asset. To detect text from voice or a sound clip, what format should the data passed to the whisper_full() function be in? I mean mono or stereo, and 2 bytes or 4 bytes per sample?