Speaker Diarization #31

fakerybakery · 2024-02-19T01:06:17Z

Hi,
Is speaker diarization planned (especially in real time)?
Thx!

ZachNagengast · 2024-02-20T00:27:46Z

For now we're mainly focused on running the core Whisper models, which don't support diarization by default. If you're up for building a library for this, we'd be happy to point to your project.

It seems there are a few models that have this capability at the moment. The most popular thread on the subject in the openai/whisper repo is openai/whisper#264. We will do our best to stay at parity with them if they pick a specific approach to diarization, but in the meantime there is an open opportunity for another project to bring it to Swift. We'll keep this issue open until then.

ZachNagengast added the "needs model updates", "needs info", and "feature" labels and removed the "feature" label on Feb 20, 2024

