saba99/Whisper_ASR_OpenAi

WhisperX

Whisper-Based Automatic Speech Recognition (ASR) with improved timestamp accuracy using forced alignment.

Final project for the university Speech Recognition course, Winter 2023, Dr. Koochari

Author (modified source code): Saba Hesaraki
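
To make the idea of forced alignment concrete, here is a minimal, self-contained sketch. It is not the code used in this repository or in WhisperX: it assumes a toy table of per-frame log-probabilities (standing in for the output of an acoustic model such as wav2vec 2.0), omits the CTC blank token that real aligners handle, and uses a hypothetical function name `forced_align`. Given a known transcript, it finds the most likely monotonic assignment of frames to tokens and returns each token's frame span.

```python
import math


def forced_align(log_probs, tokens):
    """Align `tokens` to frames with a monotonic Viterbi search.

    log_probs: list of T dicts, one per audio frame, mapping token -> log-probability.
    tokens:    transcript tokens in order (requires 1 <= len(tokens) <= T).
    Returns a list of (token, start_frame, end_frame), end_frame exclusive.
    Simplification: no CTC blank token, so every frame belongs to some token.
    """
    T, N = len(log_probs), len(tokens)
    if N == 0 or N > T:
        raise ValueError("need at least one token and no more tokens than frames")

    NEG = -math.inf
    # score[t][j]: best log-probability of frames 0..t with frame t assigned to token j
    score = [[NEG] * N for _ in range(T)]
    # advanced[t][j]: True if the best path reached (t, j) from token j - 1
    advanced = [[False] * N for _ in range(T)]
    score[0][0] = log_probs[0][tokens[0]]

    for t in range(1, T):
        for j in range(min(t + 1, N)):
            stay = score[t - 1][j]
            move = score[t - 1][j - 1] if j > 0 else NEG
            if move > stay:
                best, advanced[t][j] = move, True
            else:
                best = stay
            score[t][j] = best + log_probs[t][tokens[j]]

    # Backtrack from the last frame, which must belong to the last token.
    assignment = [0] * T
    j = N - 1
    for t in range(T - 1, -1, -1):
        assignment[t] = j
        if advanced[t][j]:
            j -= 1

    # Collapse runs of identical token indices into frame spans.
    spans, start = [], 0
    for t in range(1, T + 1):
        if t == T or assignment[t] != assignment[start]:
            spans.append((tokens[assignment[start]], start, t))
            start = t
    return spans


# Toy emissions for five frames over the two-token transcript "h", "i".
probs = [(0.9, 0.1), (0.8, 0.2), (0.6, 0.4), (0.2, 0.8), (0.1, 0.9)]
emissions = [{"h": math.log(h), "i": math.log(i)} for h, i in probs]

FRAME_SECONDS = 0.02  # assumed frame stride, for illustration only
for token, start, end in forced_align(emissions, ["h", "i"]):
    print(token, round(start * FRAME_SECONDS, 3), round(end * FRAME_SECONDS, 3))
```

Frame spans convert to timestamps by multiplying by the model's frame stride. The 20 ms value above is an assumption for the example. Because the transcript is fixed in advance, the search only decides where each token starts and ends, which is why alignment can give tighter timestamps than the recognizer's own estimates.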

English

Result using WhisperX with forced alignment to wav2vec2.0 large:

sample01.mp4

Compare this to the original Whisper out of the box, where many transcriptions are out of sync:

sample_whisper_og.mov

German

sample_de_01_vis.mov