Whisper-Based Automatic Speech Recognition (ASR) with improved timestamp accuracy using forced alignment.
Final Project of Speech Recognition university Course - Winter 2023 -Dr koochari
Author(Modified Source Code): Saba Hesaraki
Result using WhisperX with forced alignment to wav2vec2.0 large:
sample01.mp4
Compare this to original whisper out the box, where many transcriptions are out of sync: