Hindi-ASR-and-TTS

Summary

In this project, we built Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) modules for Hindi, a low-resource Indic language.

The Hindi ASR module is built on Facebook’s Wav2Vec 2.0 model. We also submitted systems to the ‘GRAM VAANI ASR Challenge 2022’ in the ‘Open’ and ‘Self-supervised’ categories. For the Open category, we took ‘CLSRIL-23’, a model pre-trained by Vakyansh, and fine-tuned it on 100 hours of labeled telephonic data. For the Self-supervised category, we first pre-trained a model on 1000 hours of unlabeled telephonic data and then fine-tuned it on 100 hours of labeled telephonic data. A KenLM language model was used in the decoding step to improve the model’s accuracy.
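To illustrate how a language model can help in the decoding step, here is a minimal, self-contained sketch. It is not the project's decoder: it assumes the fine-tuned model emits per-frame token ids in the CTC style, and it replaces KenLM with a toy word-level scorer. The names `ctc_greedy_decode`, `toy_lm_logprob`, and `rescore`, the weights `alpha` and `beta`, and all scores are hypothetical and chosen only for illustration.

```python
def ctc_greedy_decode(frame_ids, blank_id=0):
    """Collapse consecutive repeats, then drop blanks (the CTC greedy rule)."""
    out = []
    prev = None
    for token in frame_ids:
        if token != prev and token != blank_id:
            out.append(token)
        prev = token
    return out


# Toy stand-in for a language model (NOT KenLM): log-probabilities per word,
# with a fixed penalty for words the "model" has never seen.
TOY_WORD_LOGPROBS = {"mera": -1.0, "naam": -1.5}
UNKNOWN_WORD_LOGPROB = -10.0


def toy_lm_logprob(text):
    """Sum of per-word log-probabilities under the toy model."""
    return sum(TOY_WORD_LOGPROBS.get(w, UNKNOWN_WORD_LOGPROB) for w in text.split())


def rescore(hypotheses, lm_logprob, alpha=0.5, beta=1.0):
    """Return the text maximizing acoustic + alpha * LM + beta * word_count.

    `hypotheses` is a list of (text, acoustic_log_score) pairs.
    `alpha` and `beta` are illustrative weights, not tuned values.
    """
    def total(hyp):
        text, acoustic = hyp
        return acoustic + alpha * lm_logprob(text) + beta * len(text.split())

    return max(hypotheses, key=total)[0]


# The second candidate has the better acoustic score, but the language model
# strongly prefers the first one because "nam" is out of vocabulary.
candidates = [("mera naam", -4.0), ("mera nam", -3.8)]
best = rescore(candidates, toy_lm_logprob)
```

In this sketch the acoustically weaker hypothesis wins once the language-model term is added, which is the effect an n-gram model is meant to have during decoding.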

The Hindi TTS model was developed using Tacotron2 and Parallel WaveGAN. A TTS synthesizer mainly consists of two modules: a 'Spectrogram Prediction Network' and a 'Vocoder'. Tacotron2 is an end-to-end speech synthesis model developed by Google that takes processed characters as input and can convert them to a speech waveform. In our project, we use Tacotron2 only to generate the acoustic features. These acoustic (spectral) features are then fed to a vocoder, ‘Parallel WaveGAN’, which produces speech as the output. Both models were trained on the 21.46-hour IIT Madras Hindi dataset (female voice).
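The two-stage structure described above can be sketched as a simple composition of functions. This is only an interface sketch under stated assumptions: `dummy_acoustic_model` and `dummy_vocoder` are placeholders that return zeros, not Tacotron2 or Parallel WaveGAN, and the values `n_mels=80`, `frames_per_char=5`, and `hop_length=256` are hypothetical and not taken from the project's configuration.

```python
from typing import Callable, List

Mel = List[List[float]]   # frames x mel bins
Wave = List[float]        # raw audio samples


def synthesize(text: str,
               acoustic_model: Callable[[str], Mel],
               vocoder: Callable[[Mel], Wave]) -> Wave:
    """Stage 1 predicts spectral frames from text; stage 2 turns them into audio."""
    mel = acoustic_model(text)
    return vocoder(mel)


def dummy_acoustic_model(text: str, n_mels: int = 80, frames_per_char: int = 5) -> Mel:
    """Placeholder for the spectrogram prediction network: emits silent frames."""
    return [[0.0] * n_mels for _ in range(len(text) * frames_per_char)]


def dummy_vocoder(mel: Mel, hop_length: int = 256) -> Wave:
    """Placeholder for the vocoder: emits hop_length samples per input frame."""
    return [0.0] * (len(mel) * hop_length)


waveform = synthesize("namaste", dummy_acoustic_model, dummy_vocoder)
```

Keeping the two stages behind separate callables mirrors the design in the text: the acoustic model and the vocoder can be trained and swapped independently as long as they agree on the feature format.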

Project report can be accessed here.
Project presentation can be accessed here.

parvatijay2901/Hindi-ASR-and-TTS
