Help with training own language model using DeepSpeech #3763

Open
rajib-raiyat opened this issue Mar 30, 2023 · 2 comments

Comments

@rajib-raiyat

I am interested in training my own Bangla language model using DeepSpeech, but I'm not sure where to start. I have my own Bangla dataset with audio and text, and I would like to use it to train a model that can transcribe Bangla speech to text offline. I am looking for guidance on how to preprocess my data, train a model, and evaluate its performance.

Can someone please provide detailed instructions or point me to a tutorial or guide that can help me with this process? Here are some specific questions I have:

  • What are the best practices for preprocessing Bangla audio and text data for use with DeepSpeech?
  • How do I create a Bangla language model and generate the necessary files for training a DeepSpeech model?
  • What are the recommended training parameters and settings for training a Bangla language model using DeepSpeech?
  • How do I evaluate the performance of my trained model, and what metrics should I use?
  • How do I install DeepSpeech from scratch?

I would appreciate any help or advice that can be provided. Thank you in advance!

@prasanth50

Hey Komol Kunty Rajib, you can do this by training the model on your Bangla dataset, with each audio clip mapped to its text transcript. Then build a wrapper model around it that uses an audio-to-text API to convert incoming audio into clean text input. Since the model has been trained on your dataset, it will map the provided text to the matching entry and return the corresponding text.

@KathyReid
Contributor

@rajib-raiyat The DeepSpeech Playbook has been produced to assist you with training a model on other languages.
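
As a rough illustration of the data preparation and evaluation steps asked about above, here is a minimal Python sketch. It assumes the training CSV layout described in the DeepSpeech training docs (columns `wav_filename`, `wav_filesize`, `transcript`) and that the audio has already been converted to the WAV format the training code expects (commonly 16 kHz mono). The helper names (`normalize_transcript`, `build_alphabet`, `write_manifest`, `word_error_rate`) are hypothetical and not part of DeepSpeech, and the Unicode normalization step is an assumption about what suits Bangla text, not a documented requirement.

```python
import csv
import os
import unicodedata


def normalize_transcript(text):
    """NFC-normalize and collapse whitespace so each character has one encoding.

    Assumption: consistent Unicode normalization keeps the alphabet small.
    """
    return " ".join(unicodedata.normalize("NFC", text).split())


def build_alphabet(transcripts):
    """Return every distinct character used in the transcripts, sorted.

    The result can be written one character per line to an alphabet file.
    """
    return sorted({ch for t in transcripts for ch in t})


def write_manifest(pairs, csv_path):
    """Write (wav_path, transcript) pairs as a training CSV.

    Each row holds the file path, its size in bytes, and the cleaned transcript.
    """
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["wav_filename", "wav_filesize", "transcript"])
        for wav_path, transcript in pairs:
            writer.writerow(
                [wav_path, os.path.getsize(wav_path), normalize_transcript(transcript)]
            )


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))  # distances for an empty reference prefix
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1] / max(len(ref), 1)
```

Word error rate (and the analogous character error rate, computed over characters instead of words) is the usual metric for comparing a model's transcripts against held-out reference transcripts. The Playbook remains the authoritative guide for the actual training flags and settings.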

Shrutakeerti added a commit to Shrutakeerti/DeepSpeech that referenced this issue Feb 16, 2024
3 participants