A PyTorch-based Speech Toolkit
-
Updated
May 22, 2024 - Python
A PyTorch-based Speech Toolkit
SDK & Sample to do speech recognition using websockets in Javascript
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...
🙊 Speech Recognition , Text To Speech , Google Translate
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
A pytorch based end2end speech recognition system.
Open source projects related to Snips https://snips.ai/.
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Artificially Intelligent Machine with Computer Vision, Natural Language Processing, AI, Sense and Feelings.
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Long audio alignment using Kaldi
Web Browser Audio Detection/Speech Recording Events API
A library for using Web Speech API with Angular
Microsoft Engage program 2021
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Pytorch based phoneme recognition (TIMIT phoneme classification)
Add a description, image, and links to the speechrecognition topic page so that developers can more easily learn about it.
To associate your repository with the speechrecognition topic, visit your repo's landing page and select "manage topics."