kaldi-asr/kaldi is the official location of the Kaldi project.
-
Updated
Apr 30, 2024 - Shell
kaldi-asr/kaldi is the official location of the Kaldi project.
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
SoftVC VITS Singing Voice Conversion
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
💬 Speech recognition for your site
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Videos, notes and experiments to understand deep learning
ModelScope: bring the notion of Model-as-a-Service to life.
Data manipulation and transformation for audio signal processing, powered by PyTorch
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
WaveNet vocoder
Foundational model for human-like, expressive TTS
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."