Tools for handling speech data in machine learning projects.
-
Updated
May 31, 2024 - Python
Tools for handling speech data in machine learning projects.
End-to-End Speech Processing Toolkit
kaldi-asr/kaldi is the official location of the Kaldi project.
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Speaker Verification using Pytorch
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A Python wrapper for Kaldi
Command line utility for forced alignment using Kaldi
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Parallelized video speech-to-text converter using ffmpeg and kaldi/vosk
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
A Speech to Text Personal Assist inspired by kaldi2 and joint-bert
Lab exercises of Speech and Language Processing course in NTUA
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Фонограми та синтагми: інструменти обробки
Add a description, image, and links to the kaldi topic page so that developers can more easily learn about it.
To associate your repository with the kaldi topic, visit your repo's landing page and select "manage topics."