Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
-
Updated
May 20, 2024 - Jupyter Notebook
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
Various applications using tensorflow.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).
Speech-To-Text (STT) project
speech to text benchmark framework
Generates section wise topics and transcription for lecture videos and helps to control the lecture video playback based on generated topic-wise timestamps.
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...
Automatic speech recognition model for the Spoken Word Recognition seminar (SWR2) Tübingen
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
Training scripts for Speech-To-Text models for Ukrainian language
Examples of how to use or integrate DeepSpeech
Lightning talk on DeepSpeech to Canberra Python Users' Group 4th March 2021
Text-to-Speech and Speech-to-Text methods for Python
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Add a description, image, and links to the deepspeech topic page so that developers can more easily learn about it.
To associate your repository with the deepspeech topic, visit your repo's landing page and select "manage topics."