#

deepspeech

Here are 130 public repositories matching this topic...

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated May 20, 2024
Jupyter Notebook

yeyupiaoling / MASR

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

deep-learning speech pytorch speech-recognition speech-to-text asr conformer deepspeech squeezeformer

Updated May 8, 2024
Python

yeyupiaoling / PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。

docker deep-learning speech-recognition chinese speech-to-text nvidia-docker asr paddlepaddle deepspeech2 deepspeech

Updated May 3, 2024
Python

waikato-datamining / tensorflow

Various applications using tensorflow.

python deep-learning tensorflow image-classification object-detection image-segmentation model-maker deepspeech tflite efficientdet

Updated Apr 22, 2024
Python

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Feb 18, 2024
C++

gulabpatel / Speech-to-Text

text-to-speech speech-to-text gtts deepspeech wav2vec2

Updated Feb 5, 2024
Jupyter Notebook

waikato-ufdl / wai-annotations

Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).

image-annotation conversion python3 vgg tfrecords mscoco deepspeech common-voice festvox

Updated Feb 2, 2024
Dockerfile

yui-mhcp / speech_to_text

Speech-To-Text (STT) project

jasper speech-to-text stt whisper deepspeech tensorflow2 audio-transcription video-transcription stt-api

Updated Feb 1, 2024
Python

Picovoice / speech-to-text-benchmark

speech to text benchmark framework

privacy deep-neural-networks deep-learning offline voice-recognition speech-recognition speech-to-text pocketsphinx cheetah deepspeech aws-transcribe mozilla-deepspeech edge-ai google-speech-to-text picovoice

Updated Jan 12, 2024
Python

manthan410 / audio-lecture-notes-generator

Generates section wise topics and transcription for lecture videos and helps to control the lecture video playback based on generated topic-wise timestamps.

nlp youtube-dl topic-modeling automatic-speech-recognition speech-to-text transcription deepspeech whisper-ai wave2vec2

Updated Dec 29, 2023
Jupyter Notebook

abhirooptalasila / AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

python video ffmpeg sox subtitle srt speech-to-text asr autosub deepspeech mozilla-deepspeech coqui-ai

Updated Dec 24, 2023
Python

milahu / autosub-by-abhirooptalasila

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

offline tensorflow speech-recognition srt vtt autosub deepspeech subtitle-generator subtitles-generator

Updated Dec 15, 2023
Python

manhph2211 / ViTTS

In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...

text-to-speech speech-synthesis mfa vocoder deepspeech normalizing-flow hifi-gan multispeaker-speech-synthesis mosnet portaspeech realtime-tts istftnet vietnamese-tts vietnamese-text-to-speech

Updated Nov 24, 2023
Python

Algo-Boys / SWR2-ASR

Automatic speech recognition model for the Spoken Word Recognition seminar (SWR2) Tübingen

automatic-speech-recognition german-language deepspeech tuebingen ctc-decode

Updated Nov 23, 2023
Python

AdroitAnandAI / Indian-Accent-Speech-Recognition

Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech

hmm dnn voice-recognition speech-recognition accent asr indian indian-language deepspeech speech-modelling cepstral-analysis accented-speech custom-training indian-accent

Updated Oct 19, 2023
Jupyter Notebook

robinhad / voice-recognition-ua

Training scripts for Speech-To-Text models for Ukrainian language

speech-recognition speech-to-text ukrainian stt asr deepspeech ukrainian-language wav2vec coqui-ai

Updated Aug 28, 2023
Jupyter Notebook

mozilla / DeepSpeech-examples

Examples of how to use or integrate DeepSpeech

nodejs python machine-learning dotnet examples speech-recognition deepspeech

Updated Jul 25, 2023
Python

KathyReid / cpug-2021-deepspeech

Lightning talk on DeepSpeech to Canberra Python Users' Group 4th March 2021

lightning-talk canberra deepspeech

Updated Jul 20, 2023
JavaScript

glhr / speech

Text-to-Speech and Speech-to-Text methods for Python

python speech-recognition speech-to-text google-speech-recognition deepspeech

Updated Jul 6, 2023
Python

scription

smlum / scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

speech-to-text transcription deepspeech aws-transcribe mozilla-deepspeech scription google-speech-to-text yandex-speech-kit

Updated Jun 29, 2023
JavaScript

Improve this page

Add a description, image, and links to the deepspeech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeech topic, visit your repo's landing page and select "manage topics."