speech

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

speech image-editing caption data-generation 3d-whole-body-pose-estimation open-vocabulary-detection open-vocabulary-segmentation automatic-labeling-system

Updated May 18, 2024
Jupyter Notebook

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

TalAter / annyang

Star

💬 Speech recognition for your site

voice speech speech-recognition speech-to-text hacktoberfest

Updated Oct 3, 2022
JavaScript

m-bain / whisperX

Star

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated May 16, 2024
Python

avinashkranjan / Amazing-Python-Scripts

Sponsor

Star

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

python machine-learning projects speech artificial-intelligence webcam python-scripts hacktoberfest python-projects

Updated May 21, 2024
Jupyter Notebook

AIGC-Audio / AudioGPT

Star

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audio music speech sound gpt talking-head

Updated Apr 2, 2024
Python

roatienza / Deep-Learning-Experiments

Star

Videos, notes and experiments to understand deep learning

nlp deep-learning speech pytorch artificial-intelligence vision deep-learning-tutorial

Updated Mar 25, 2024
Jupyter Notebook

modelscope / modelscope

Star

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated May 23, 2024
Python

pytorch / audio

Star

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio python machine-learning speech pytorch io audio-processing

Updated May 23, 2024
Python

netease-youdao / EmotiVoice

Star

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Feb 6, 2024
Python

r9y9 / wavenet_vocoder

Sponsor

Star

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Jul 29, 2023
Python

metavoiceio / metavoice-src

Star

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated May 14, 2024
Python

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 14, 2022
Python

Kyubyong / tacotron

Star

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

tensorflow speech tts speech-synthesis-model

Updated Jan 17, 2022
Python

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech

Here are 1,617 public repositories matching this topic...

kaldi-asr / kaldi

babysor / MockingBird

svc-develop-team / so-vits-svc

coqui-ai / TTS

PaddlePaddle / models

huggingface / datasets

IDEA-Research / Grounded-Segment-Anything

mozilla / TTS

TalAter / annyang

m-bain / whisperX

avinashkranjan / Amazing-Python-Scripts

AIGC-Audio / AudioGPT

roatienza / Deep-Learning-Experiments

modelscope / modelscope

pytorch / audio

netease-youdao / EmotiVoice

r9y9 / wavenet_vocoder

metavoiceio / metavoice-src

mravanelli / pytorch-kaldi

Kyubyong / tacotron

Improve this page

Add this topic to your repo