A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
-
Updated
Jan 25, 2024 - Python
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
Persian/Farsi text to speech(TTS) training using coqui tts
Text to Speech using Coqui TTS + RVC
Rust bindings to the https://github.com/coqui-ai TTS library
Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS
Voice cloning using coqui-TTS
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.
EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.
Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
Synthesize speech using state-of-the-art open and closed-source tools
Generate cursed videos with AI
llm server using outlines for json/regex/cfg formatted generation
Add a description, image, and links to the coqui-tts topic page so that developers can more easily learn about it.
To associate your repository with the coqui-tts topic, visit your repo's landing page and select "manage topics."