1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
-
Updated
May 23, 2024 - Python
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A service designed to translate speeches in multimedia using AI and ML voice cloning technology.
VITS-based Voice Conversion focused on simplicity, quality and performance.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A webui for different audio related Neural Networks
Voice Cloner is a tool to clone human voices in a very natural and realistic way. The application collects voice samples and generates the audio using text to speech.
Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Simple WhisperSpeech web UI
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation
Problem Statement: Developing A Software For Dubbing Videos.
A research project and state-of-the-art review on text-to-speech models and voice cloning.
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
The best looking and most functional webui for RVC related tasks. See website for UI demo:
Robust functionality, focused on granting convenient access to AI models developed using the RVC technology.
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Add a description, image, and links to the voice-cloning topic page so that developers can more easily learn about it.
To associate your repository with the voice-cloning topic, visit your repo's landing page and select "manage topics."