Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in the Persian language.
Updated Jun 3, 2024 - Python
Preparation and processing of data for tacotron2.
Speech synthesis - Common Voice Polish dataset.
Tacotron-Korean-Tensorflow2 for Ubuntu
Code used in conjunction with an implementation of a Seq2Seq LSTM TTS frontend, to process and evaluate Google Research's Wikipedia Homograph Dataset (WHD) and LibriSpeech data, with the aim of improving the TTS frontend's homograph disambiguation abilities.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Speech synthesis with conditioning on a very small dataset. Uses Nvidia's Tacotron2 and WaveGlow models with PyTorch.
Converting text to audio and applying audio augmentation
A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about
Catalan Text to Speech
PyTorch implementation of Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
Training Tacotron2 on the Persian language to build a Persian text-to-speech system
EC499: Major Project
This repository contains the code for the main part of my master's thesis in Data Science & Engineering at Politecnico di Torino.