Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Unofficial implementation of Avocodo: Generative Adversarial Network for Artifact-free Vocoder.

Disclaimer: This implementation currently works only with config_v1.json. The repo was built for experimentation, not for production use.

For the best-quality speech synthesis, please visit deepsync.co.

Training:

python train.py --config config_v1.json

Notes:

Avocodo uses the same generator as HiFi-GAN V1 and V2, but different discriminators that model the low and high frequency ranges better.
PQMF is crucial to both discriminators.
The losses are similar to HiFi-GAN's.
Performance and speed are both roughly on par with HiFi-GAN.
Avocodo is far better than HiFi-GAN at synthesizing speech from unseen speakers.
Avocodo training is around 20% faster than HiFi-GAN, and it needed far less training to produce excellent audio quality.
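
Since PQMF (pseudo quadrature mirror filter bank) comes up in the notes above, here is a minimal pure-Python sketch of PQMF analysis, i.e. splitting a waveform into downsampled sub-bands with a cosine-modulated filter bank. It is not this repo's code. The function names and the default values (4 sub-bands, 62 taps, cutoff ratio 0.142, Kaiser beta 9.0) are illustrative assumptions, and a real implementation would do the filtering with strided convolutions on tensors.

```python
import math


def bessel_i0(x, terms=40):
    """Zeroth-order modified Bessel function, summed from its power series."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / (2.0 * k)) ** 2
        total += term
    return total


def kaiser_window(length, beta):
    """Kaiser window of the given length."""
    mid = (length - 1) / 2.0
    return [
        bessel_i0(beta * math.sqrt(max(0.0, 1.0 - ((n - mid) / mid) ** 2)))
        / bessel_i0(beta)
        for n in range(length)
    ]


def prototype_filter(taps, cutoff_ratio, beta):
    """Kaiser-windowed sinc low-pass filter with taps + 1 coefficients."""
    window = kaiser_window(taps + 1, beta)
    omega_c = math.pi * cutoff_ratio
    coeffs = []
    for n in range(taps + 1):
        t = n - taps / 2.0
        ideal = cutoff_ratio if t == 0 else math.sin(omega_c * t) / (math.pi * t)
        coeffs.append(ideal * window[n])
    return coeffs


def analysis_filters(subbands=4, taps=62, cutoff_ratio=0.142, beta=9.0):
    """Cosine-modulate the prototype so that filter k passes sub-band k."""
    proto = prototype_filter(taps, cutoff_ratio, beta)
    filters = []
    for k in range(subbands):
        phase = (-1) ** k * math.pi / 4.0
        step = (2 * k + 1) * math.pi / (2 * subbands)
        filters.append([
            2.0 * proto[n] * math.cos(step * (n - taps / 2.0) + phase)
            for n in range(taps + 1)
        ])
    return filters


def pqmf_analysis(signal, filters):
    """Filter the signal with each band filter and keep every K-th sample."""
    subbands = len(filters)
    bands = []
    for h in filters:
        band = []
        for m in range(0, len(signal), subbands):
            # Causal convolution; samples before the start count as zero.
            acc = sum(c * signal[m - j] for j, c in enumerate(h) if m - j >= 0)
            band.append(acc)
        bands.append(band)
    return bands


# A low-frequency tone should land almost entirely in the first sub-band.
tone = [math.sin(0.1 * math.pi * n) for n in range(512)]
bands = pqmf_analysis(tone, analysis_filters())
energies = [sum(v * v for v in band) for band in bands]
```

Each sub-band runs at 1/K of the input sample rate, so a discriminator that looks at one band sees only that slice of the spectrum.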

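For readers who do not know the HiFi-GAN losses mentioned in the notes, the following is a rough sketch of their general shape: a least-squares adversarial loss, a feature-matching loss, and a mel-spectrogram L1 term. It works on plain Python lists and is not this repo's code. The function names are hypothetical, and the weights 2 and 45 are the values reported in the HiFi-GAN paper; that this repo uses the same weights is an assumption.

```python
def discriminator_loss(real_scores, fake_scores):
    """Least-squares GAN loss: push real scores toward 1 and fake scores toward 0."""
    real_term = sum((1.0 - r) ** 2 for r in real_scores) / len(real_scores)
    fake_term = sum(f ** 2 for f in fake_scores) / len(fake_scores)
    return real_term + fake_term


def generator_adversarial_loss(fake_scores):
    """The generator wants the discriminator to score its output as 1."""
    return sum((1.0 - f) ** 2 for f in fake_scores) / len(fake_scores)


def feature_matching_loss(real_features, fake_features):
    """Sum over layers of the mean absolute difference between feature maps."""
    total = 0.0
    for real_layer, fake_layer in zip(real_features, fake_features):
        diffs = [abs(r - f) for r, f in zip(real_layer, fake_layer)]
        total += sum(diffs) / len(diffs)
    return total


def generator_loss(fake_scores, real_features, fake_features, mel_l1,
                   lambda_fm=2.0, lambda_mel=45.0):
    """Weighted sum of the three generator terms (weights are assumptions here)."""
    return (generator_adversarial_loss(fake_scores)
            + lambda_fm * feature_matching_loss(real_features, fake_features)
            + lambda_mel * mel_l1)
```

In a multi-discriminator setup such as HiFi-GAN's, these terms are summed over every sub-discriminator; the sketch shows a single one for brevity.
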
Citations:

@misc{https://doi.org/10.48550/arxiv.2206.13404,
  doi = {10.48550/ARXIV.2206.13404},
  url = {https://arxiv.org/abs/2206.13404},
  author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
  keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences},
  title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}
