-
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing,
arXiv, 2404.06674
, arxiv, pdf, cication: -1Philip Anastassiou, Zhenyu Tang, Kainan Peng, Dongya Jia, Jiaxin Li, Ming Tu, Yuping Wang, Yuxuan Wang, Mingbo Ma · (voiceshopai.github)
-
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion,
arXiv, 2401.11053
, arxiv, pdf, cication: -1Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Zhuo Chen, Lei Xie, Yuping Wang, Yuxuan Wang
-
GPT-SoVITS - RVC-Boss
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
-
CoMoSVC: Consistency Model-based Singing Voice Conversion,
arXiv, 2401.01792
, arxiv, pdf, cication: -1Yiwen Lu, Zhen Ye, Wei Xue, Xu Tan, Qifeng Liu, Yike Guo · (comosvc.github)
-
Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion,
arXiv, 2310.11160
, arxiv, pdf, cication: -1Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu · (zhangxueyao)
-
llvc - koeai
-
Rhythm Modeling for Voice Conversion,
arXiv, 2307.06040
, arxiv, pdf, cication: -1Benjamin van Niekerk, Marc-André Carbonneau, Herman Kamper
-
HierVST: Hierarchical Adaptive Zero-shot Voice Style Transfer,
arXiv, 2307.16171
, arxiv, pdf, cication: -1Sang-Hoon Lee, Ha-Yeong Choi, Hyung-Seok Oh, Seong-Whan Lee
-
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs,
arXiv, 2307.09435
, arxiv, pdf, cication: -1Yinghao Aaron Li, Cong Han, Nima Mesgarani
-
Voice Conversion With Just Nearest Neighbors,
arXiv, 2305.18975
, arxiv, pdf, cication: -1Matthew Baas, Benjamin van Niekerk, Herman Kamper · (knn-vc - interspeech2023blind) · (bshall.github)
-
Applio - IAHispano
VITS-based Voice Conversion focused on simplicity, quality and performance.
-
Retrieval-based-Voice-Conversion-WebUI - RVC-Project
Voice data <= 10 mins can also be used to train a good VC model!