GitHub - wladradchenko/advanced.wunjo.wladradchenko.ru: Extension to add advanced features to Wunjo AI

(RU)

Advanced Wunjo AI

Extension

Project documentation

Issue · Features

Deprecated: Starting from version 1.4, the use of extensions in Wunjo AI is deprecated.

About extension

The extension adds to the main application:

Training panel of your own neural network model for voice synthesis
Ability to use GPU
Switching work from CPU to GPU and vice versa
Improved background quality when creating animations
Console to track learning progress and content generation

Note: GPU extensions available if you have CUDA installed.

About

Wunjo AI Extensions are add-on modules for extending the capabilities of Wunjo AI. Main GitHub project at link.

Wunjo AI is a speech-to-text and speech-to-text recognition application. One of the unique features of this application is the ability to create multi-dialogues with multiple voices, and the number of characters used is not limited, unlike similar web applications. You can also speak text in real time and the app will recognize it from the audio. This feature is great for dictating text instead of manually typing it.

All in all, this neural network desktop application is a handy and powerful tool for anyone who needs speech synthesis and voice-to-text recognition. Best of all, the app is free, installs locally, and is easy to use! And you can use it in the voice acting of commercials, books, games, etc.

Update 1.0.0

Added GPU usage for faster processing.
Switching work from CPU to GPU and vice versa
Background quality improvements when creating animations
Added panel for training Tacotron2 neural network model (training result in .wunjo/user_trained_voice)
Added panel for training Waveglow neural network model (training result in .wunjo/user_trained_voice)

Update 1.0.1

Added console to track learning and synthesis progress.

Update 1.0.2

Add inspect right format of mark file.
Change ru and en on icon.

Installation

Download to directory .wunjo/extensions/{folder}

Extension format

To create your own extension, you will need to create a run.py file with a run method that takes the media_folder, extension_folder, app directory as input, where media_folder is the media file directory for saving the code, extension_folder is the directory of the extension itself, app is the Flask application, where you can add new pages or options.

To add new elements to the front of the project, you need to create a templates/index.html directory in your extension, where you can add new elements, js, css.

An example of creating an extension structure in this project.

Train data format

Audio has to be in .wav format Mono channels, sample rate 22050 Hz and bit rate 352Kbts. Example for marks:

006522.wav|Н+е к Пл+юхиной ж+е обращ+аться, сказ+ал ред+актор. В+от он+о, д+умаю, тво+ё подсозн+ание. Сд+елайте, гол+убчик.
006523.wav|В см+ысле заш+ить? Н+а ск+орую р+уку. Вообщ+е‑т+о я н+е ум+ею… Д+а к+ак сум+еете. Кор+оче, заш+ил я ем+у бр+юки.
006524.wav|Чег+о +уж т+ам… Заглян+ул в лаборат+орию к Жбанк+ову. Собир+айся, говор+ю, пошл+и.

Where + is the stress in the word.

P.S. Even if your custom dataset is smaller in terms of total number of files, individual sequences could be longer. Tacotron2's memory consumption largely depends on the sequence lengths due to the recurrent nature of the model. Ensure that sequences in your custom dataset aren't too long or consider truncating or splitting them. If you still have an error CUDA out of memory, then you can reduce batch size to 16 in hparams.yaml

Video

Контакт

Author: Wladislav Radchenko

Email: i@wladradchenko.ru

Project: https://github.com/wladradchenko/wunjo.wladradchenko.ru

Web site: wladradchenko.ru/voice

Credits

Tacatron 2 - https://github.com/NVIDIA/tacotron2
Waveglow - https://github.com/NVIDIA/waveglow
Apex - https://github.com/NVIDIA/apex

(to top)

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
example		example
speech/tps		speech/tps
static		static
tacotron2		tacotron2
templates		templates
waveglow		waveglow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_ru.md		README_ru.md
run.py		run.py

License

wladradchenko/advanced.wunjo.wladradchenko.ru

Folders and files

Latest commit

History

Repository files navigation

Advanced Wunjo AI

Extension

About extension

About

Update 1.0.0

Update 1.0.1

Update 1.0.2

Installation

Extension format

Train data format

Video

Контакт

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Languages