-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
将音频切分更改为fap的音频切分与响度匹配,以及完成剩余的i18n #937
base: main
Are you sure you want to change the base?
Conversation
GPT-SoVits, adding loudness normalization and maximum audio length control.
webui.py
Outdated
p = Popen(cmd, shell=True) | ||
ps_slice.append(p) | ||
if (p_slice == None): | ||
cmd = f'"{python_exec}" tools/slice_audio.py -i "{input_path}" -o "{output_dir}" --threshold {threshold} --min_duration {min_duration} --max_duration {max_duration} --min_interval {min_interval} --hop_size {hop_size} --max_sil_kept {max_sil_kept} --num_worker {num_worker} --merge_short {merge_short}''' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
末尾的单引号多了
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
末尾的单引号多了
fixed
@@ -770,8 +813,7 @@ def change_size_choices(key): # 根据选择的模型修改可选的模型尺寸 | |||
if_uvr5.change(change_uvr5, [if_uvr5], [uvr5_info]) | |||
open_asr_button.click(open_asr, [asr_inp_dir, asr_opt_dir, asr_model, asr_size, asr_lang], [asr_info,open_asr_button,close_asr_button]) | |||
close_asr_button.click(close_asr, [], [asr_info,open_asr_button,close_asr_button]) | |||
open_slicer_button.click(open_slice, [slice_inp_path,slice_opt_root,threshold,min_length,min_interval,hop_size,max_sil_kept,_max,alpha,n_process], [slicer_info,open_slicer_button,close_slicer_button]) | |||
close_slicer_button.click(close_slice, [], [slicer_info,open_slicer_button,close_slicer_button]) | |||
open_slicer_button.click(open_slice, [slice_inp_path, slice_opt_root, num_worker, min_duration, max_duration, min_interval, threshold, hop_size, max_sil_kept,if_merge_short, if_loudness_norm, loudness, peak], [slicer_info,open_slicer_button,close_slicer_button]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
close_slicer_button 不要了?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
close_slicer_button 不要了?
fixed
为什么要改切分和响度匹配? |
限制最大长度,省的切太长,响度匹配是有群友提议加上的,后来就直接把fap的这两块改一改搬过来了,切分如果保持原样,可以加个单独的响度匹配? |
响度匹配部分代码拉下来用了,效果明显,很稳定,未发现bug。我直接用在推理部分了。 |
融了点冷月佬 fish audio的audio preprocess进去,在readme和py文件中已注明
支持最小长度与最大长度控制,并支持合并短片段,并自动抛弃三秒以下片段(训练时不会使用)
支持响度匹配,默认打开
更新了所有语言的json,现在GSV的i18n json齐全了