self-supervised pretraining(wav2vec 2.0/data2vec) for wenet #1003

Emiyassstar · 2022-03-31T09:44:20Z

1.support self-supervised pretraining using wav2vec 2.0/data2vec method
2.add ssl recipe in librispeech/ssl
3.add ssl recipe in aishell/ssl

misaka23 · 2022-03-31T11:21:42Z

cool!

wenet/data2vec/data2vec_encoder.py

wenet/bin/train.py

wenet/data2vec/data2vec_model.py

Yymax-max · 2022-04-02T03:14:35Z

nice

liufei1656 · 2022-04-08T02:51:24Z

Looking forward to the latest developments

rookie0607 · 2023-04-03T09:01:16Z

1.support self-supervised pretraining using wav2vec 2.0/data2vec method 2.add ssl recipe in librispeech/ssl 3.add ssl recipe in aishell/ssl

我尝试复现这个例子，使用https://huggingface.co/emiyasstar/ch-w2v-conformer 这个预训练模型，报错如下：
Traceback (most recent call last):
File "wenet/bin/train.py", line 322, in
main()
File "wenet/bin/train.py", line 234, in main
infos = load_trained_modules(model, args)
File "/home/wenet_ssl/wenet/utils/checkpoint.py", line 95, in load_trained_modules
model.load_state_dict(main_state_dict)
File "/home/miniconda3/envs/wenet/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1482, in load_state_dict
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for Wav2vec2Model:
Unexpected key(s) in state_dict: "encoder.embed.linear.weight", "encoder.embed.linear.bias".
size mismatch for encoder.embed.conv.2.weight: copying a param with shape torch.Size([512, 512, 5, 5]) from checkpoint, the shape in current model is torch.Size([512, 512, 3, 3]).
我该如何修改？希望得到您的回复。

Emiyassstar · 2023-04-21T08:28:04Z

我尝试复现这个例子，使用https://huggingface.co/emiyasstar/ch-w2v-conformer 这个预训练模型，报错如下： Traceback (most recent call last): File "wenet/bin/train.py", line 322, in main() File "wenet/bin/train.py", line 234, in main infos = load_trained_modules(model, args) File "/home/wenet_ssl/wenet/utils/checkpoint.py", line 95, in load_trained_modules model.load_state_dict(main_state_dict) File "/home/miniconda3/envs/wenet/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1482, in load_state_dict raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format( RuntimeError: Error(s) in loading state_dict for Wav2vec2Model: Unexpected key(s) in state_dict: "encoder.embed.linear.weight", "encoder.embed.linear.bias". size mismatch for encoder.embed.conv.2.weight: copying a param with shape torch.Size([512, 512, 5, 5]) from checkpoint, the shape in current model is torch.Size([512, 512, 3, 3]). 我该如何修改？希望得到您的回复。

ch-w2v-conformer使用的是6倍降采样模型，并且去除了预训练部分的训练参数以兼容master分支代码，你可以配合我们放出的 openasr recipe 里面提供的配置文件去加载模型
https://github.com/wenet-e2e/wenet/tree/main/examples/openasr2021/s0

rookie0607 · 2023-04-25T03:22:25Z

我尝试复现这个例子，使用https://huggingface.co/emiyasstar/ch-w2v-conformer 这个预训练模型，报错如下： Traceback (most recent call last): File "wenet/bin/train.py", line 322, in main() File "wenet/bin/train.py", line 234, in main infos = load_trained_modules(model, args) File "/home/wenet_ssl/wenet/utils/checkpoint.py", line 95, in load_trained_modules model.load_state_dict(main_state_dict) File "/home/miniconda3/envs/wenet/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1482, in load_state_dict raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format( RuntimeError: Error(s) in loading state_dict for Wav2vec2Model: Unexpected key(s) in state_dict: "encoder.embed.linear.weight", "encoder.embed.linear.bias". size mismatch for encoder.embed.conv.2.weight: copying a param with shape torch.Size([512, 512, 5, 5]) from checkpoint, the shape in current model is torch.Size([512, 512, 3, 3]). 我该如何修改？希望得到您的回复。

ch-w2v-conformer使用的是6倍降采样模型，并且去除了预训练部分的训练参数以兼容master分支代码，你可以配合我们放出的 openasr recipe 里面提供的配置文件去加载模型 https://github.com/wenet-e2e/wenet/tree/main/examples/openasr2021/s0

感谢您的回复，https://github.com/wenet-e2e/wenet/blob/1269a6e5bbec440302e934f243f623baeebf2758/examples/aishell/s0_ssl/README.md 提到的使用fbank作为特征输入所训练的w2v-conformer 模型开源了吗？

aydentang added 11 commits March 15, 2022 20:14

wav2vec2 training

023e94d

ssl recipe

d41cdfc

update recipe

f82c443

update recipe

139001e

update recipe

1754fe1

fix recog bug

f8b3dfd

w2v mask

b9b2f18

fix encoder

422c89e

readme

e59b33c

data2vec training

12f6973

update config

ae098dc

Emiyassstar changed the title ~~self-supervised pretraining(wa2vec 2.0/data2vec) for wenet~~ self-supervised pretraining(wav2vec 2.0/data2vec) for wenet Mar 31, 2022

fanlu reviewed Mar 31, 2022

View reviewed changes

wenet/data2vec/data2vec_encoder.py Outdated Show resolved Hide resolved

wenet/bin/train.py Outdated Show resolved Hide resolved

wenet/data2vec/data2vec_model.py Outdated Show resolved Hide resolved

aydentang added 2 commits March 31, 2022 21:39

fix arxiv paper,fix jit export

76169d4

fix jit export

8a1cb96

aydentang and others added 6 commits August 25, 2022 22:41

fix some bugs

54f6b36

aishell finetune example

28842eb

readme

f1ba852

Update README.md

b3e9244

Update README.md

ccc425e

update readme

1269a6e

Emiyassstar mentioned this pull request Jan 11, 2023

How to train a multilingual model, is there a script for it? #1656

Closed

Mddct mentioned this pull request Oct 7, 2023

[ssl/wav2vec2] support wav2vec2 #2034

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

self-supervised pretraining(wav2vec 2.0/data2vec) for wenet #1003

self-supervised pretraining(wav2vec 2.0/data2vec) for wenet #1003

Emiyassstar commented Mar 31, 2022 •

edited

misaka23 commented Mar 31, 2022

Yymax-max commented Apr 2, 2022

liufei1656 commented Apr 8, 2022

rookie0607 commented Apr 3, 2023

Emiyassstar commented Apr 21, 2023

rookie0607 commented Apr 25, 2023

self-supervised pretraining(wav2vec 2.0/data2vec) for wenet #1003

Are you sure you want to change the base?

self-supervised pretraining(wav2vec 2.0/data2vec) for wenet #1003

Conversation

Emiyassstar commented Mar 31, 2022 • edited

misaka23 commented Mar 31, 2022

Yymax-max commented Apr 2, 2022

liufei1656 commented Apr 8, 2022

rookie0607 commented Apr 3, 2023

Emiyassstar commented Apr 21, 2023

rookie0607 commented Apr 25, 2023

Emiyassstar commented Mar 31, 2022 •

edited