Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

T5模型预训练问题 #356

Open
zhangzai666 opened this issue Mar 23, 2023 · 0 comments
Open

T5模型预训练问题 #356

zhangzai666 opened this issue Mar 23, 2023 · 0 comments

Comments

@zhangzai666
Copy link

您好:
我尝试基于t5_base模型进行预训练pretrain,数据量较少大概3000多条,训练了1000步,结果输出基本全是“”的“”,如下:
input= "中extra0的首都是extra1京"
output=[{'generated_text': 'extra0 的 extra1 的 extra2 extra3'}]
请问我这是过拟合了破坏原来模型参数了??
哪位大神指导一下

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant