Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

你好,这个训练参数如何配置,如何命令启动训练 #28

Open
dgo2dance opened this issue Jan 19, 2024 · 3 comments
Open

Comments

@dgo2dance
Copy link

dgo2dance commented Jan 19, 2024

你好,训练参数如何配置,模型需要下载放到对应目录吗

@ssbuild
Copy link
Owner

ssbuild commented Jan 19, 2024

你好,训练参数如何配置,模型需要下载放到对应目录吗

  1. 制作数据集:
    cd scripts
    bash train_full.sh -m dataset
    or
    bash train_lora.sh -m dataset
    or
    bash train_ptv2.sh -m dataset
  2. 训练

cd scripts
bash train_full.sh -m train
or
bash train_lora.sh -m train
or
bash train_ptv2.sh -m train

默认配置文件是: scripts/config/train_pl.yaml
其他参数可以根据需求修改配置

@dgo2dance
Copy link
Author

dgo2dance commented Jan 20, 2024

  1. 制作数据集:
    cd scripts 光盘脚本
    bash train_full.sh -m dataset bash train_full.sh -m 数据集
    or 或者
    bash train_lora.sh -m dataset bash train_lora.sh -m 数据集
    or 或者
    bash train_ptv2.sh -m dataset bash train_ptv2.sh -m 数据集

您好,请教下,制作数据集这一步是做什么的,数据是在data目录已经放好的吧,用make_data_example.py生成的

bash train_lora.sh -m dataset

@ssbuild
Copy link
Owner

ssbuild commented Jan 20, 2024

  1. 制作数据集:
    cd scripts 光盘脚本
    bash train_full.sh -m dataset bash train_full.sh -m 数据集
    or 或者
    bash train_lora.sh -m dataset bash train_lora.sh -m 数据集
    or 或者
    bash train_ptv2.sh -m dataset bash train_ptv2.sh -m 数据集

您好,请教下,制作数据集这一步是做什么的,数据是在data目录已经放好的吧,用make_data_example.py生成的

bash train_lora.sh -m dataset

读取配置文件train_pl.yaml 里面的max_length 等信息 制作ids数据集。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants