I want to train DPO with the following script:
#!/bin/bash
CUDA_VISIBLE_DEVICES=0 python ../../src/train_bash.py \
    --stage dpo \
    --do_train \
    --model_name_or_path BASE_MODEL \
    --adapter_name_or_path FINETUNED_MODEL \
    --create_new_adapter \
    --dataset orca_rlhf \
    --dataset_dir ../../data \
    --template default \
    --finetuning_type lora \
    --lora_target q_proj,v_proj \
    --output_dir ../../saves/LLaMA2-7B/lora/dpo \
    --overwrite_cache \
    --overwrite_output_dir \
    --cutoff_len 1024 \
    --preprocessing_num_workers 16 \
    --per_device_train_batch_size 1 \
    --per_device_eval_batch_size 1 \
    --gradient_accumulation_steps 8 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --warmup_steps 20 \
    --save_steps 100 \
    --eval_steps 100 \
    --evaluation_strategy steps \
    --load_best_model_at_end \
    --learning_rate 1e-5 \
    --num_train_epochs 1.0 \
    --max_samples 1000 \
    --val_size 0.1 \
    --dpo_ftx 1.0 \
    --plot_loss \
    --fp16
Running it fails and DPO training cannot start. The error is:
ValueError: Can't find 'adapter_config.json' at '/path/to/FINETUNED_MODEL'
Should I just put FINETUNED_MODEL at --model_name_or_path and delete --adapter_name_or_path?
If you have merged the LoRA adapter into the base model, use --model_name_or_path only; otherwise, point --adapter_name_or_path to your SFT LoRA adapter.
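To make the two cases concrete, here is a rough sketch of how the command above would change. The paths /path/to/sft_lora_adapter and /path/to/merged_sft_model are placeholders, not paths from this issue, and the exact flag set should be checked against your LLaMA-Factory version:

# Case 1: the SFT LoRA adapter has NOT been merged — keep the base model and
# point --adapter_name_or_path at the adapter directory (it must contain
# adapter_config.json).
CUDA_VISIBLE_DEVICES=0 python ../../src/train_bash.py \
    --stage dpo \
    --do_train \
    --model_name_or_path BASE_MODEL \
    --adapter_name_or_path /path/to/sft_lora_adapter \
    --create_new_adapter \
    --finetuning_type lora
    # ...keep the remaining flags from the script above

# Case 2: the SFT LoRA adapter HAS already been merged into full model weights
# (e.g. via the repo's export/merge utility or PEFT's merge_and_unload) —
# pass the merged checkpoint directly and drop --adapter_name_or_path.
CUDA_VISIBLE_DEVICES=0 python ../../src/train_bash.py \
    --stage dpo \
    --do_train \
    --model_name_or_path /path/to/merged_sft_model \
    --create_new_adapter \
    --finetuning_type lora
    # ...keep the remaining flags from the script above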