Reminder
Reproduction
Using `examples/lora_multi_gpu/single_node.sh` with some params updated as shown above:

- with the `bf16` flag, the loss is 0 and `grad_norm` is NaN;
- with the `fp16` flag, the SFT training succeeds.

This behavior seems to have appeared after v0.6.0. I was using commit 2e592be, a quite early one, which works just fine.
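As background (my addition, not from the report above): bf16 keeps float32's full exponent range but has only 8 significand bits, while fp16 has 11 significand bits but a much narrower range, so the two modes fail numerically in different ways. A stdlib-only sketch of the two roundings (the helper names `to_bf16`/`to_fp16` are illustrative, not LLaMA-Factory APIs):

```python
import struct

def to_bf16(x: float) -> float:
    """Round a float to bfloat16 (keep the top 16 bits of the float32
    encoding, round-to-nearest-even), then widen back for inspection."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    rounded = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", rounded))[0]

def to_fp16(x: float) -> float:
    """Round a float to IEEE 754 half precision via struct's 'e' format."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# fp16 underflows below ~6e-8; bf16 keeps float32's exponent range:
print(to_fp16(1e-8))  # 0.0
print(to_bf16(1e-8))  # still nonzero

# bf16 is much coarser near 1.0 (steps of 2^-7 vs fp16's 2^-10),
# so small per-step changes can round away entirely:
print(to_bf16(1.001))  # rounds back to 1.0
print(to_fp16(1.001))  # stays distinct from 1.0
```

This is only a numerical illustration of why one precision flag can misbehave while the other works; it does not pinpoint the regression in the training code itself.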
Expected behavior
Both bf16 and fp16 should work.
System Info
Ubuntu 22.04 with an H800, torch 2.1.2, transformers 4.38.2
Others
similar issues: #3344 #3308