
deepseek lora #91

Open
xionghao132 opened this issue Apr 22, 2024 · 2 comments

Comments

@xionghao132

# `instruction` is the tokenized "User: ..." prompt built earlier in the preprocessing function.
response = tokenizer(f"Assistant: {example['output']}<|end▁of▁sentence|>", add_special_tokens=False)
input_ids = instruction["input_ids"] + response["input_ids"] + [tokenizer.pad_token_id]
attention_mask = instruction["attention_mask"] + response["attention_mask"] + [1]  # the eos token also needs to be attended to, so append 1
labels = [-100] * len(instruction["input_ids"]) + response["input_ids"] + [tokenizer.pad_token_id]

I'd like to ask: is the <|end▁of▁sentence|> here redundant? A tokenizer.pad_token_id is appended right after it, and that id also represents <|end▁of▁sentence|>.
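For context, here is a minimal sketch of what the surrounding preprocessing function might look like. The function name `process_func`, the dataset fields `example['instruction']` / `example['input']`, the "User: ... Assistant: ..." prompt format, and the `max_length` truncation are assumptions following common LoRA fine-tuning tutorials, not something confirmed by this thread:

```python
def process_func(example, tokenizer, max_length=384):
    # Tokenize prompt and response separately so the prompt can be masked out of the loss.
    instruction = tokenizer(
        f"User: {example['instruction'] + example['input']}\n\n",
        add_special_tokens=False,
    )
    response = tokenizer(
        f"Assistant: {example['output']}<|end▁of▁sentence|>",
        add_special_tokens=False,
    )
    input_ids = instruction["input_ids"] + response["input_ids"] + [tokenizer.pad_token_id]
    attention_mask = instruction["attention_mask"] + response["attention_mask"] + [1]
    # -100 masks the prompt tokens, so the loss is computed only on the response.
    labels = [-100] * len(instruction["input_ids"]) + response["input_ids"] + [tokenizer.pad_token_id]
    return {
        "input_ids": input_ids[:max_length],
        "attention_mask": attention_mask[:max_length],
        "labels": labels[:max_length],
    }
```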

@KMnO4-zx
Contributor

This shouldn't really matter, should it? Both are the pad_token anyway.
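One way to verify this claim directly (a quick sketch; the checkpoint `deepseek-ai/deepseek-llm-7b-chat` is an assumption, swap in whichever model you are fine-tuning):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-llm-7b-chat")
# Common setup in fine-tuning scripts when no pad token is defined in the config.
if tokenizer.pad_token_id is None:
    tokenizer.pad_token = tokenizer.eos_token

eos_id = tokenizer.convert_tokens_to_ids("<|end▁of▁sentence|>")
print(tokenizer.pad_token_id, eos_id)
# If the two ids match, the explicit <|end▁of▁sentence|> in the prompt string
# and the appended pad_token_id encode the same token.
```

If the ids do match, the only effect of keeping both is that `input_ids` and `labels` end with two identical eos/pad ids instead of one, which is why it makes no practical difference.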

@xionghao132
Author

Got it, thanks for the explanation!
