Update pip package RWKV.model and v2/chat.py to support LoRA #82

Open
RafaRed wants to merge 1 commit into main
Conversation

@RafaRed commented Apr 5, 2023

Update the pip package RWKV.model and v2/chat.py to support LoRA, based on the Blealtan/RWKV-LM-LoRA implementation.

Note that the pip package needs to be uploaded again with the new changes for this to work.

@KerfuffleV2 (Contributor)

Won't this break everything that uses the rwkv pip package? It's trying to access attributes on lora. Why not make it a keyword argument that defaults to None, so that existing code keeps working?
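
Something along these lines, as a rough sketch only (the `lora` argument name, the simplified `__init__` signature, and the `load_lora` helper are illustrative, not the package's actual API):

```python
# Rough sketch: keep LoRA opt-in via a keyword argument that defaults to None,
# so existing callers of RWKV(model, strategy) keep working unchanged.
class RWKV:
    def __init__(self, model, strategy, lora=None):
        self.model_path = model
        self.strategy = strategy
        self.lora = lora
        # ... existing weight-loading code goes here ...
        if lora is not None:
            # Only touch LoRA-related attributes when a LoRA path was given.
            self.load_lora(lora)

    def load_lora(self, lora_path):
        # Placeholder for the actual merge logic from this PR.
        ...
```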

Also, what happens if the model is "preconverted"? I'm assuming it won't work correctly if that conversion has already happened, so you'd probably want to check whether the model is preconverted before doing the LoRA part.

@RafaRed (Author) commented Apr 8, 2023

@KerfuffleV2 You're right about the attributes; I need to change them to be optional.
As for preconverted models, I don't know; I haven't run any tests on that yet.

@KerfuffleV2 (Contributor)

> As for preconverted models, I don't know; I haven't run any tests on that yet.

I'm almost certain it couldn't work. Those tensors may be in a different format like u8. They will also have some additional fields like mx, my, rx, ry (maximums, minimums related to quantization).

So it may be possible to support that, but I'm pretty positive it would need special handling. Probably the easiest approach at first is to just raise an exception if LoRA is specified and the model file has been preconverted.
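
Something like this, as a sketch only (how a pre-converted file is actually detected would need checking against the real loader; the uint8/extra-key heuristic below is just an assumption):

```python
import torch

def assert_lora_compatible(state_dict, lora_path):
    # Refuse to apply LoRA when the checkpoint looks pre-converted.
    # Detecting conversion via uint8 tensors or extra quantization keys is an
    # assumption about the converted format, not the package's real check.
    if lora_path is None:
        return
    preconverted = any(
        torch.is_tensor(v) and v.dtype == torch.uint8
        for v in state_dict.values()
    ) or any(k.endswith(("_mx", "_my", "_rx", "_ry")) for k in state_dict)
    if preconverted:
        raise ValueError(
            "LoRA cannot be applied to a pre-converted model file; "
            "please load the original (unconverted) checkpoint."
        )
```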

@Blealtan (Contributor) commented Apr 9, 2023

IMO the LoRA merging should happen before the conversion, since too many things happen while converting the model (including the xy quantizing, etc.). +1 on disallowing LoRA when a pre-converted model is specified.
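
Conceptually the merge just folds the low-rank update into the base weights while they are still fp16/fp32, i.e. before the strategy conversion or quantization runs. A rough sketch (the `.lora_A`/`.lora_B` key names and the `alpha / rank` scaling follow the common LoRA convention and may not match Blealtan/RWKV-LM-LoRA's exact naming):

```python
import torch

def merge_lora(base_sd, lora_sd, alpha, rank):
    # Fold W' = W + (alpha / rank) * B @ A into the unconverted weights, so the
    # usual conversion / quantization step only ever sees plain merged tensors.
    scale = alpha / rank
    for key, weight in base_sd.items():
        a_key = key.replace(".weight", ".lora_A")
        b_key = key.replace(".weight", ".lora_B")
        if a_key in lora_sd and b_key in lora_sd:
            base_sd[key] = weight + scale * (lora_sd[b_key] @ lora_sd[a_key])
    return base_sd
```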
