
LoRA: Extract small function #614

Open · wants to merge 6 commits into main
Conversation

madroidmaq
Contributor

Extract the load_with_lora function for easy use in other files.

@mzbac
Contributor

mzbac commented Mar 25, 2024

Maybe we should consider moving the function out of utils.py. It has become too large now; might it be better to create a new file named lora_utils.py and place the function there?

@madroidmaq
Contributor Author

@mzbac Great suggestion. Initially, I tried placing the load_with_lora function in both tuner/lora.py and tuner/utils.py, but ran into a circular dependency. When I attempted to resolve it, I found that doing so required significant changes, so I ended up putting the function in utils.py.

I will reassess the extent of changes needed to solve the circular dependency problem. The worst-case scenario would be creating a new file named tuner/lora_utils.py, which I will try my best to avoid.

@madroidmaq
Contributor Author

The main reason for the circular dependency is the reliance on the load() function in utils.py, which in turn depends on the linear_to_lora_layers function in tuner/utils.py.

I have now removed the logic that actively loads the model inside the function; it instead accepts an already loaded model. This avoids the circular import. The trade-off is that callers must first load the model and then call the prepare_for_training function, which I think is acceptable.
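The refactor described above can be sketched as follows. This is a minimal stand-in, not the actual mlx-lm code: the class bodies are placeholders, and only the name linear_to_lora_layers and the idea of accepting a pre-loaded model come from the discussion; the helper name prepare_model here is illustrative.

```python
class Linear:
    """Stand-in for a plain dense layer."""
    def __init__(self, name):
        self.name = name


class LoRALinear:
    """Stand-in for a LoRA-wrapped layer."""
    def __init__(self, base):
        self.base = base


class Model:
    """Minimal model holding named layers (placeholder for the real model)."""
    def __init__(self):
        self.layers = {"q_proj": Linear("q_proj"), "v_proj": Linear("v_proj")}
        self.frozen = False

    def freeze(self):
        self.frozen = True


def linear_to_lora_layers(model, keys):
    # Replace the selected linear layers with LoRA wrappers.
    for k in keys:
        model.layers[k] = LoRALinear(model.layers[k])


def prepare_model(model, lora_keys=("q_proj", "v_proj")):
    # Accept an already-loaded model instead of loading it here.
    # Because this function no longer calls load(), tuner/utils.py
    # no longer needs to import from utils.py, breaking the cycle.
    model.freeze()
    linear_to_lora_layers(model, lora_keys)
    return model


# Usage: load the model elsewhere, then prepare it for training.
model = prepare_model(Model())
```

The key design point is dependency inversion: the tuner-side helper takes the model as an argument rather than owning the loading step, so the import graph flows in one direction only.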

@madroidmaq madroidmaq changed the title LoRA: Extract load_with_lora function LoRA: Extract prepare_for_training function Mar 27, 2024
@awni
Member

awni commented May 3, 2024

@madroidmaq sorry for the delay! I think we can merge this. But could you first rebase to resolve conflicts?

@madroidmaq
Contributor Author

madroidmaq commented May 5, 2024

> @madroidmaq sorry for the delay! I think we can merge this. But could you first rebase to resolve conflicts?

@awni It's done.

@madroidmaq madroidmaq changed the title LoRA: Extract prepare_for_training function LoRA: Extract pre_processing_model function May 5, 2024
@madroidmaq madroidmaq changed the title LoRA: Extract pre_processing_model function LoRA: Extract small function May 5, 2024
Member

@awni awni left a comment


Thanks!
