
Sync Fork #206 (Draft)

mikecovlee wants to merge 170 commits into base: mikecovlee_dev
Conversation

mikecovlee (Member)

No description provided.

* support classification tasks, add glue config example, support generate without cache

* replace sts-b with mrpc, try fix error

* add mrpc eval

* separate tasks

* replace len(tensor) with shape

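For context, a minimal illustration of the difference this commit targets (the shapes here are made up): `len()` on a tensor only reports the size of the first dimension, so reading `.shape` states the intended dimension explicitly.

```python
import torch

batch = torch.zeros(4, 128, 768)  # (batch_size, seq_len, hidden_dim)

# len(tensor) only reports the first dimension, which is easy to misread:
assert len(batch) == 4

# reading .shape makes the intended dimension explicit:
batch_size, seq_len, _ = batch.shape
assert (batch_size, seq_len) == (4, 128)
```
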
* fix issue

* add evaluate

* fix CausalLM

* support lr scheduler and other stuff

* rearrange codes and APIs

* update requirements

* update LLMModel

* update pyproject

* fix lint error

* fix dataloader of glue tasks

* fix pytest

* now can run train

* fix inference

* fix evaluate and lint error

* fix mixlora configs

* fix error, add hint of max tokens len

* add mmlu

* fix gradient accumulation

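The commit message doesn't say what was broken, but for reference this is the standard gradient-accumulation pattern (toy model and data, not the project's actual trainer); the classic bugs are forgetting to scale the loss or forgetting to reset gradients after the step.

```python
import torch

model = torch.nn.Linear(16, 2)                 # stand-in for the real model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
loader = [(torch.randn(4, 16), torch.randint(0, 2, (4,))) for _ in range(16)]

accum_steps = 8
optimizer.zero_grad()
for step, (x, y) in enumerate(loader):
    loss = torch.nn.functional.cross_entropy(model(x), y)
    # scale before backward so the accumulated gradient averages micro-batches
    (loss / accum_steps).backward()
    if (step + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```
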
* add integration tests

* remove print

* fix bug

* update auto eval

* support config as json

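A minimal sketch of what "config as JSON" typically amounts to (the file name and keys here are hypothetical, not the project's actual schema):

```python
import json

# file name and keys are hypothetical
with open("mixlora_config.json") as f:
    config = json.load(f)

cutoff_len = config.get("cutoff_len", 512)  # fall back to a default
```
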
* add categories

* support max sequence length

* update to match original implementation

* fix model output precision

* rearrange codes

* remove useless assert

* fix inconsistent behaviours for llama-2-hf models

* read max seq len automatically

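One plausible way to read the maximum sequence length automatically via the Hugging Face transformers API, assuming the value is taken from the model config rather than a CLI flag (the checkpoint name is just an example):

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained("meta-llama/Llama-2-7b-hf")
# LLaMA-family configs expose the trained context window here:
max_seq_len = config.max_position_embeddings  # 4096 for Llama-2
```
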
* move rope angles into each layer to avoid useless arguments

* remove cache

* add requires_grad

* fix trainer bug

* reduce cutoff_len

* remove regression tasks

* make lint happy
* Integrate evaluation methods and support for LLaMA-compatible models

* performance: add batch lora function
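
A hedged sketch of what a "batch lora function" could look like; one common design routes each batch row through its own adapter in a single forward pass (all names and shapes here are assumptions, not the project's API):

```python
import torch

def batch_lora(x, weight, loras, adapter_idx, scaling=2.0):
    # x: (batch, seq_len, in_dim); weight: (out_dim, in_dim), frozen base.
    # loras: list of (A, B) with A: (rank, in_dim), B: (out_dim, rank).
    # adapter_idx[i] selects which adapter handles batch row i.
    out = x @ weight.T
    for j, (a, b) in enumerate(loras):
        rows = adapter_idx == j
        if rows.any():
            out[rows] += scaling * ((x[rows] @ a.T) @ b.T)
    return out
```
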
* fix ci scripts

* update ci script

* remove setup python

* remove torch specific version

* change to local image

* update

* fix image

* fix flash attn usage on rtx20 cards
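
FlashAttention-2 only ships kernels for Ampere (sm80) and newer, so Turing-era RTX 20 cards need a fallback; a plausible guard looks like this (the "eager" fallback name is an assumption):

```python
import torch

def flash_attention_supported() -> bool:
    # FlashAttention-2 kernels require compute capability >= 8.0 (Ampere);
    # RTX 20-series cards are Turing (7.5), so they need a fallback path.
    if not torch.cuda.is_available():
        return False
    major, _ = torch.cuda.get_device_capability()
    return major >= 8

attn_impl = "flash_attn" if flash_attention_supported() else "eager"
```
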
* support phi

* rename

* rename

* rename

* refactor entire framework

* fix inference

* fix bug

* fix phi model

* fix compatibility with phi model

* fix llama

* support qwen2 and mistral

* support google gemma model

* fix llama flash attention

* support flash attention for phi models

* fix gemma model

* support xformers attention for phi models

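For reference, the core xformers call that such support typically wraps (shapes chosen to resemble phi-2's 32 heads with head_dim 80; this is not the project's exact integration):

```python
import torch
import xformers.ops as xops

# xformers expects (batch, seq_len, num_heads, head_dim)
q = torch.randn(2, 128, 32, 80, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

out = xops.memory_efficient_attention(
    q, k, v, attn_bias=xops.LowerTriangularMask()  # causal mask
)
```
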
* add phi dummy example

* fix phi mixlora

* improve efficiency

* update docs

* fix router_profile

* rearrange codes

* fix bugs

* replace deprecated apis

* fix launcher of mlora

* support cpu as backend

* update README

* read hidden act from configuration

* fix lint error

* replace encode with official impl

* update ci script

* fix ci script

* add device constraint

* fix ci script

* fix ci script

* add QuestionAnswerTask to global namespaces

* fix mixlora ffn act_fn

* fix config

* fix docs

* support MoRAL (without load balance)

* add intermediate_size

* support hellaswag dataset

* support WinoGrande

* support SIQA

* fix lint error

* improve efficiency of mixlora

* ignore adapters

* replace checkpoint impl with torch

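"Replace checkpoint impl with torch" presumably means delegating to torch.utils.checkpoint rather than a hand-rolled version; a minimal sketch:

```python
import torch
from torch.utils.checkpoint import checkpoint

layer = torch.nn.Linear(64, 64)
x = torch.randn(4, 64, requires_grad=True)

# intermediate activations inside `layer` are recomputed during backward
# instead of being stored, trading compute for memory:
y = checkpoint(layer, x, use_reentrant=False)
y.sum().backward()
```
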
* support router outputs from tail

* fix router loss

* fix model compatibility

* fix bugs

* fix lint error

* fix bugs

* fix ci script

* fix ci script

* update docs

* support medical qa

* use long context when evaluating