marlin #189

flozi00 · 2024-01-18T08:52:11Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Was this discussed/approved via a GitHub issue or the Discord / Slack channel? Please add a link
to it if that's the case.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

flozi00 · 2024-01-19T14:27:56Z

Looks like I need help debugging this, @tgaddair.
The kernel is incompatible with the flash attention kernels.
An illegal memory access error occurs every time.

docker run --pull always -v ./data:/data --gpus all -d --shm-size 1g -p 8080:80 ghcr.io/predibase/lorax:marlin --model-id TheBloke/dolphin-2.6-mistral-7B-dpo-GPTQ --quantize marlin

flozi00 · 2024-01-23T20:44:03Z

@tgaddair On the Disco Research server I read a comment about the incompatibility with fused attention.
I have no idea whether they want to support it in the future or not.

I think without flash attention this feature would not make much sense, because of the much higher memory requirements for longer sequences.
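
To put a rough number on that memory concern, here is a back-of-the-envelope sketch rather than a measurement from this PR. It assumes naive attention materializes the full seq_len x seq_len score matrix per head in fp16, which flash attention avoids, and it assumes 32 attention heads, as in Mistral-7B. The helper name `attention_scores_bytes` is hypothetical.

```python
def attention_scores_bytes(seq_len, num_heads=32, batch_size=1, bytes_per_elem=2):
    """Bytes needed for one layer's full attention score matrix.

    Assumptions (not taken from this PR): 32 heads, fp16 scores (2 bytes),
    and a naive implementation that stores every seq_len x seq_len entry.
    """
    return batch_size * num_heads * seq_len * seq_len * bytes_per_elem


if __name__ == "__main__":
    # Memory grows quadratically: 4x the tokens costs 16x the bytes.
    for n in (2048, 8192, 32768):
        gib = attention_scores_bytes(n) / 2**30
        print(f"{n:>6} tokens: {gib:.2f} GiB per layer")
```

Under these assumptions the score matrix alone grows from 0.25 GiB at 2048 tokens to 4 GiB at 8192 tokens for a single layer and a single sequence.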

Will keep this PR as draft until it's compatible but won't work actively on it

tgaddair · 2024-01-23T21:50:18Z

Thanks @flozi00, we can hold off until that's supported then.

first try marlin

2047dfa

flozi00 linked an issue Jan 18, 2024 that may be closed by this pull request

marlin #188

Open

flozi00 added 11 commits January 19, 2024 09:23

marlin build

cf69a5c

introduce docker dev, fix marlin build

5f9e7d1

refactor marlin kernels

b9ccbde

docker stuff

7e64ed8

fix import

4e5478f

fix marlin for tests

0d21463

add cli

a42749b

make loading work

b4f3b10

marlin build

20d95a8

fix launcher

1a3b151

marlin cli

29d40d6

flozi00 marked this pull request as draft January 19, 2024 16:24

flozi00 and others added 3 commits January 19, 2024 17:35

make other models work

b122d94

fix property

ef37dd1

update kernel

0290eff

flozi00 closed this May 22, 2024
