Support local LLMs such as CodeLLaMa (llama-2 based) #26
Conversation
Hi! Thanks a lot for this PR! Let me try to run codellama-2 locally, if it's not too complicated, to see if I can test the code and make it work!
You are welcome! It takes a fair amount of resources (I am running it on an A100 GPU). You could try oobabooga in llama.cpp mode (read: on CPUs or mixed GPU+CPU) in case you don't have the resources. Note that codellama sometimes does not return proper JSON yet; that is something which could probably be addressed with tools like LangChain's output parsers.
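On the malformed-JSON point: a lightweight alternative to pulling in LangChain's output parsers is a best-effort extraction step before parsing. The sketch below is illustrative only (the function name and heuristics are not part of this PR); it assumes the model's answer contains one JSON object, possibly wrapped in prose or Markdown fences.

```python
import json
import re

def extract_json(text):
    """Best-effort extraction of the first JSON object in an LLM reply.

    Local models such as CodeLlama sometimes wrap their JSON in prose or
    code fences; this strips that noise before parsing. Hypothetical
    helper, not code from this PR.
    """
    # Drop Markdown code fences the model may add around its answer.
    text = re.sub(r"```(?:json)?", "", text)
    # Take the outermost {...} span and try to parse it.
    start = text.find("{")
    end = text.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in model output")
    return json.loads(text[start:end + 1])

reply = 'Sure! Here is the result:\n```json\n{"comment": "decrypts the buffer"}\n```'
print(extract_json(reply))  # {'comment': 'decrypts the buffer'}
```

This stays forgiving about leading chatter while still failing loudly when no object is present, which makes retry logic easy to bolt on.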
Hi! So, I've started playing with the code to check whether the UI works, etc. It seems to be good, but I'm just trying to send a request with the default config value (
Let me check... it might be that you discovered a developer hack that was still in there.
How can this be adapted to https://huggingface.co/Phind/Phind-CodeLlama-34B-v2?
Tested. It works! Thanks!
Hi!
I contributed a small set of patches so that Gepetto can also access locally running LLMs such as CodeLlama (or see here).
Please ping me in case you want to test it; I can demo it if needed.
Have a good week!
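For anyone wanting to try this against a local backend: tools like oobabooga's text-generation-webui expose an OpenAI-compatible chat endpoint when the API is enabled. The sketch below shows the general shape of such a request; the URL, port, and model name are assumptions for a typical local setup, not values taken from this PR.

```python
import json
import urllib.request

# Hypothetical local endpoint; adjust host/port to your own setup.
API_URL = "http://127.0.0.1:5000/v1/chat/completions"

def build_request(prompt, model="codellama"):
    """Build the JSON payload for an OpenAI-compatible chat endpoint.

    The model name is a placeholder for whatever model the local
    server has loaded.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }

def query_local_llm(prompt):
    """Send the prompt to the local server and return the reply text."""
    data = json.dumps(build_request(prompt)).encode()
    req = urllib.request.Request(
        API_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Because the wire format matches OpenAI's chat completions API, a plugin that already speaks that protocol mostly just needs the base URL made configurable.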