Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you please showcase one end-to-end example to run a model over Kubernetes cluster (AKS, etc) with one RAG implementation please? #206

Open
ajmal-yazdani opened this issue Dec 19, 2023 · 7 comments

Comments

@ajmal-yazdani
Copy link

I can download the model locally and can write a simple chat application.

But question is how we can run this model over Kubernetes cluster and run the RAG application.

Could you please guide with some sample?

@doberst
Copy link
Contributor

doberst commented Dec 19, 2023

Great feedback - and definitely something on our near-term roadmap. Will come back to you soon. Also, let us know if you would be interested potentially to collaborate with us on building up the Kubernetes script around this use case.

@chair300
Copy link
Contributor

@ajmal-yazdani This is a great suggestion. I have been working on a websocket chat agent already. While its not ready for checkin and/or demo yet it will be coming soon.

The secondary point about sample kubernetes code to deploy and run RAG is very interesting. Some of the kubernetes code is dependent of the specific use case and model since the model is embedded within the application. I am happy to put together very simple kubernetes deployment code bases on docker containers (I need to build the containers first).

I hope to have the kubernetes code by mid January, with the docker containers built and available before then.

@ajmal-yazdani
Copy link
Author

Great feedback - and definitely something on our near-term roadmap. Will come back to you soon. Also, let us know if you would be interested potentially to collaborate with us on building up the Kubernetes script around this use case.

Something for sure I can try on my AKS cluster or local docker.

@ajmal-yazdani
Copy link
Author

Thank you very much!

@chair300
Copy link
Contributor

I just updated the docker image which can run llmware examples in a aks cluster or local cluster. There is a docker-compose file which provides the extra database infrastructure. You will still need to setup the env's but its a start.

@chair300
Copy link
Contributor

I have a pull request outstanding which will allow you to play with a full docker-compose full service infrastructure. Please see my pull request for the update.

@chair300
Copy link
Contributor

with docker-compose from the devcontainer folder you can run the following command and then all the infrastructure will be up. Docker exec -it into the llmware container and run the examples. Its all set and easy. Please let me know if you have any further questions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants