Could you please showcase one end-to-end example of running a model on a Kubernetes cluster (AKS, etc.) with a RAG implementation? #206
Comments
Great feedback, and definitely something on our near-term roadmap. We will come back to you soon. Also, let us know if you would potentially be interested in collaborating with us on building the Kubernetes scripts around this use case.
@ajmal-yazdani This is a great suggestion. I have already been working on a WebSocket chat agent. While it's not ready for check-in or demo yet, it will be coming soon. The secondary point about sample Kubernetes code to deploy and run RAG is very interesting. Some of the Kubernetes code depends on the specific use case and model, since the model is embedded within the application. I am happy to put together a very simple Kubernetes deployment based on Docker containers (I need to build the containers first). I hope to have the Kubernetes code by mid-January, with the Docker containers built and available before then.
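For context, a "very simple Kubernetes deployment based on Docker containers" could look like the sketch below. The image name, registry, and ports are all placeholders I made up for illustration; no such artifacts are confirmed anywhere in this thread.

```shell
# Hypothetical sketch of deploying a containerized RAG application to AKS.
# "myregistry.azurecr.io/rag-app:latest" and the ports are placeholders.

# Create a Deployment from a pre-built application image.
kubectl create deployment rag-app --image=myregistry.azurecr.io/rag-app:latest

# Expose it inside the cluster so other services (or an ingress) can reach it.
kubectl expose deployment rag-app --port=80 --target-port=8000

# Scale out once the model-serving pod is healthy.
kubectl scale deployment rag-app --replicas=2
```

Since the model is embedded in the application image, scaling replicas also replicates the model, which is why the maintainer notes the Kubernetes code depends on the specific use case and model.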
That is definitely something I can try on my AKS cluster or in local Docker.
Thank you very much!
I just updated the Docker image, which can run llmware examples in an AKS cluster or a local cluster. There is a docker-compose file which provides the extra database infrastructure. You will still need to set up the environment variables, but it's a start.
I have a pull request outstanding which will let you experiment with a full docker-compose service infrastructure. Please see my pull request for the update.
With docker-compose from the devcontainer folder, you can run a single command and all the infrastructure will be up. Then `docker exec -it` into the llmware container and run the examples. It's all set up and easy. Please let me know if you have any further questions.
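The workflow described above can be sketched roughly as follows. The folder name, container name, and example path are assumptions on my part, not confirmed by the thread; check `docker compose ps` for the actual container name in your checkout.

```shell
# Hedged sketch of the docker-compose workflow described in the comment.
# Folder and container names below are assumptions, not confirmed project names.

# From the devcontainer folder, bring up the full service infrastructure
# (the llmware container plus its database dependencies) in the background.
cd .devcontainer
docker compose up -d

# Open an interactive shell inside the running llmware container.
docker exec -it llmware /bin/bash

# Inside the container, run one of the examples, e.g.:
# python examples/...
```

This local docker-compose setup is the stepping stone toward the AKS deployment: the same containers can later be pushed to a registry and referenced from Kubernetes manifests.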
I can download the model locally and write a simple chat application.
But the question is how we can run this model on a Kubernetes cluster and run the RAG application.
Could you please guide with some sample?