-
Notifications
You must be signed in to change notification settings - Fork 715
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: support WASI-NN streaming extension for NNRPC #3319
Comments
Hi @hydai, I'm a student at the University of Texas at Austin. I was interested in working on this issue as a part of a project in my virtualization class (CS 360V). I've got the necessary background on LLMs but am new to WasmEdge – do you think it would make sense for me to work on this? Thanks! |
@arvganesh Please have a look at Akihiro Suda's talk on NNRPC here (from 2:08) https://youtu.be/D0D8ufWtILI?si=nTF5nklApkmO7cCu&t=128 If you can build and run NNRPC from source, then you are ready for this. :) |
Would you be able to assign this to me? Thanks! |
Done. Look forward to your contributions! |
Hi @juntao, @hydai! I wanted to outline my understanding of this issue / WasmEdge, let me know if this makes sense before I begin working on it: Background:
What I need to do:
Questions I have:
Thanks for your help! I appreciate it. |
The series of
It allows you to call the host function directly. So, you can have a better way to interact with the host function with this simplified interface. |
Hi @hydai! I've opened a draft PR adding the features discussed and keep failing this build check. Unfortunately, the only error message is These are the commands I'm using to build and run tests on MacOS. This set of commands runs tests, just not the ones for WasiNN.
Do you have any thoughts on what I'm doing wrong here? |
Please follow the same command running in the CI to reproduce the issue. |
There is an error message:
You should check if there is anything related to the files / any casting issue. |
Summary
WasmEdge forked the WASI-NN spec with the following extension for a better LLM experience.
Three new functions need to be supported:
Details
Make the NNRPC support streaming LLM mode.
Appendix
The text was updated successfully, but these errors were encountered: