New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] ADD Support DBRX #621
Comments
Yes please! |
If possible please test ^ |
Thank you, i will test on 4*A800-80GB |
I will test it too, thank you |
@maziyarpanahi Please help me test and validate the quality of the marlin 4bit dbrx-base at https://huggingface.co/LnL-AI/dbrx-base-converted-v2-4bit-gptq-marlin and let me if you are getting coherent responses. Note the loading time is quite long. |
Hi @Qubitium |
@maziyarpanahi Thanks! non-marlin version is currently uploading and should be finished upload in ~60 minutes: https://huggingface.co/LnL-AI/dbrx-base-converted-v2-4bit-gptq-gptq |
Perfect! I'll pull and build from the PR then I'll test both of them with some of my samples. Thank you |
@maziyarpanahi Two quants I sent may have severe quality issues due to quant calibration. Already started 2 new quants. |
@maziyarpanahi Please test the following 2 (marlin+non-marlin) quants instead. The previous quants had calibration issues. |
My review of
As you can see, it followed the 3 bullet points, it is pretty coherent, it just didn't stop at Overall, for a work in progress I really like it! I'll try to test the second model with |
@maziyarpanahi You can try turboderp/exllamav2#388 (comment) |
I'll try to add those to the tokenizer config, but apart from the stop, the quality of the response is solid |
Hey all, we recently updated the official HF Hub models E.g: https://huggingface.co/databricks/dbrx-instruct/blob/main/tokenizer_config.json |
Hi, let me have a look next week. |
Is your feature request related to a problem? Please describe.
DBRX Instruct is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. DBRX Instruct specializes in few-turn interactions.
Describe the solution you'd like
A clear and concise description of what you want to happen.
https://huggingface.co/databricks/dbrx-instruct
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: