Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why can llama understand Chinese so much and can't reply directly in Chinese? #17

Open
XksA-me opened this issue Jul 27, 2023 · 4 comments

Comments

@XksA-me
Copy link

XksA-me commented Jul 27, 2023

Why can't llama understand Chinese so much and can't reply directly in Chinese?

I tested Llama-2-7b-chat-hf again today.

Test using GPU platform: matpool.com
Memory usage:
Open 8BIT occupies 8G+, GPU utilization: 13-20%
If 8BIT is not enabled, it takes up 14G+, GPU utilization: 55-70%

image image image
@XksA-me XksA-me changed the title Why can't llama understand Chinese so much and can't reply directly in Chinese? Why can llama understand Chinese so much and can't reply directly in Chinese? Jul 27, 2023
@liltom-eth
Copy link
Owner

liltom-eth commented Jul 27, 2023

I believe this happened because of the system prompt. this is added to your chat text sending to llm. Try delete system prompt in UI or use Chinese system prompt.
The results might be better.
Screenshot 2023-07-27 at 1 11 24 AM

Lllama2 I believe not well supported on Chinese now since Chinese have more tokens.

I list a feature to support some Chinese llama2 models. Haven't got a chance to test it.

@XksA-me
Copy link
Author

XksA-me commented Jul 27, 2023

thanks for the reply. Useful, but limited.
image
image
Hope I can contribute for feature.

@liltom-eth
Copy link
Owner

@XksA-me do you want to contribute to this feature? Might need take a look at this repo Chinese-Llama-2-7b.

@liltom-eth
Copy link
Owner

liltom-eth commented Jul 29, 2023

@XksA-me welcome contributing your benchmark performance here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants