Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error during Training RuntimeError: mat1 and mat2 shapes cannot be multiplied (511x2 and 3x4096) #25

Open
kasakh opened this issue Mar 26, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@kasakh
Copy link

kasakh commented Mar 26, 2023

I followed the installation steps and used the example-data-maya-wiki.txt dataset as a finetuning examples. I keep getting this error towards the end of training. I tried with other datasets as well still the same error persists

output += torch.matmul(subA, state.subB)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (511x2 and 3x4096)

Can you please help me resolve it? Thanks in advance.

@kasakh
Copy link
Author

kasakh commented Mar 28, 2023

It seems that this only happens when I try using Multi-gpu set up. On Single Gpu, it works fine.

@lxe
Copy link
Owner

lxe commented Mar 28, 2023

Yeah, I basically hardcoded it for single GPU. The tloen/alpaca-lora code has multi-gpu support.

@lxe lxe added the bug Something isn't working label Mar 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants