Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Running install.sh after git clone requires over 200GB Ram #732

Open
weicheng59 opened this issue Apr 2, 2024 · 6 comments
Open

Running install.sh after git clone requires over 200GB Ram #732

weicheng59 opened this issue Apr 2, 2024 · 6 comments

Comments

@weicheng59
Copy link

I have a server with 128GB ram and it will be freeze when I follow the quick start procedure. On another server with 512GB ram it's fine. I think adding this warning in read.me could be helpful for other with limited ram.
Also, when I follow along the quick start procedure. I noticed I need to do

pip install packaging
pip install torch

manually before install.sh can be successfully run.

@weicheng59
Copy link
Author

The ram spike happaned in this stage before collecting numpy and other packages.

(lmflow) ai@server:~/llm/LMFlow$ bash install.sh
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Obtaining file:///home/asus/llm/LMFlow
  Preparing metadata (setup.py) ... done

@research4pan
Copy link
Contributor

Thanks for reporting this issue! We will update the README accordingly to help others. Thanks for this meaningful contribution 👍

@weicheng59
Copy link
Author

there seemed to be some confusion. I faced two issues.

  1. You need to do
pip install packaging
pip install torch

no matter you have limited ram or not.

  1. System with less than 200 GB ram can not use current install.sh to set up the environment.
    In my case, if you have 128 GB ram, you just can not run the install.sh successfully. Your sever will freeze and you will notice ram is all used up. I guess after some really long hard disk swap (i waited for a hour but still not finish so i just restart the server and try it on a bigger ram machine), it can be installed but its not a good experience.

@research4pan
Copy link
Contributor

Thanks for sharing more information! Let us update the document and dependency to reflect the change. Meanwhile, it can also be relevant to system versions and the local environment. Previously we have installed lmflow in Google Colab (with RAM 80G), which does not seem to have issues.

It would be greatly appreciated if you could share more information about the 128 GB system, and if possible, test if a similar problem occurs with other pip mirror sites. Thanks for your meaningful contribution!

research4pan added a commit that referenced this issue Apr 2, 2024
@yingxin-chen
Copy link

Hello, excuse me, when I use the “bash install.sh" command with 96GB RAM, the server is stuck, why?
(lmflow) root@VM-0-4-ubuntu:/home/ubuntu/LMFlow# bash install.sh Looking in indexes: http://mirrors.cloud.tencent.com/pypi/simple Obtaining file:///home/ubuntu/LMFlow Preparing metadata (setup.py) ...

@research4pan
Copy link
Contributor

Thanks for your interest in LMFlow! It is possible that some dependency installation requires local compilation, which may consume a lot of memory. To check which package stuck the process, you may run pip install -e . or pip install -r requirements.txt to check. Thanks very much 😄

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants