OOM while running sample.py #87

daeunni · 2024-01-23T08:17:41Z

Hi, I have OOM problem while running the code below.
I tried to run sample.py in the finetune_gen.sh.

CUDA_VISIBLE_DEVICES=7 python -u sample.py \
        --n_samples 10 \
        --n_iter 20 \
        --scale 6 \
        --ddim_steps 50  \
        --ckpt ${ARRAY[5]} \
        --ddim_eta 1. \
        --outdir "${ARRAY[2]}" \
        --prompt "photo of a ${ARRAY[0]}"

Anyone can help me? ;)

  File "/d1/daeun/custom-diffusion/stable-diffusion/ldm/modules/attention.py", line 258, in forward
    x = block(x, context=context)
  File "/d1/daeun/anaconda3/env/diff/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/d1/daeun/custom-diffusion/stable-diffusion/ldm/modules/attention.py", line 209, in forward
    return checkpoint(self._forward, (x, context), self.parameters(), self.checkpoint)
  File "/d1/daeun/custom-diffusion/stable-diffusion/ldm/modules/diffusionmodules/util.py", line 116, in checkpoint
    return func(*inputs)
  File "/d1/daeun/custom-diffusion/stable-diffusion/ldm/modules/attention.py", line 212, in _forward
    x = self.attn1(self.norm1(x)) + x
  File "/d1/daeun/anaconda3/env/diff/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/d1/daeun/custom-diffusion/src/model.py", line 165, in new_forward
    sim = einsum('b i d, b j d -> b i j', q, k) * self.scale
RuntimeError: CUDA out of memory. Tried to allocate 5.00 GiB (GPU 0; 11.91 GiB total capacity; 9.39 GiB already allocated; 848.94 MiB free; 10.24 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

The text was updated successfully, but these errors were encountered:

daeunni · 2024-01-23T08:23:16Z

I am using A5000, 23G GPU..
Is there solution to run this code?

A-raniy-day · 2024-01-29T06:03:13Z

same problem.

A-raniy-day · 2024-01-29T06:10:01Z

Changing the parser of "H" and "W" to smaller would help.

nupurkmr9 · 2024-01-30T15:58:42Z

Hi, you can try reducing --n_samples 10 to 5 or less.

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OOM while running sample.py #87

OOM while running sample.py #87

daeunni commented Jan 23, 2024 •

edited

daeunni commented Jan 23, 2024 •

edited

A-raniy-day commented Jan 29, 2024

A-raniy-day commented Jan 29, 2024

nupurkmr9 commented Jan 30, 2024

OOM while running sample.py #87

OOM while running sample.py #87

Comments

daeunni commented Jan 23, 2024 • edited

daeunni commented Jan 23, 2024 • edited

A-raniy-day commented Jan 29, 2024

A-raniy-day commented Jan 29, 2024

nupurkmr9 commented Jan 30, 2024

daeunni commented Jan 23, 2024 •

edited

daeunni commented Jan 23, 2024 •

edited