I am trying to run llama-7b-relu.q4.powerinfer.gguf with the following command: PowerInfer/build/bin/main -m ReluLLaMA-7B-PowerInfer-GGUF/llama-7b-relu.q4.powerinfer.gguf -n 128 -t 8 -p "$PROMPT" -c 1000 --n-gpu-layers 20
The prompt size is ~800 tokens. When I run this I get:
not enough space in the buffer (needed 1409024, largest block available 1389584)
GGML_ASSERT: /home/user_name/PowerInfer/ggml-alloc.c:116: !"not enough space in the buffer"
[Thread debugging using libthread_db enabled]
When I use a small prompt (10-20 tokens), it works as expected. Any ideas on how to solve this?
Hi @RachelShalom, can you test it again on the latest main branch? I have tested the same model with a prompt of about 1.9K tokens and a 2048-token context window, and it looks good.
If the error persists, would you mind posting the entire error log? It will help us pinpoint the root cause.
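For anyone hitting the same assertion, the update-and-rebuild steps the maintainer suggests might look like the following. This is a sketch that assumes a standard CMake build of PowerInfer in a `build/` directory; adjust paths and CMake flags (e.g. CUDA options) to match your original build.

```shell
# Pull the latest main branch and rebuild PowerInfer
cd PowerInfer
git pull origin main
cmake -B build
cmake --build build --config Release -j

# Retry with a larger context window (-c), since the maintainer's
# working test used 2048; the buffer that overflowed appears to be
# sized relative to the context.
./build/bin/main -m ../ReluLLaMA-7B-PowerInfer-GGUF/llama-7b-relu.q4.powerinfer.gguf \
  -n 128 -t 8 -p "$PROMPT" -c 2048 --n-gpu-layers 20
```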