rate_limit_error (429) on paid account for only a dozen async requests #496

baogorek · 2024-05-14T11:47:40Z

The async example below works if you cut the number of messages down to, say, 4. I've even had it work with 6. But with 12 async calls to the Haiku model, I get a 429 error:

RateLimitError: Error code: 429 - {'type': 'error', 'error': {'type': 'rate_limit_error', 'message': 'Number of concurrent connections has exceeded your rate limit. Please try again later or contact sales at https://www.anthropic.com/contact-sales to discuss your options for a rate limit increase.'}}

I'm on a paid plan and know my limit is at least 1,000 calls per minute. (It might even be 4,000.)

This has been replicated with another user's account. I am not especially good at async programming, but I believe I should be able to send out as many queries as my limit allows.

import asyncio
import os

from anthropic import AsyncAnthropic

anthropicAsync = AsyncAnthropic(
    api_key=os.environ["ANTHROPIC_API_KEY"],  # your API key, read from the environment
)

async def send_message(content):
    response = await anthropicAsync.messages.create(
        model="claude-3-haiku-20240307",
        max_tokens=300,
        messages=[{"role": "user", "content": content}]
    )
    return response

async def runAsyncLLM():
    message1 = "How does a court case get to the Supreme Court?"
    message2 = "What is the role of a Supreme Court justice?"

    responses = await asyncio.gather(
        send_message(message1),
        send_message(message2),
        send_message(message2),
        send_message(message2),
        send_message(message2),
        send_message(message2),
        send_message(message1),
        send_message(message2),
        send_message(message2),
        send_message(message2),
        send_message(message2),
        send_message(message2),
    )
    return responses

responses = asyncio.run(runAsyncLLM())

print(responses[0].content[0].text.strip())
print("\n---------------\n")
print(responses[1].content[0].text.strip())


rattrayalex · 2024-05-25T22:16:06Z

Anthropic has many rate limits in place, not just calls per minute. This is standard in APIs. This particular limit applies specifically to the number of concurrent requests. You are correct that sending fewer requests at once will resolve the problem.

baogorek · 2024-05-27T01:29:50Z

Thanks for the clarification. Just note that this has failed with as few as 4 requests sent out asynchronously. Nothing like it happens with the OpenAI equivalent. I would suggest some sort of warning, somewhere.

baogorek changed the title ~~rate_limit_error (429) on paid paid for only a dozen async requests~~ rate_limit_error (429) on paid account for only a dozen async requests May 14, 2024

rattrayalex closed this as not planned May 25, 2024
