info Open to new work opportunities! Contact me
Daniel Hladik AI Automation Engineer

← All terms

Rate Limit

The maximum number of API requests that can be sent in a given time period before the server temporarily suspends access.

What is a rate limit?

A rate limit is the maximum number of requests that can be sent to an API within a given time period - most commonly per minute or per day. If you exceed the limit, the API returns a 429 Too Many Requests error and temporarily blocks access.

Typical rate limit units

  • RPM (Requests Per Minute) - number of calls per minute
  • TPM (Tokens Per Minute) - number of tokens per minute (for LLM APIs)
  • RPD (Requests Per Day) - total daily limit

How to handle rate limits in automations

  • Add a delay (pause) between requests
  • Implement retry logic with exponential backoff
  • Upgrade your subscription plan with the API provider
  • Spread processing into a queue