Rate Limit
The maximum number of API requests that can be sent in a given time period before the server temporarily suspends access.
What is a rate limit?
A rate limit is the maximum number of requests that can be sent to an API within a given time period - most commonly per minute or per day. If you exceed the limit, the API returns a 429 Too Many Requests error and temporarily blocks access.
Typical rate limit units
- RPM (Requests Per Minute) - number of calls per minute
- TPM (Tokens Per Minute) - number of tokens per minute (for LLM APIs)
- RPD (Requests Per Day) - total daily limit
How to handle rate limits in automations
- Add a delay (pause) between requests
- Implement retry logic with exponential backoff
- Upgrade your subscription plan with the API provider
- Spread processing into a queue