Vercel API Gateway rate limiting: how are limits defined and scaled?

sibitrung4-1680 · May 3, 2026, 4:09pm

Hi everyone,

I’m trying to better understand the rate limiting behavior of the Vercel API Gateway.

From the documentation, I couldn’t find clear details about limits such as:

Tokens per minute (TPM)
Requests per minute (RPM)

In platforms like OpenAI, rate limits are typically structured by usage tiers (higher tiers get higher limits).

So I have a few questions:

Does Vercel API Gateway enforce rate limits (RPM/TPM or similar)?
If yes, how are these limits determined? Are they based on plan, usage, or purchased credits?
Is there any automatic scaling of limits as usage increases?
Are there best practices to avoid hitting limits when building production applications?

For context, I plan to use the API Gateway with paid usage (buying credits), and I want to understand how it performs under high traffic in production.

Thanks in advance for any clarification!

Topic		Replies	Views
Questions about pricing on Vercel Firewall rate limiting Help firewall	3	88	August 21, 2025
Free credits temporarily have restricted access due to abuse AI SDK	7	834	October 13, 2025
Are there Vercel monthly plans with unlimited credits? v0 account , usage , billing , pricing	3	43	January 20, 2026
Pro plan limit v0	6	231	June 3, 2025
Unable to understand how NEW pricing works! v0	4	166	June 6, 2025

Vercel API Gateway rate limiting: how are limits defined and scaled?

Related topics