I think there may be an issue with how AI Gateway is billing for google/gemini-3-pro-preview, or I am misunderstanding how the pricing is applied.
According to the published pricing for this model:
Input tokens: $2.00 per million
Output tokens: $12.00 per million
One of my recent calls through Vercel AI Gateway shows the following usage and charge:
Provider: Google Vertex, model: google/gemini-3-pro-preview
Input tokens: 1.1K
Output tokens: 182
Total charge: $0.016054
If I calculate this manually:
Input cost: 1,100 ÷ 1,000,000 × $2.00 = $0.0022
Output cost: 182 ÷ 1,000,000 × $12.00 = $0.002184
Total expected cost is therefore about $0.004384.
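To make the arithmetic above reproducible, here is a minimal sketch of the expected-cost calculation using the published per-million-token rates (the helper name `expectedCost` is my own, not a gateway API):

```typescript
// Published google/gemini-3-pro-preview rates (USD per 1M tokens).
const INPUT_RATE = 2.0;
const OUTPUT_RATE = 12.0;

// Expected charge from input/output token counts alone.
function expectedCost(inputTokens: number, outputTokens: number): number {
  return (inputTokens / 1_000_000) * INPUT_RATE
       + (outputTokens / 1_000_000) * OUTPUT_RATE;
}

console.log(expectedCost(1100, 182)); // ≈ 0.004384
```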
However I was actually billed $0.016054 for that call, which is roughly 3.6x higher than the expected amount.
I have repeated this with several different requests and each time the charge shown by AI Gateway is around 4x what I would expect from the published per token pricing.
Is this wrong, or am I misunderstanding how the pricing is calculated in this case?
In Vercel AI Gateway, one of my calls is shown as costing $0.0057.
Using the published Gemini rates, if I calculate cost from promptTokenCount plus candidatesTokenCount I get about $0.00038, so the charge is roughly 15x higher than expected!
However, if I treat:
input tokens as promptTokenCount (36)
output tokens as candidatesTokenCount (26) + thoughtsTokenCount (443) = 469
Total tokens: 36 + 469 = 505
and then apply the same rates, the result is exactly $0.0057.
So it looks like AI Gateway is also billing thoughtsTokenCount as output tokens, which explains why all my Gemini charges are coming out at a much higher amount than my own calculations.
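The reconstruction above can be sketched in code. This assumes the field names from Gemini's usageMetadata response (promptTokenCount, candidatesTokenCount, thoughtsTokenCount); the `gatewayCost` helper is illustrative, not an actual gateway function:

```typescript
// Token counts as reported in Gemini's usageMetadata.
interface UsageMetadata {
  promptTokenCount: number;
  candidatesTokenCount: number;
  thoughtsTokenCount?: number;
}

// Published google/gemini-3-pro-preview rates (USD per 1M tokens).
const INPUT_RATE = 2.0;
const OUTPUT_RATE = 12.0;

// Cost if thinking tokens are billed at the output rate,
// which is what the gateway charge appears to reflect.
function gatewayCost(u: UsageMetadata): number {
  const outputTokens = u.candidatesTokenCount + (u.thoughtsTokenCount ?? 0);
  return (u.promptTokenCount / 1_000_000) * INPUT_RATE
       + (outputTokens / 1_000_000) * OUTPUT_RATE;
}

// The call above: 36 prompt, 26 candidates, 443 thinking tokens.
console.log(gatewayCost({
  promptTokenCount: 36,
  candidatesTokenCount: 26,
  thoughtsTokenCount: 443,
})); // ≈ 0.0057, matching the gateway's reported charge
```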
Hi @jmcharks, off the top of my head I'd say your finding is correct. I think it would be the same if you made the request directly to the Gemini API.
Nevertheless, I’ve asked our team to confirm if that’s the correct understanding.
Confirmation: reasoning tokens are billed at the output token rate. We don't add any markup on the provider's model pricing, so costs will always be the same as if you were making the request directly to the provider's API.