AI Gateway pricing discrepancy between documented rates and actual charges

grantsingleton · January 21, 2026, 7:49pm

I’m seeing significant discrepancies between the documented AI Gateway pricing and what’s actually being charged. I’ve done detailed analysis on two models and the numbers don’t add up.

Gemini 2.5 Pro

Documented pricing: $1.25/1M input, $10.00/1M output

Actual requests from my dashboard:

Back-calculating from the actual charges gives approximately $1.73/1M input and $43.74/1M output - significantly higher than documented, especially for output tokens (4.4x higher).

Request 1: 4900i + 46o = 0.01049125  
Request 2: 3300i + 443o = 0.02508875

Solve for i from Request 1:  
i = (0.01049125 - 46o) / 4900

Substitute into Request 2:  
3300 × (0.01049125 - 46o) / 4900 + 443o = 0.02508875

Solve for o:  
0.007068 - 30.98o + 443o = 0.02508875  
412.02o = 0.018021  
o = 0.0000437 per token = $43.74 per 1M output tokens

Plug o back in to get i:  
i = (0.01049125 - 46 × 0.0000437) / 4900  
i = 0.00848 / 4900  
i = 0.00000173 per token = $1.73 per 1M input tokens

Verification

Using $1.73/1M input and $43.74/1M output:

Request 1: 4900 × $1.73/1M + 46 × $43.74/1M = $0.00848 + $0.00201 = $0.01049  
Request 2: 3300 × $1.73/1M + 443 × $43.74/1M = $0.02509

Both match the actual charges.

I had similar results with gpt-5-mini. Not as extreme as this but they were still consistently wrong.

bestcodes · January 21, 2026, 10:56pm

Gemini 2.5 Pro is $1.25/M input, $10.00/M output, and $35.00/K (+input tokens) for web searches. Did you have web searching enabled? Also, could you please provide a full screenshot of the charges, because I can’t tell what columns are what in the dashboard screenshot you shared

pawlean · January 22, 2026, 10:26am

Thank you for the detailed post! I’ve shared it with the AI Gateway team, and we’re looking at deploying a fix. I’ll let you know once we do!

grantsingleton · January 22, 2026, 1:54pm

No web search. Columns from left to right are date, model, provider, cost, input tokens, output tokens, duration.

grantsingleton · January 22, 2026, 1:56pm

Thank you! Im already seeing new columns for “cache read”, “cache write”, and “reasoning”, are those related to this fix?

jerilynzheng · January 22, 2026, 5:20pm

Hi Grant! I’m on the AI Gateway team - thanks for sharing this! We looked into your usage and can confirm this was due to reasoning tokens, which were not previously shown in the dashboard. We’ve added the new columns that you flagged above to make it easier to match to the cost of the request. Hopefully this fixes any confusion and lets you track your AI Gateway requests more accurately, thanks again for the feedback.

Topic		Replies	Views
Ai Gateway: Gemini Pricing Calculation Error Help ai-gateway	2	149	December 1, 2025
SOLVED: Pricing Issues for zai/glm-4.7, in AI Gateway? Help ai-gateway	3	181	January 25, 2026
Clarifying input vs output token costs in Vercel usage v0 ai-sdk , usage , billing , ai-gateway , pricing	3	94	January 23, 2026
Question About AI Gateway Production Rate Limits Help ai-gateway	1	98	May 14, 2026
Groq AI Gateway Pricing Help ai-gateway	8	118	February 23, 2026

AI Gateway pricing discrepancy between documented rates and actual charges

Related topics