I’m seeing significant discrepancies between the documented AI Gateway pricing and what’s actually being charged. I’ve done detailed analysis on two models and the numbers don’t add up.
Back-calculating from the actual charges gives approximately $1.73/1M input and $43.74/1M output - significantly higher than documented, especially for output tokens (4.4x higher).
Request 1: 4900i + 46o = 0.01049125
Request 2: 3300i + 443o = 0.02508875
Solve for i from Request 1:
i = (0.01049125 - 46o) / 4900
Substitute into Request 2:
3300 × (0.01049125 - 46o) / 4900 + 443o = 0.02508875
Solve for o:
0.007068 - 30.98o + 443o = 0.02508875
412.02o = 0.018021
o = 0.0000437 per token = $43.74 per 1M output tokens
Plug o back in to get i:
i = (0.01049125 - 46 × 0.0000437) / 4900
i = 0.00848 / 4900
i = 0.00000173 per token = $1.73 per 1M input tokens
Gemini 2.5 Pro is $1.25/M input, $10.00/M output, and $35.00/K (+input tokens) for web searches. Did you have web searching enabled? Also, could you please provide a full screenshot of the charges, because I can’t tell what columns are what in the dashboard screenshot you shared
Hi Grant! I’m on the AI Gateway team - thanks for sharing this! We looked into your usage and can confirm this was due to reasoning tokens, which were not previously shown in the dashboard. We’ve added the new columns that you flagged above to make it easier to match to the cost of the request. Hopefully this fixes any confusion and lets you track your AI Gateway requests more accurately, thanks again for the feedback.