10000 requests
- Input token cost
- $6.00
- Output token cost
- $7.50
- Cost per request
- $0.00135
Model pricing changes. Use the current rate card from your provider before budgeting real usage.
Use this free AI token cost calculator to estimate input token cost, output token cost, total cost, and cost per request without hardcoded stale model prices.
10000 requests
Model pricing changes. Use the current rate card from your provider before budgeting real usage.
Estimate the monthly cost of an AI support bot, writing helper, or internal tool.
Compare two model price cards using the same token and request assumptions.
Turn a token estimate into a rough budget before building a prototype.
Explain why long prompts and long answers can cost different amounts.
$24 input + $40 output = $64 total, or $0.0064 per request
$0.045 input + $0.09 output = $0.135 total
$20 input + $7 output = $27 total
Need a slower walkthrough, a related calculator, or the full library? These links keep you close to the task you started.
Plain-language answers about when to use the tool, what it does with your inputs, what to double-check, and how privacy works.
Use it when your task matches one of these common needs: Estimate the monthly cost of an AI support bot, writing helper, or internal tool. Compare two model price cards using the same token and request assumptions. It works best when you already know the measurements, amounts, units, or options the page asks for.
In plain language: Input cost = input tokens per request * request count / 1,000,000 * input price per 1M tokens. Output cost = output tokens per request * request count / 1,000,000 * output price per 1M tokens. Total cost = input cost + output cost. Cost per request = total cost / request count. The examples on the page are there so you can compare your inputs with a worked example before copying the answer.
Requests: How many model calls you want to estimate, such as one day, one month, or one product test. Input tokens per request: Tokens sent to the model each time, including instructions, prompt text, context, and tool messages. Output tokens per request: Tokens generated by the model in each response. Input price per 1M tokens: The current provider rate for one million input tokens for the model you plan to use. Output price per 1M tokens: The current provider rate for one million output tokens. This is often different from the input price.
Read the AI result as a best-effort clue or draft. Look at labels, scores, notes, and warnings together, then compare the result with the original text or image before using it anywhere important.
This is a planning estimate, not a live provider bill. AI providers can change prices, count cached tokens differently, round usage, add batch discounts, include tool-call costs, or apply credits and taxes. Use the current provider rate card and your real usage logs for budgets that matter. Also check the unit, scale, mode, and result limit because small input changes can change the answer.
Model prices change and different providers charge different rates for input, output, cached input, fine-tuned models, batch jobs, and special tools. Entering the rate yourself keeps the calculator useful without pretending one price is always current.
No. It is a plain estimate for normal input and output tokens. If your provider has cached-token pricing, batch discounts, minimum charges, or credits, calculate those separately or adjust the prices you enter.
Use the number of requests you expect in a month, then enter the average input tokens and output tokens for one request. The total cost is the input spend plus output spend for that whole request count.
Many model providers charge different rates for tokens you send and tokens the model generates. Long prompts raise input cost, while long answers raise output cost, so keeping them separate makes the estimate easier to check.
Only if your token numbers came from the exact tokenizer or usage logs for the model. Rough text estimates can be useful for planning, but code, symbols, non-English text, whitespace, and tool messages can change the real token count.
Yes. Keep the request count and token assumptions the same, then enter one model price card and compare it with another. This shows the pricing effect, not quality, latency, rate limits, or reliability.
No. It only estimates model token charges from the rates you enter. Add hosting, storage, retrieval, image/audio/video tools, retries, monitoring, and other platform costs separately.
No. The calculator runs in your browser tab. Your recent answers stay only on the page while you use it, and they are not sent to a server.