API cost / month
$0
Tokens + OCR + embeddings + search
Most estimates stop at tokens. This one models the full picture — inference, search, embeddings, retrieval, and overhead. Build with your numbers, not guesses.
Inputs
Output
API / usage costs calculated from published rates × your volume estimates
API cost / month
$0
Tokens + OCR + embeddings + search
API cost per workflow
$0
Cost per request / run at expected volume
+ Operating overhead / month
$0
Infra + storage — your estimates
Total estimate / month
$0
API cost + operating overhead
Cost per active user
$0
API + infra ÷ users (excl. contingency)
Price for target gross margin
$0
Based on variable API costs only
Yearly run-rate
$0
Monthly × 12 — flat rate, no price decay
P10 / P50 / P90 — volume multipliers only, rates are fixed. Distribution is right-skewed: P90 is further from median than P10.
Monthly volume your architecture needs to absorb.
Which components dominate the estimate.
Where to optimise first.
Sources for the editable defaults in this calculator.
Need a personalised cost model?
We can build a tailored estimate for your specific stack, team size, and use case.