ModelPricing

Know what your AI system actually costs.

Most estimates stop at tokens. This one models the full picture — inference, search, embeddings, retrieval, and overhead. Build with your numbers, not guesses.

Full-stack cost model
Low / expected / high ranges
Editable provider pricing
Revenue & break-even analysis

Inputs

Project assumptions

Quick setup

Five inputs for a working estimate. Presets fill the rest.

Tokens, OCR, review rates, pricing overrides

Output

Cost estimate

Model preset

API / usage costs calculated from published rates × your volume estimates

API cost / month

$0

Tokens + OCR + embeddings + search

API cost per workflow

$0

Cost per request / run at expected volume
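As a rough sketch of how the two API figures above could be derived: each component is a rate multiplied by a monthly volume, and the per-workflow figure divides the monthly total by the workflow count. The class names, field names, and billing units below are hypothetical placeholders, not the calculator's actual presets.

```python
from dataclasses import dataclass


@dataclass
class Rates:
    """Hypothetical provider rates in USD; replace with published pricing."""
    input_per_mtok: float     # per 1M input tokens
    output_per_mtok: float    # per 1M output tokens
    embed_per_mtok: float     # per 1M embedded tokens
    search_per_1k: float      # per 1,000 search calls
    ocr_per_1k_pages: float   # per 1,000 OCR pages


@dataclass
class Volume:
    """Monthly volume estimates (assumed inputs)."""
    workflows: int
    input_tokens: int
    output_tokens: int
    embed_tokens: int
    search_calls: int
    ocr_pages: int


def api_cost_per_month(r: Rates, v: Volume) -> float:
    """Tokens + OCR + embeddings + search, each as rate x volume."""
    tokens = (v.input_tokens * r.input_per_mtok
              + v.output_tokens * r.output_per_mtok) / 1_000_000
    embeddings = v.embed_tokens * r.embed_per_mtok / 1_000_000
    search = v.search_calls * r.search_per_1k / 1_000
    ocr = v.ocr_pages * r.ocr_per_1k_pages / 1_000
    return tokens + ocr + embeddings + search


def api_cost_per_workflow(r: Rates, v: Volume) -> float:
    """Monthly API cost spread over the expected number of workflows."""
    if v.workflows == 0:
        return 0.0  # matches the empty-state $0 rather than dividing by zero
    return api_cost_per_month(r, v) / v.workflows
```

With made-up rates of $3 and $15 per million input and output tokens, the token share alone scales linearly with volume, which is why the calculator asks for volume estimates before anything else.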

Full-stack estimate: adds your own operating inputs, which are entered, not calculated

+ Operating overhead / month

$0

Infra + storage — your estimates

Total estimate / month

$0

API cost + operating overhead

Cost per active user

$0

(API + infra) ÷ users (excl. contingency)
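The total and per-user figures above reduce to simple arithmetic. A minimal sketch, assuming the sum of API and infra cost is divided by active users; the function and parameter names are illustrative, and whether "infra" here also covers storage is not stated on the page.

```python
def total_per_month(api_cost: float, operating_overhead: float) -> float:
    """Total estimate = API cost + operating overhead (both monthly, USD)."""
    return api_cost + operating_overhead


def cost_per_active_user(api_cost: float, infra_cost: float,
                         active_users: int) -> float:
    """(API + infra) / users; contingency is deliberately left out."""
    if active_users <= 0:
        return 0.0  # assumed empty-state behaviour, mirroring the $0 placeholder
    return (api_cost + infra_cost) / active_users
```

For example, $408 of API cost plus $592 of infra across 500 active users comes to $2.00 per user.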

Price for target gross margin

$0

Based on variable API costs only
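One common way to back out a price from a target gross margin is to divide the variable cost by one minus the margin. The sketch below assumes that convention and a per-unit variable API cost; the page does not confirm the exact formula or the unit (per user or per workflow), so treat both as assumptions.

```python
def price_for_target_margin(variable_cost: float, target_margin: float) -> float:
    """Price such that (price - variable_cost) / price == target_margin.

    variable_cost: variable API cost per unit sold (assumed unit).
    target_margin: gross margin as a fraction, e.g. 0.8 for 80%.
    """
    if not 0.0 <= target_margin < 1.0:
        raise ValueError("target_margin must be in [0, 1)")
    return variable_cost / (1.0 - target_margin)
```

Because only variable API costs are included, the resulting price covers fixed overhead only to the extent the margin leaves room for it.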

Yearly run-rate

$0

Monthly × 12 — flat rate, no price decay

Scenario range

P10 / P50 / P90: the scenarios vary volume multipliers only; rates stay fixed. The distribution is right-skewed, so P90 sits further from the median than P10.

P10 $0 lean adoption
P50 $0 median case
P90 $0 heavy usage
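To illustrate the scenario range, here is a minimal sketch that scales a volume-driven monthly cost by per-scenario multipliers while leaving rates untouched. The multiplier values (0.6, 1.0, 1.8) are invented for illustration and are not the calculator's own; they are chosen only so that P90 lies further from the median than P10.

```python
# Hypothetical volume multipliers; the real values are not given on the page.
DEFAULT_MULTIPLIERS = {"P10": 0.6, "P50": 1.0, "P90": 1.8}


def scenario_range(expected_monthly_cost: float,
                   multipliers: dict[str, float] = DEFAULT_MULTIPLIERS) -> dict[str, float]:
    """Scale a volume-driven cost by each scenario's volume multiplier.

    With fixed rates, cost is linear in volume, so multiplying volume by m
    multiplies the volume-driven cost by m as well.
    """
    return {name: expected_monthly_cost * m for name, m in multipliers.items()}
```

With an expected cost of $408, these placeholder multipliers give roughly $245 for lean adoption and $734 for heavy usage.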

Workload summary

Monthly volume your architecture needs to absorb.

Workflows / month 0
Model calls / month 0
Input tokens / month 0
Output tokens / month 0
Search calls / month 0
OCR pages / month 0
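The monthly volumes listed above can be thought of as per-workflow assumptions multiplied out by workflow count. A sketch under that assumption follows; every parameter name is hypothetical, and the calculator's actual inputs may differ.

```python
def workload_summary(workflows: int,
                     calls_per_workflow: float,
                     input_tokens_per_call: float,
                     output_tokens_per_call: float,
                     searches_per_workflow: float,
                     ocr_pages_per_workflow: float) -> dict[str, int]:
    """Derive monthly volumes from per-workflow assumptions."""
    model_calls = workflows * calls_per_workflow
    return {
        "workflows": workflows,
        "model_calls": round(model_calls),
        "input_tokens": round(model_calls * input_tokens_per_call),
        "output_tokens": round(model_calls * output_tokens_per_call),
        "search_calls": round(workflows * searches_per_workflow),
        "ocr_pages": round(workflows * ocr_pages_per_workflow),
    }
```

Fractional per-workflow values (for example 0.4 OCR pages per workflow) are allowed because they represent averages across many runs.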

Cost breakdown

Which components dominate the estimate.

What's driving cost

Where to optimise first.

Need a personalised cost model?

We can build a tailored estimate for your specific stack, team size, and use case.

consulting@modelpricing.io