
AI Workflow Cost Estimator

Most agencies hide the ongoing cost of AI. We don't. Pick a workflow, set the volume, see what it would actually cost to run per month — including LLM calls, third-party APIs, vector storage, and our Care fee.

Built for honest budgeting. Every estimate runs in your browser — we never see your numbers unless you send them to us.

01 — Pick a starting point

Start with a template, then tune from there. Or pick "Blank" and describe your own.

02 — Volume & per-request size

Monthly request volume
How many times will the workflow run per month?

Avg input tokens / request
Includes prompt, system message, RAG context

Avg output tokens / request
What the model writes back

03 — Model

Prompt cache hit rate: 40%

Anthropic's prompt cache cuts the cost of repeated input tokens by ~90%, so a higher cache hit rate means a lower bill.

04 — Workflow shape

Engineering complexity

Affects build estimate and Care tier recommendation.

05 — Third-party APIs per request

Calls per request (set to 0 if not used). Multiplied by volume to get monthly cost.

VIN decoder

Vehicle data (year, make, model, specs) — $0.0500 per call

SMS (Twilio)

Inbound or outbound SMS message — $0.0080 per call

Voice telephony (Twilio)

Inbound or outbound voice call — $0.0220 per minute

Transcription (AssemblyAI / Whisper)

Audio → text transcription — $0.0100 per minute

Web scraping (Apify / Browse)

Fetch + render a page — $0.0200 per call

Geocoding (Google Maps)

Address → lat/lng — $0.0050 per call

Email parsing (Postmark inbound)

Receive + parse inbound email — $0.0100 per call

Image generation (DALL-E / SDXL)

Standard quality image — $0.0400 per call

Estimated monthly

$478 – $501

Includes AI Care · Standard ($399/mo) + estimated API spend with a 30% safety buffer on the high side.

AI Care · Standard: $399/mo

Estimated API spend: $79 – $102/mo

Cost per request: $0.10

Range: $0.07 – $0.15 (p25–p75)

Expected latency: ~14.9s / request

Vector DB lookup · 80ms

Claude Sonnet 4.6 generation · 14.8s

One-time build

$5,000 – $10,000+

3–5 weeks · scoped to your complexity rating. Range is a typical starting point — projects can extend higher based on scope. Quoted firm in your proposal.

API spend breakdown

Claude Sonnet 4.6 (LLM): $9/mo
40% cache hit, saves $2.16/mo

Vector DB (Pinecone serverless): $70/mo
Fixed monthly minimum

Prompt cache savings: −$2.16/mo

Recommended Care tier

AI Care · Standard

Up to 6 hrs/mo · Multi-step workflows with memory, integrations, or moderate volume. Most clients land here.

Per-request flow

What happens each time the workflow runs.

1. Inbound trigger
   User input, webhook, scheduled event, or inbound message
2. Vector DB lookup (80ms)
   Find relevant context from the knowledge base
3. Claude Sonnet 4.6 generation ($0.0135/req, 14.8s)
   Model produces 400 avg output tokens
4. Response delivered
   Returned to user / written back / next step
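
As a rough illustration, the expected latency shown in the estimate appears to be the sum of the per-step latencies in this flow. The sketch below assumes the trigger and delivery steps add no measurable time, which the page does not state.

```python
# (step name, latency in seconds) for the steps that carry a latency figure on the page.
# Assumption: the trigger and delivery steps contribute nothing measurable.
steps = [
    ("Vector DB lookup", 0.080),
    ("Claude Sonnet 4.6 generation", 14.8),
]

total_s = sum(latency for _, latency in steps)
print(f"~{total_s:.1f}s / request")  # prints "~14.9s / request"
```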


Pricing data updated May 8, 2026

How it works

LLM API costs

Calculated as (volume × avg input tokens ÷ 1,000,000 × $/M input) + (volume × avg output tokens ÷ 1,000,000 × $/M output), where $/M is the model's price per million tokens. Anthropic models include a prompt-cache slider; cached input is roughly 90% cheaper than uncached.
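
A minimal sketch of that formula in Python, not the estimator's actual code. The function name and the example inputs (800 requests per month, 2,500 input tokens, $3 and $15 per million input and output tokens) are assumptions chosen for illustration; only the 400 output tokens, the 40% hit rate, and the ~90% cache discount come from this page. The sketch also ignores any extra charge for writing to the cache.

```python
def llm_monthly_cost(volume, input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m,
                     cache_hit_rate=0.0, cache_discount=0.90):
    """Return (monthly LLM spend, amount saved by prompt caching), in dollars."""
    input_cost = volume * input_tokens * input_price_per_m / 1_000_000
    output_cost = volume * output_tokens * output_price_per_m / 1_000_000
    # Cached input tokens are billed at (1 - cache_discount) of the normal rate.
    cache_savings = input_cost * cache_hit_rate * cache_discount
    return input_cost + output_cost - cache_savings, cache_savings


# Assumed inputs: volume, input tokens, and per-million prices are hypothetical.
total, saved = llm_monthly_cost(
    volume=800, input_tokens=2500, output_tokens=400,
    input_price_per_m=3.00, output_price_per_m=15.00,
    cache_hit_rate=0.40,
)
print(f"LLM: ${total:.2f}/mo, cache saves ${saved:.2f}/mo")
# prints "LLM: $8.64/mo, cache saves $2.16/mo"
```

With these assumed inputs the result rounds to the $9/mo LLM line and matches the $2.16/mo cache savings in the sample breakdown.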

Third-party API costs

Per-request multipliers for things like VIN decoders, SMS, voice, and transcription. RAG workflows include a fixed monthly cost for vector database storage (Pinecone serverless minimum is ~$70/mo).
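
One way to picture the calculation, as a sketch rather than the estimator's real implementation. The unit prices are copied from the list in section 05; the dictionary keys, the function name, and the example workflow are hypothetical.

```python
# Dollars per call (or per minute for voice and transcription), from the price list above.
UNIT_PRICES = {
    "vin_decoder": 0.05,
    "sms": 0.008,
    "voice_minute": 0.022,
    "transcription_minute": 0.01,
    "web_scraping": 0.02,
    "geocoding": 0.005,
    "email_parsing": 0.01,
    "image_generation": 0.04,
}
VECTOR_DB_MONTHLY_MIN = 70.0  # Pinecone serverless minimum quoted on this page


def third_party_monthly_cost(volume, units_per_request, uses_rag=False):
    """Monthly third-party spend: per-request usage times volume, plus the fixed RAG cost."""
    per_request = sum(UNIT_PRICES[name] * n for name, n in units_per_request.items())
    fixed = VECTOR_DB_MONTHLY_MIN if uses_rag else 0.0
    return volume * per_request + fixed


# Hypothetical workflow: 1,000 runs per month, each decoding one VIN and sending two SMS.
cost = third_party_monthly_cost(1000, {"vin_decoder": 1, "sms": 2})
print(f"${cost:.2f}/mo")  # prints "$66.00/mo"
```

A RAG workflow with no per-request API calls would still carry the fixed $70/mo under this sketch, because the vector database minimum does not scale with volume.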

Care tier

Recommended automatically based on workflow complexity, volume, memory, and whether voice is involved. Care covers Stratus's monitoring, prompt tuning, model upgrades, and small fixes — distinct from API spend (which is always pass-through).

Safety buffer

The high end of the monthly invoice adds a 30% buffer on API spend. Most workflows run below the high end; this is the ceiling we'd quote in a real proposal so you aren't surprised.
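
The range can be sketched as follows, assuming the buffer applies only to API spend and not to the Care fee, which is how the sample figures on this page add up. The function name is hypothetical, and the unrounded API spend of $78.64 is an assumption that rounds to the $79 shown.

```python
def monthly_range(api_spend, care_fee, buffer=0.30):
    """Return (low, high) monthly totals; only API spend is buffered on the high side."""
    low = care_fee + api_spend
    high = care_fee + api_spend * (1 + buffer)
    return low, high


# Assumed unrounded API spend; the Care fee is the Standard tier price from this page.
low, high = monthly_range(api_spend=78.64, care_fee=399)
print(f"${low:,.0f} - ${high:,.0f}")  # prints "$478 - $501"
```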

What this isn't

A binding quote. It's a directional tool. Your actual quote depends on scope, integrations, and exact usage patterns — which is what we figure out in the discovery call. The estimator gets you 80% of the way there in five minutes.

Bring your estimate to a discovery call. We'll pressure-test the assumptions, scope the build, and send you a firm proposal within two business days.
