Skip to content
Claude Code Cost

What does Claude Code actually cost in 2026?

Per-token prices for Opus, Sonnet, and Haiku. Real-world subscription numbers. Where prompt caching saves 30-70 percent. And how to see what you're actually paying — live, by project — without waiting for a monthly invoice.

Claude API pricing by model

All prices in USD per million tokens. Output tokens are charged at 4-5x the input rate, so writing-heavy use cases cost more than reading-heavy ones.

ModelInputOutputCache writeCache readBest for
Claude Opus 4.8$5.00$25.00$6.25$0.50Hardest reasoning, complex refactors
Claude Sonnet 4.6$3.00$15.00$3.75$0.30Daily coding work, speed/intelligence balance
Claude Haiku 4.5$1.00$5.00$1.25$0.10High-throughput / cheap classification

Last updated: June 2026. Prices reflect current public pricing. Verify on the Anthropic pricing page before locking in a budget.

Pro vs Max vs Team — which is cheapest for Claude Code?

Subscription pricing is almost always cheaper than per-token API billing for individuals. The math flips at high volume or when you need the higher-tier API rate limits.

$20/mo

Claude Pro

Base 5-hour rolling window. Plenty for moderate daily use. Cheapest entry point.

Individual devs, casual Claude Code use

$100/mo

Claude Max ($100)

5x the Pro allowance. Designed for developers who use Claude Code as their primary coding tool.

Daily heavy users, professional engineers

$200/mo

Claude Max ($200)

20x the Pro allowance. Effectively eliminates rate limit interruptions for solo work.

All-day coding sessions, power users

What does a typical day cost?

Real numbers from instrumented Claude Code sessions. Your mileage will vary depending on project size, model mix, and how often Claude has to re-read context.

Light day — small refactors, mostly reading code

Cache reads dominate; output volume small.

in 120K · out 20K
Sonnet 4.6 with caching
$0.55

Average day — feature work, multi-file edits

Most developers fall here. ~6 hours of focused coding.

in 850K · out 180K
Sonnet 4.6 with caching
$5.10

Heavy day — large refactor with Opus escalation

Opus output tokens drive most of the cost.

in 2.4M · out 650K
Sonnet baseline + Opus 4.8 for hard sub-tasks
$19.40

Production batch — codebase-wide migration

Caching saved an estimated $90 vs naive replay.

in 8.2M · out 1.1M
Sonnet 4.6 with aggressive caching
$42.80

Prompt caching: the biggest cost lever

Anthropic charges 25 percent more for the first request that creates a cache entry, but 90 percent less for every subsequent read. For Claude Code workflows that re-use the same context across many requests, this is the single biggest savings opportunity.

Without caching

10 requests, each with 100K tokens of repeated codebase context. Sonnet pricing.

$3.00

10 × 100K input × $3/M = $3.00

cache wins

With caching

First request writes the cache, next 9 read it. Same 100K token context.

$0.65

$0.375 cache write + 9 × $0.030 cache read = $0.65 — 78% cheaper

Track Claude Code cost live, by project

The Anthropic Console shows yesterday's cost. Tokemon shows what your tokens are doing right now — broken down by project, with live burn rate and a forecast of where you'll land for the month. Free, open source, runs in your menu bar.

  • • Per-project cost breakdown — see exactly which codebase used 1.2B tokens this month
  • • Real-time burn rate with model-aware cost calculation
  • • Monthly budget forecast with auto-alerts at 50/75/90%
  • • PDF / CSV / JSON export for invoices and finance reports

FAQ

How much does Claude Code cost per month?

On a Claude subscription: Pro is $20/month, Max is $100 or $200/month, Team is $30 per seat. On the API, a typical day of intensive Claude Code use ranges from $5 to $40 depending on model mix and how aggressively you use prompt caching. For most individual developers, a Pro or Max subscription is cheaper than per-token API billing.

What is the per-token cost of Claude Opus, Sonnet, and Haiku?

Claude Opus 4.8 is $5 per million input tokens and $25 per million output tokens. Claude Sonnet 4.6 is $3 per million input and $15 per million output. Claude Haiku 4.5 is the cheapest at $1 per million input and $5 per million output. Output tokens are 5x more expensive than input across all three.

Does prompt caching reduce Claude Code cost?

Yes, significantly. Prompt cache writes cost 25 percent more than a normal input token, but cache reads cost 90 percent less. For long-running coding sessions where the same context (files, conversation history) is reused across many requests, caching can reduce total spend by 30-70 percent.

Is Claude Pro or the API cheaper for Claude Code?

Claude Pro at $20/month is almost always cheaper than the equivalent API spend for individual developers. The 5-hour rolling window gives you generous capacity, and there are no per-token charges. Move to the API only when you need programmatic access, want to bypass rate limits with higher tiers, or are building a product on top of Claude.

How can I see what I'm actually paying for Claude Code?

On the Anthropic Console you can see daily API spend after the fact. For real-time visibility — including which projects are consuming the most tokens — install Tokemon. It tracks per-project costs from your local Claude Code session logs and shows live burn rate, so you can see costs accruing as you work.

What's the cheapest way to use Claude Code?

Combine three things: subscribe to Claude Pro instead of using the API, use Sonnet (not Opus) for routine work and only escalate to Opus for hard problems, and let Claude Code's automatic prompt caching do its job by working in focused sessions on a single codebase.

Related reading