A
AgentCosts.xyz

LLM Token Pricing

Token Cost Calculator

Convert input, cached input, and output tokens into USD cost, then reverse the math to see how many requests a fixed budget can cover.

Track and route your API spend

Token Cost Calculator

Convert tokens into real model spend.

Estimate LLM cost from input, cached input, output tokens, and request volume. Switch to budget mode to see how far a fixed spend can go.

Definition: Lead scoring, intent labels, moderation flags, routing decisions, and other short-label tasks.

Token preset: Small prompt, very short output, high request volume.

Workload profiles only prefill token/request assumptions. They do not decide model quality or peer comparison groups.

Estimated total cost

$6.048

Cost per request

$0.00012096

Weighted cost / 1M tokens

$0.1234

Cost / 1K tokens

$0.00012343

Billable breakdown

Uncached input35M tokens$4.90
Cached input10M tokens$0.0280
Output4M tokens$1.12

Current model

deepseek-v4-flash

$6.048

Comparable route

glm-4.7-flashx

$4.15

Comparison basis

Routine low-cost peer

Filtered by model capability tier first, then by price for this token shape. This is not a global cheapest-model ranking.

Potential savings

$1.898

Pricing source: DeepSeek · Reviewed: 2026-05-21

Legacy deepseek-chat and deepseek-reasoner map to DeepSeek V4 Flash modes.

Prices are public list prices for text tokens and can change. Verify provider billing pages before production billing decisions.

Track this spend in Router