PlatformClaude API 1M context window for Sonnet 4.5 (beta)
Sonnet 4.5 now supports 1M tokens via beta header. Pricing: $6/MTok input, $22.50/MTok output for context >200K.
Enables loading entire small/medium codebases or 200+ page documents in a single request. The 200K base context remains at standard pricing; only tokens beyond 200K are charged at the 1M-context rate. Most workloads should still use 200K + retrieval (RAG) for cost efficiency.
Read: 1M context guide · Cost calculator
ModelClaude Haiku 4.5 released
Haiku 4.5 launches at $0.80 / $4.00 per MTok. 3.75x cheaper than Sonnet on input. Korean quality 8.5/10 (self-measured).
The fastest model in the 4.5 family. Best for high-volume classification, summarization, and simple Q&A. Dramatically reduces cost for chatbots and agents that route 80% of traffic to Haiku.
Read: Model decision tree · Korean chatbot ($48/mo for 10K conversations)
Claude CodeClaude Code v2 stable release
Claude Code 2.x with improved CLAUDE.md support, hook system stable, MCP server marketplace.
Hooks now stable for blocking/auditing tool calls. MCP server marketplace lets you install 50+ community-built integrations. CLAUDE.md routing rules supported for skill invocation.
Read: Claude Code Complete Guide · Hooks reference
APIFiles API stable (32MB limit)
Files API moves from beta to stable. Up to 32MB per file. Use for PDFs and images larger than inline 5MB limit.
Replaces the awkward base64 inline pattern for large attachments. Upload once, reference by file_id across multiple requests. Reduces input token billing for repeated document use.
Read: PDF parsing guide · Files API errors
PricingBatch API 50% discount confirmed for 2026
Anthropic confirmed Batch API 50% discount continues. 24-hour SLA. Up to 10,000 requests per batch.
Best for nightly reports, weekly summaries, backfill jobs, and any non-realtime workload. Combine with prompt caching for compound savings.
Read: Batch vs Streaming · Batch troubleshooting