Claude Opus 4.8 Fast
About this model
Claude Opus 4.8 Fast, released on May 28, 2026, is a speed-optimized configuration of Claude Opus 4.8, which Anthropic describes as its most capable generally available Opus model for coding and agents. It runs the identical underlying model — the same 1M-token context window, 128k maximum output tokens, and adaptive thinking — but routes inference through a higher-throughput "fast" path.
The fast mode's benefit centers on output token generation rather than time to first token, making it suited to live coding sessions and real-time agentic loops, where it trades premium pricing for reduced latency. Standard Opus 4.8 offers the same intelligence at lower cost for batch and less time-sensitive workloads.
Relative to its predecessor, Anthropic says Opus 4.8 targets behavioral improvements over Claude Opus 4.7 in long-horizon agentic coding — including better long-context handling, fewer compactions, improved compaction recovery, and more reliable tool triggering, addressing a reported tendency in Opus 4.7 to skip required tool calls. Anthropic also highlights honesty gains, noting early testers found the model more likely to flag uncertainty and less likely to make unsupported claims.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.
Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 6d ago