Anthropic·💬 Text Generation·VS Pick

Claude Opus 4.8 Fast

ReasoningVisionCodeFunction CallingWeb Searchanonymized

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

Claude Opus 4.8 Fast — TLDR

🆕 Speed-optimized variant of Anthropic's most capable generally available Opus model.
⚡ Fast mode runs up to 2.5× output tokens per second.
📏 1M-token context window by default on the Claude API.
🔧 Builds on Opus 4.7 with better tool triggering, fewer compactions.
👁️ Multimodal with vision, function calling, and web search.
🎯 Tuned for long-horizon agentic coding and enterprise knowledge work.
💬 Adaptive thinking with up to 128k max output tokens.
🏢 Premium pricing tier; same underlying model as standard Opus 4.8.

💰 Pricing

$12.00 / $60.00

per 1M · input / output

📏 Context

1M tokens

📅 On Venice since

May 28, 2026

53 days ago

Provider

Anthropic

Anthropic PBC is an American artificial intelligence company headquartered in San Francisco. Structured as a public benefit corporation, the lab develops large language models under the Claude name, with a research emphasis on building reliable, steerable,…

Read full profile →

10 models on Venice

10 text

Since Jan 15, 2025

Wikipedia ↗Official site ↗

See 9 other models from Anthropic →

About this model

Claude Opus 4.8 Fast is the latency-optimized configuration of Claude Opus 4.8, Anthropic's most capable generally available model, released on May 28, 2026. Rather than a separate model, fast mode serves the same Opus 4.8 weights at higher throughput: setting speed to "fast" yields up to 2.5× more output tokens per second at premium pricing, offered initially as a research preview on the Claude API. It retains the full 1M-token context window that runs by default on the Claude API, Amazon Bedrock, and Vertex AI, plus up to 128k max output tokens and adaptive thinking.

Compared with the prior generation, Claude Opus 4.7 Fast, the underlying 4.8 model targets behavioral gains that Anthropic attributes to long-horizon agentic coding, including better long-context handling, fewer compactions, and improved compaction recovery. Anthropic also reports better tool triggering — the model is less likely to skip a required tool call, an issue some users flagged on Opus 4.7 — and improved honesty, with reduced tendency to overclaim progress.

A notable practical change: Anthropic states that fast mode for Opus 4.8 is three times cheaper than fast mode was on previous Opus models, narrowing the cost gap with the standard tier.

This is the newest entry in the Opus fast lineage. It sits alongside the broader Claude family, including Claude Fable 5, which Anthropic describes as its most capable model in Claude Code for tasks larger than a single sitting.

Sources

Introducing Claude Opus 4.8 \ Anthropicanthropic.com ↗

What's new in Claude Opus 4.8 - Claude API Docsplatform.claude.com ↗

Model configuration - Claude Code Docscode.claude.com ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 3d ago