MiniMaxMiniMax·💬 Text Generation·New

MiniMax M3 Preview

ReasoningCodeFunction CallingWeb Searchfp8private
🧠 Try in Intelligence →Try on Venice.ai ↗
Quick reference
MiniMax M3 Preview — TLDR
  • 🧠 Frontier 1.4-trillion-parameter MiniMax model for coding, agents, reasoning.
  • 📏 512K-token context window in this preview, served at fp8.
  • 🆕 Built on new MiniMax Sparse Attention (MSA) for long context.
  • ⚡ MSA enables efficient native ultra-long-context pretraining.
  • 🔧 Function calling, tool use, and structured agentic task execution.
  • 👁️ Sibling M3 is natively multimodal (text, image, video input).
  • 🌐 Web search and long-horizon agentic workflows supported.
  • 🎯 Targets autonomous coding and multi-step agentic reasoning.
💰 Pricing
$0.300 / $1.20
per 1M · input / output
📏 Context
524K tokens
📅 On Venice since
Jun 12, 2026
5 days ago
Provider

MiniMax is an AI company building generative models across multiple modalities, with a focus that spans both language understanding and audio creation. Their rapid release cadence in early 2026—delivering several new models within just a few months—reflects…

Read full profile →
8 models on Venice
4 text · 3 music · 1 tts
Since Feb 12, 2026

About this model

MiniMax M3 Preview is a preview build of MiniMax's flagship M-series language model, described in this catalog as a 1.4-trillion-parameter frontier model for coding, agentic workflows, and complex reasoning, served at fp8 with a 512K-token context window. It is positioned alongside the full MiniMax M3 release, which MiniMax presents as a model combining frontier coding, ultra-long context, and native multimodal input in a single architecture.

The central change from earlier M-series models is MSA (MiniMax Sparse Attention), which replaces the quadratic cost of full attention to enable native ultra-long-context pretraining, according to MiniMax. The production M3 supports up to 1M tokens with a guaranteed 512K minimum; this preview exposes the 512K tier.

Compared with prior family members such as MiniMax M2.7 and MiniMax M2.5, which remain available for existing workflows, MiniMax frames coding and agentic capability as M3's key areas of improvement, with autonomous task decomposition, tool invocation, and multi-step reasoning.

As a function-calling and web-search-capable model, M3 Preview is aimed at long-horizon agentic and computer-use tasks, including code generation and tool-driven workflows. Being a preview, weights, the technical report, and full availability above 512K tokens were still being rolled out around launch, with the model exposed here at the 512K context tier.

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 4d ago