About this model
MiniMax M3 Preview is a preview build of MiniMax's flagship M-series language model, described in this catalog as a 1.4-trillion-parameter frontier model for coding, agentic workflows, and complex reasoning, served at fp8 with a 512K-token context window. It is positioned alongside the full MiniMax M3 release, which MiniMax presents as a model combining frontier coding, ultra-long context, and native multimodal input in a single architecture.
The central change from earlier M-series models is MSA (MiniMax Sparse Attention), which replaces the quadratic cost of full attention to enable native ultra-long-context pretraining, according to MiniMax. The production M3 supports up to 1M tokens with a guaranteed 512K minimum; this preview exposes the 512K tier.
Compared with prior family members such as MiniMax M2.7 and MiniMax M2.5, which remain available for existing workflows, MiniMax frames coding and agentic capability as M3's key areas of improvement, with autonomous task decomposition, tool invocation, and multi-step reasoning.
As a function-calling and web-search-capable model, M3 Preview is aimed at long-horizon agentic and computer-use tasks, including code generation and tool-driven workflows. Being a preview, weights, the technical report, and full availability above 512K tokens were still being rolled out around launch, with the model exposed here at the 512K context tier.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.
Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 4d ago