Mistral AI·💬 Text Generation

Mistral Small 4

ReasoningVisionCodeFunction CallingWeb Searchfp8private

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

Mistral Small 4 — TLDR

🆕 Unifies instruct, reasoning, and coding into one model
🧠 119B-parameter MoE, only ~6.5B active per token
🔧 128 experts, 4 active per token
📏 Native 256K-token context window
👁️ Accepts text and image input, text output
🎯 Configurable reasoning effort, toggle fast vs. thinking mode
🔒 Apache 2.0 license, open weights
💬 Native function calling and JSON output for agentic use

💰 Pricing

$0.188 / $0.750

per 1M · input / output

📏 Context

256K tokens

📅 On Venice since

Mar 16, 2026

125 days ago

Provider

Mistral AI

Mistral AI is a French artificial intelligence company headquartered in Paris, founded in 2023. The company focuses on developing large language models offered under both open-weight and proprietary licenses. Mistral AI has quickly risen to prominence in the…

Read full profile →

2 models on Venice

2 text

Since Jan 15, 2026

Wikipedia ↗Official site ↗

See 1 other model from Mistral AI →

About this model

Mistral Small 4, released March 16, 2026, is Mistral AI's open-weight hybrid model that consolidates three previously separate model families—Instruct, Reasoning (formerly Magistral), and Devstral coding—into a single checkpoint. It uses a Mixture-of-Experts architecture with 119 billion total parameters spread across 128 experts, activating only 4 experts (about 6.5 billion parameters) per token, giving it the inference profile of a much smaller dense model. It accepts text and image input, supports a 256K-token context window, and exposes a configurable reasoning-effort parameter that toggles between fast instant replies and a slower thinking mode.

This marks a substantial change from its same-family predecessor, Mistral Small 3.2 24B Instruct, a 24B dense instruction model from January 2026. Where Small 3.2 focused on text instruction following, Small 4 moves to a sparse MoE design, adds native vision, integrates dedicated reasoning and agentic-coding behavior, and expands deployment to enterprise-scale tasks.

Mistral distributes Small 4 under the Apache 2.0 license with multiple checkpoints, including an NVFP4 4-bit quantized version and a trained Eagle head for speculative decoding. Mistral positions it for chat assistants, coding, agentic workflows, and reasoning tasks, with native function calling and JSON output.

🤗View model card on HuggingFace ↗View source on GitHub ↗

Sources

Mistral Small 4 - Mistral AI | Mistral Docsdocs.mistral.ai ↗

mistralai / mistral-small-4-119b-2603docs.api.nvidia.com ↗

mistral-small-4-119b-2603 Model by Mistral AIbuild.nvidia.com ↗

mistralai/Mistral-Small-4-119B-2603 · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 4d ago