Google·💬 Text Generation

Gemma 4 31B Instruct🔒Private

ReasoningWeb SearchE2EEprivate

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

Gemma 4 31B Instruct — TLDR

🔒 Runs in a Trusted Execution Environment with hardware attestation
🆕 Google's Gemma 4 generation, dense 31B-parameter open model
🧠 Configurable thinking mode for step-by-step reasoning before answering
📏 This deployment exposes a 32K-token context window
👁️ Multimodal input: accepts both text and images
🌐 Pre-trained across many languages
🔧 Native function calling and system-prompt support
📚 Apache-2.0 licensed open weights

💰 Pricing

$0.139 / $0.430

per 1M · input / output

📏 Context

32K tokens

📅 On Venice since

May 20, 2026

60 days ago

Provider

Google

Google is an American multinational technology corporation and one of the world's most valuable brands. A subsidiary of parent company Alphabet Inc., Google operates across search, cloud computing, consumer electronics, and artificial intelligence. Its…

Read full profile →

30 models on Venice

11 video · 10 text · 3 image · 3 inpaint · 1 music · 1 embedding · 1 tts

Since Oct 15, 2024

Wikipedia ↗Official site ↗

See 29 other models from Google →

About this model

Gemma 4 31B Instruct is a member of Google's Gemma 4 open-weights family, here served inside a Trusted Execution Environment so that enclave identity and configuration can be independently verified via hardware attestation. The underlying model is a roughly 31B-parameter dense transformer with a vision encoder enabling text-and-image input. It is compact enough to run on capable single-GPU workstations.

Compared with its same-family predecessor, Gemma 3 27B, Gemma 4 introduces several concrete changes documented on Google's model card. It adds a built-in thinking mode for explicit step-by-step reasoning, native system-role support, and native function calling for agentic workflows. These features make it more directly usable for tool-driven and structured-output tasks than the earlier generation.

On safety, Google's model card describes improvements across content-safety categories relative to prior Gemma releases while aiming to keep unjustified refusals low. The model retains broad multilingual pre-training, and the 31B variant can process video supplied as frames.

Within Venice's catalog this instance carries a 32K context window and web-search plus end-to-end-encryption capabilities. Siblings include the non-enclave Google Gemma 4 31B Instruct and the Gemma 4 26B A4B Uncensored mixture-of-experts variant. Weights are released under Apache-2.0.

🤗View model card on HuggingFace ↗View source on GitHub ↗

Sources

gemma-4-31b-it Model by Googlebuild.nvidia.com ↗

google/gemma-4-31B-it · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv2607.02770Jul 2026

Gemma 4 Technical Report(2026)

Gemma Team, Sherif El Abd, Vaibhav Aggarwal et al.

We introduce Gemma 4, a new generation of open-weight, natively multimodal language models in the Gemma model family. Designed to advance compute efficiency and reasoning, the Gemma 4 model suite features dense and Mixture-of-Experts architectures, ranging from 2.3B to 31B…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 2d ago