Google · 💬 Text Generation

Gemini 3.5 Flash

Reasoning · Vision · Function Calling · Web Search · Audio · Anonymized

Quick reference

Gemini 3.5 Flash — TLDR

🆕 Google's newest, most intelligent Flash-tier model, GA in 2026.
📏 1M-token input context for long documents and agentic tasks.
🧠 Built on Gemini 3 Flash foundation with adjustable thinking levels.
👁️ Accepts text, image, video, audio, and PDF inputs.
🔧 First-party function calling, code execution, structured output.
⚡ Targets near-Pro reasoning at Flash-class latency and cost.
🌐 Google Search grounding for agentic, tool-using workflows.
💬 Designed for multi-turn chat and coding assistance.

💰 Pricing

$1.55 / $9.45

per 1M · input / output
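
As a rough illustration of how the listed rates translate into per-request cost, here is a minimal Python sketch. The function name and the example token counts are hypothetical, and the calculation assumes billing is a simple linear function of input and output tokens at the prices shown above; actual billing may include details this page does not state.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 1.55
OUTPUT_USD_PER_MTOK = 9.45


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from token counts at the listed rates.

    Assumes purely linear per-token pricing (an assumption, not confirmed
    by the page).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
cost = estimate_cost_usd(200_000, 4_000)
print(f"${cost:.4f}")
```

Under these assumptions, output tokens cost roughly six times as much as input tokens, so long responses tend to dominate the bill more than long prompts do.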

📏 Context

1M tokens

📅 On Venice since

May 22, 2026

58 days ago

Provider

Google

Google is an American multinational technology corporation and one of the world's most valuable brands. A subsidiary of parent company Alphabet Inc., Google operates across search, cloud computing, consumer electronics, and artificial intelligence. Its…

30 models on Venice

11 video · 10 text · 3 image · 3 inpaint · 1 music · 1 embedding · 1 tts

Since Oct 15, 2024

About this model

Gemini 3.5 Flash is Google's speed-optimized frontier model, released in May 2026 and positioned for agentic workflows, multi-turn chat, and coding assistance. It is built directly on the Gemini 3 Flash reasoning foundation, adding explicit thinking levels that let developers trade quality against cost and latency per request. The model accepts text, image, video, audio, and PDF inputs, outputs text, and carries a roughly 1M-token context window.

Compared with Gemini 3 Flash Preview, its predecessor in the same family, Gemini 3.5 Flash keeps the same 1M-token context, tool set, and platform features while improving performance, so coding and short agentic tasks run at strong quality with lower latency. Google describes it as its most intelligent and capable Flash model to date.

The model integrates first-party function calling, code execution, structured output, and Google Search grounding, making it suited to tool-using agents rather than only single-turn prompts. Google positions it to deliver near-Pro reasoning while retaining Flash-class latency and cost characteristics.
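
To make the function-calling idea concrete, the following Python sketch shows the client side of a tool-using loop: receiving a tool call requested by a model and dispatching it to local code. Everything here is illustrative. The `get_weather` tool, the `TOOLS` registry, and the shape of the `call` dictionary are assumptions modelled on common OpenAI-style chat APIs, not the documented Gemini or Venice payload format, and no network request is made.

```python
import json


def get_weather(city: str) -> dict:
    """Hypothetical local tool; returns canned data instead of calling a real service."""
    return {"city": city, "forecast": "sunny"}


# Registry mapping tool names to Python callables (hypothetical).
TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(call: dict) -> str:
    """Run one model-requested tool call and return the result as a JSON string.

    `call` is assumed to look like {"name": ..., "arguments": "<JSON string>"};
    the real payload shape may differ by provider.
    """
    name = call["name"]
    if name not in TOOLS:
        return json.dumps({"error": f"unknown tool: {name}"})
    try:
        args = json.loads(call["arguments"])
        result = TOOLS[name](**args)
    except (json.JSONDecodeError, TypeError) as exc:
        # Malformed JSON or arguments that do not match the tool's signature.
        return json.dumps({"error": str(exc)})
    return json.dumps(result)


example_call = {"name": "get_weather", "arguments": '{"city": "Lisbon"}'}
print(dispatch_tool_call(example_call))
```

Returning errors as JSON rather than raising lets an agent loop pass the failure back to the model, which can then retry with corrected arguments.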

It is distributed through Google Cloud's Gemini Enterprise Agent Platform and Vertex AI, alongside Google DeepMind's model channels, where the model card and platform documentation detail its capabilities and deployment options.

Sources

Gemini 3.5 Flash | Gemini Enterprise Agent Platform | Google Cloud Documentation (docs.cloud.google.com)

Gemini 3.5 Flash - Model Card, Google DeepMind (deepmind.google)

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 1d ago