Z.ai·💬 Text Generation

GLM 5 Turbo

ReasoningCodeFunction CallingWeb Searchanonymized

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

GLM 5 Turbo — TLDR

🎯 Fast-inference model tuned for agents and coding
🔧 Function calling plus built-in web search
🧠 Reasoning-capable, code-optimized workflows
📏 200K-token context window
📜 Open-source MIT license
🏢 From China's Z.ai (GLM family)

💰 Pricing

$1.20 / $4.00

per 1M · input / output

📏 Context

200K tokens

📅 On Venice since

Mar 15, 2026

126 days ago

Provider

Z.ai

Z.ai, formally Knowledge Atlas Technology Joint Stock Co., Ltd., is a Chinese technology company specializing in artificial intelligence. Previously known internationally as Zhipu AI, the company rebranded to Z.ai in 2025. Its core focus is the GLM family of…

Read full profile →

12 models on Venice

11 text · 1 image

Since Apr 1, 2024

Wikipedia ↗Official site ↗

See 11 other models from Z.ai →

About this model

GLM 5 Turbo is Z.ai's speed-optimized member of the GLM 5 line, released in March 2026 and built for low-latency inference in agent-driven environments and production coding workflows. It pairs reasoning and code-optimization with native function calling and web search, and exposes a 200K-token context window that comfortably handles large codebases and multi-step tool chains. Like the rest of the GLM lineup, it ships under the permissive MIT license, continuing Z.ai's open-source approach since the GLM family rebrand in 2025.

Within Z.ai's catalogue, GLM 5 Turbo sits as the throughput-focused counterpart to the heavier flagship GLM 5 releases. It is now joined by GLM 5V Turbo (released April 2026), which extends the same fast-inference Turbo recipe, while the broader line has advanced through releases like GLM 5.2. Users may still favor the Turbo tier specifically for its speed and cost profile.

It is best suited for developers building responsive coding assistants, autonomous agents, and tool-using applications where fast turnaround and reliable function calling matter more than maximum model size.

🤗View model card on HuggingFace ↗View source on GitHub ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv2602.15763Feb 2026

GLM-5: from Vibe Coding to Agentic Engineering(2026)

GLM-5-Team, :, Aohan Zeng et al.

We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 4d ago