Z.ai·💬 Text Generation

GLM 4.6

ReasoningFunction CallingWeb Searchfp4private

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

GLM 4.6 — TLDR

🧠 Reasoning model with tool use enabled during inference
🆕 Z.ai's GLM upgrade over GLM-4.5 across coding and agents
📏 Roughly 200K-token context for long documents and code
⚡ Vendor reports ~30% lower token consumption versus GLM-4.5
🔧 Native function calling and search-based agentic workflows
🔒 Open weights under the permissive MIT license
🏢 Built by China's Z.ai (formerly Zhipu AI)

💰 Pricing

$0.850 / $2.75

per 1M · input / output

📏 Context

198K tokens

📅 On Venice since

Apr 1, 2024

794 days ago

Provider

Z.ai

Z.ai, formally Knowledge Atlas Technology Joint Stock Co., Ltd., is a Chinese technology company specializing in artificial intelligence. Previously known internationally as Zhipu AI, the company rebranded to Z.ai in 2025. Its core focus is the GLM family of…

Read full profile →

11 models on Venice

10 text · 1 image

Since Apr 1, 2024

Wikipedia ↗Official site ↗

See 10 other models from Z.ai →

About this model

GLM 4.6 is a large reasoning-focused language model from Z.ai, the Chinese lab formerly known as Zhipu AI, released under the MIT license with open weights. It supports a context window of roughly 200K tokens, native function calling, and search-based agentic use, with reasoning ("thinking") that can be toggled on or off at request time. Within the GLM family it succeeds GLM-4.5 and precedes GLM 4.7 and the later GLM 5 and GLM 5.1 releases.

Compared with its predecessor GLM-4.5, Z.ai reports comprehensive gains in real-world coding, long-context processing, reasoning, search, writing, and agentic applications, plus a notable efficiency improvement. The provider states GLM 4.6 consumes over 30% fewer tokens on average than GLM-4.5 while using the same inference method, and that it shows stronger tool-use and search-agent performance. Z.ai evaluated it across eight public benchmarks spanning agents, reasoning, and coding.

The model targets agentic coding workflows and integrates into frameworks and coding agents through its function-calling support. Z.ai has published its test questions and agent trajectories publicly to support reproduction of its reported results.

For context on the family's later direction, Z.ai's own GLM 4.7 model card reports further coding and reasoning gains over GLM 4.6, including 73.8% on SWE-bench (a stated +5.8%) and 42.8% on Humanity's Last Exam (a stated +12.4%), positioning GLM 4.6 as the bridge between GLM-4.5 and that newer generation.

🤗View model card on HuggingFace ↗View source on GitHub ↗

Sources

GLM-4.6 - Overview - Z.AI DEVELOPER DOCUMENTdocs.z.ai ↗

zai-org/GLM-4.6 · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv2508.06471Aug 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(2025)

GLM-4. 5 Team, :, Aohan Zeng et al.

We present GLM-4.5, an open-source Mixture-of-Experts (MoE) large language model with 355B total parameters and 32B activated parameters, featuring a hybrid reasoning method that supports both thinking and direct response modes. Through multi-stage training on 23T tokens and…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 1d ago