Z.ai·💬 Text Generation·↑ Newer: GLM 5.2

GLM 4.7🔒Private

ReasoningCodeWeb SearchE2EEprivate

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

GLM 4.7 — TLDR

🔒 Runs in a Trusted Execution Environment with hardware attestation.
🔧 Z.ai's flagship tuned for agentic coding and terminal workflows.
🧠 Adds Preserved Thinking and Turn-level Thinking atop interleaved reasoning.
👁️ Improved frontend/UI generation with cleaner layouts and styling.
📏 Catalog lists a 128K-token context window.
🏢 Mixture-of-Experts design, released under the permissive MIT license.
🌐 Supports web search and multi-step agent execution.
🆕 Works natively with Claude Code, Cline, Roo Code, Kilo Code.

💰 Pricing

$1.10 / $4.15

per 1M · input / output

📏 Context

128K tokens

📅 On Venice since

Mar 18, 2026

123 days ago

Provider

Z.ai

Z.ai, formally Knowledge Atlas Technology Joint Stock Co., Ltd., is a Chinese technology company specializing in artificial intelligence. Previously known internationally as Zhipu AI, the company rebranded to Z.ai in 2025. Its core focus is the GLM family of…

Read full profile →

12 models on Venice

11 text · 1 image

Since Apr 1, 2024

Wikipedia ↗Official site ↗

See 11 other models from Z.ai →

About this model

GLM 4.7 is Z.ai's flagship text model, offered here in a confidential-computing configuration: per the catalog and provider description, it runs inside a Trusted Execution Environment (TEE) so that hardware attestation evidence can be independently verified. Z.ai (formerly Zhipu AI) ships its GLM family under the MIT license, and GLM 4.7 is positioned for "task delivery" — an end-to-end loop from requirement understanding to executable code across multiple programming languages.

Relative to its same-family predecessors such as GLM 4.6, GLM 4.7 concentrates on two upgrades: stronger programming capability and more stable multi-step reasoning. Its headline change is a three-tier thinking design that extends the Interleaved Thinking introduced in GLM-4.5 by adding Preserved Thinking and Turn-level Thinking, so reasoning persists across actions and conversational turns instead of restarting each round. This is aimed at the "context collapse" problem in long coding sessions. Z.ai's documentation also highlights improved visual-code and UI understanding for layout, color harmony, and component styling.

On specifications, the catalog lists a 128K-token context window, and the model uses a Mixture-of-Experts architecture.

Within this confidential-computing line, GLM 4.7 precedes the newer GLM 5.2. For users who need verifiable execution privacy alongside agentic coding, the TEE build pairs GLM 4.7's capabilities with attestable hardware isolation.

🤗View model card on HuggingFace ↗View source on GitHub ↗

Sources

GLM-4.7 - Overview - Z.AI DEVELOPER DOCUMENTdocs.z.ai ↗

zai-org/GLM-4.7 · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv2508.06471Aug 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(2025)

GLM-4. 5 Team, :, Aohan Zeng et al.

We present GLM-4.5, an open-source Mixture-of-Experts (MoE) large language model with 355B total parameters and 32B activated parameters, featuring a hybrid reasoning method that supports both thinking and direct response modes. Through multi-stage training on 23T tokens and…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 4d ago