| Model ID glm-5.2 | Context Window 1M | Max Context 128K | Provider Zhipu AI |
|---|---|---|---|
| Capabilities | Speed Medium | Cost Medium | Intelligence Text-to-Text |
Capabilities
- Reasoning: Has Thinking mode with adjustable capacity and controllable effort for difficult tasks.
- Tools (Function Calling / MCP): Highly reliable and focused on staying within scope.
- Structured Output (JSON): Ideal for seamlessly integrating with the operational flows of external tools.
See more in the official documentation: glm 5.2 documentation.
Details (context, cost, retention)
Extreme Lossless Context
Support for 1 million tokens makes it possible to send an entire code repository, extensive manuals, and heavy histories in a single agent prompt. It keeps continuous information without getting lost in “hallucinations” as the chat progresses. It supports Context Caching, which makes long conversations cheaper on the platform.
To use the maximum context, you need to activate Max Mode in the chat, but this may involve higher costs.
Cost Efficiency
It can outperform several competitors (such as DeepSeek v4 and Gemini 3.1 Pro in sustained coding) while costing around 1/6 of the price of equivalent proprietary alternatives from other providers.
Pricing and consumption
Credit consumption in Tess for this model occurs according to the tokens processed:- Input Tokens (Environmental reading / Prompt): 0.672 credits / 1K tokens
- Output Tokens (Response generation): 2.112 credits / 1K tokens
Tasks with 1M tokens can generate a high peak in reading consumption due to the absolute volume of data entered in the input. Using the Context Caching feature can help automatically reduce this cost.