Zhipu GLM 5.2 closes gap on Claude Opus 4.8; open-weight coding enters frontier tier
Zhipu AI's GLM 5.2, released June 13 and ranked June 16, is the first open-weight model to genuinely compete with frontier proprietary coding agents. On Terminal-Bench 2.1, GLM 5.2 scores 81.0, trailing Claude Opus 4.8 by only a few points (85.0); on SWE-Bench Pro it hits 62.1, ahead of GPT-5.5 (58.6) and within striking distance of Opus 4.8. The 753-billion-parameter Mixture-of-Experts model includes MIT-licensed weights distributed via HuggingFace, a 1-million-token context window, and 131,072-token max output—all runnable locally on consumer hardware with quantization.
The model improves dramatically over GLM 5.1 (62.0 → 81.0 on Terminal-Bench) through architecture refinements including IndexShare (reducing per-token FLOPs by 2.9× at 1M context) and MTP layer improvements. Pricing is aggressive: $1.40 input / $4.40 output per million tokens via Fireworks API (roughly one-sixth the blended cost of GPT-5.5 at $35 combined), or flat-rate subscription plans for power users. Developers report GLM 5.2 outperforms Opus 4.8 on some agentic benchmarks (Design Arena, MCP-Atlas) while matching it on long-horizon coding tasks.
For engineers shipping autonomous agents and code generation at scale, GLM 5.2 removes the tradeoff between cost and capability. The open-weight licensing eliminates deployment restrictions; multi-cloud hosting and quantization enable on-premise runs for regulated workloads. This shift signals that open models are no longer a distant second—they now force pricing conversations and architectural decisions around data residency, IP, and inference margin for teams building production AI systems.
Sources
- Primary source
- avenchat.com
“GLM 5.2 scores 81.0 on Terminal-Bench 2.1, landing within a few points of Claude Opus 4.8 (85.0) and ahead of the rest of the open-weight field. On SWE-bench Pro (62.1), it edges ahead of GPT-5.5 (~58.6) and is within striking range of Opus 4.8.”
- kie.ai
“GLM-5.2 was released on June 13, 2026 by Z.ai (formerly Zhipu AI). Fireworks lists GLM-5.2 as 743B-parameter Mixture-of-Experts model with a 1,040K context window and 131,072-token max output. The model is MIT-licensed open weights at $1.40/$4.40 per 1M tokens.”
- codersera.com
“GLM 5.2 (Z.ai, June 13 2026) ships a 1M-token context window plus MIT-licensed open weights. Pick GLM 5.2 for cost-controlled, self-hostable, repo-scale agents. Claude Opus 4.8 remains the agentic-coding benchmark at premium per-token pricing.”