Research Saturday, June 27, 2026 at 04:04 AM

Zhipu GLM 5.2 closes gap on Claude Opus 4.8; open-weight coding enters frontier tier

Zhipu AI's GLM 5.2, released June 13 and ranked June 16, is the first open-weight model to genuinely compete with frontier proprietary coding agents. On Terminal-Bench 2.1, GLM 5.2 scores 81.0, trailing Claude Opus 4.8 by only a few points (85.0); on SWE-Bench Pro it hits 62.1, ahead of GPT-5.5 (58.6) and within striking distance of Opus 4.8. The 753-billion-parameter Mixture-of-Experts model includes MIT-licensed weights distributed via HuggingFace, a 1-million-token context window, and 131,072-token max output—all runnable locally on consumer hardware with quantization.

The model improves dramatically over GLM 5.1 (62.0 → 81.0 on Terminal-Bench) through architecture refinements including IndexShare (reducing per-token FLOPs by 2.9× at 1M context) and MTP layer improvements. Pricing is aggressive: $1.40 input / $4.40 output per million tokens via Fireworks API (roughly one-sixth the blended cost of GPT-5.5 at $35 combined), or flat-rate subscription plans for power users. Developers report GLM 5.2 outperforms Opus 4.8 on some agentic benchmarks (Design Arena, MCP-Atlas) while matching it on long-horizon coding tasks.

For engineers shipping autonomous agents and code generation at scale, GLM 5.2 removes the tradeoff between cost and capability. The open-weight licensing eliminates deployment restrictions; multi-cloud hosting and quantization enable on-premise runs for regulated workloads. This shift signals that open models are no longer a distant second—they now force pricing conversations and architectural decisions around data residency, IP, and inference margin for teams building production AI systems.

Sources

Primary source
avenchat.com
“GLM 5.2 scores 81.0 on Terminal-Bench 2.1, landing within a few points of Claude Opus 4.8 (85.0) and ahead of the rest of the open-weight field. On SWE-bench Pro (62.1), it edges ahead of GPT-5.5 (~58.6) and is within striking range of Opus 4.8.”
kie.ai
“GLM-5.2 was released on June 13, 2026 by Z.ai (formerly Zhipu AI). Fireworks lists GLM-5.2 as 743B-parameter Mixture-of-Experts model with a 1,040K context window and 131,072-token max output. The model is MIT-licensed open weights at $1.40/$4.40 per 1M tokens.”
codersera.com
“GLM 5.2 (Z.ai, June 13 2026) ships a 1M-token context window plus MIT-licensed open weights. Pick GLM 5.2 for cost-controlled, self-hostable, repo-scale agents. Claude Opus 4.8 remains the agentic-coding benchmark at premium per-token pricing.”

Zhipu GLM 5.2 closes gap on Claude Opus 4.8; open-weight coding enters frontier tier

Sources

Get the signal before the noise.