Industry

GLM 5.2 Explained: What the New Open Weight Leader Can and Cannot Do

A Chinese lab just shipped the highest scoring open weight AI model yet, under a license that lets anyone download and use it. Here is what is real, what the benchmarks actually say, and the catches the hype skips.

June 27, 2026 9 min read
Key takeaways
  • Every few weeks a new model claims the title of best open AI you can download for free. Most of the time the claim is shaky. This time it has teeth. In mid June 2026, a Chinese lab called Z.ai released GLM 5.2, and independent testing put it at the top of the open model rankings, close enough to the best paid models from American companies to make people stop and look.
  • The internet reaction has been a mix of genuine excitement and wild exaggeration. So here is a clear, honest guide: what GLM 5.2 actually is, what the benchmarks really say, what it costs, and the catches that most of the hype skips over.

What Is GLM 5.2?

GLM 5.2 is a large language model made by Z.ai, a Beijing based company that grew out of Tsinghua University and was formerly known as Zhipu AI. GLM stands for General Language Model.

The specifications are serious. It uses a design called Mixture of Experts, which means that although the model holds about 744 billion parameters in total, only around 40 billion of them switch on for any single request. That keeps it powerful while holding down the running cost. It can handle a context window of one million tokens, which is roughly 750,000 words of text in a single conversation, and it was released on June 13, 2026.

The headline feature, though, is not the size. It is the license.

Why the License Is the Real Story

GLM 5.2 ships under an MIT license. In plain terms, that is about as open and permissive as software licenses get. Anyone, anywhere, can download the model, change it, build a business on top of it, and run it on their own computers, all without paying a fee and without asking permission. Z.ai describes it as having no regional limits and access without borders.

This matters because the powerful models from OpenAI and Anthropic work the opposite way. You rent them through a connection, you pay for every request, and the company can change the rules or cut off access. An MIT licensed model that you fully control removes that dependence. For a business that needs to know its tools cannot be switched off or repriced overnight, that is a powerful selling point, and it is the main reason GLM 5.2 caused such a stir.

Source: Z.ai model documentation and VentureBeat reporting, June 2026.

What the Benchmarks Actually Say

Here is where it pays to be precise, because this is where hype and reality drift apart.

On the Artificial Analysis Intelligence Index, a respected independent scorecard that combines many tests, GLM 5.2 scored 51. That is the highest score any open download model has ever reached, ahead of other strong open models like MiniMax M3 and DeepSeek V4 Pro at 44 and Kimi K2.6 at 43.

But, and this is the part the hype skips, that score places it fourth overall, not first. The top three are still paid models from American labs: Anthropic's Fable 5 and Opus 4.8, and OpenAI's GPT 5.5. On certain coding tests GLM 5.2 does beat GPT 5.5, and on long, planning heavy coding tasks it trades blows with the best. On other tests of raw reasoning it falls clearly behind. The independent reviewer Simon Willison confirmed the headline ranking is real, while some other testers argue parts of it are tuned to look good on benchmarks. The fair summary: it is the best open model yet and genuinely competitive at coding, but it does not beat the top paid models across the board.

Source: Artificial Analysis Intelligence Index v4.1, indexed June 2026.

Artificial Analysis Intelligence Index: Top Open Weight Models
GLM 5.2$51· New open weight leader
MiniMax M3$44· Strong all rounder
DeepSeek V4 Pro$44· Price performance pick
Kimi K2.6$43· Agentic focus

The Price Story, and Its Catch

The reason GLM 5.2 matters commercially is price. Running it costs about 1.40 dollars per million input tokens and 4.40 dollars per million output tokens. Compare that with Claude Opus 4.8 at around 25 dollars per million output tokens, or GPT 5.5 at around 30 dollars. That is roughly five to seven times cheaper for similar quality on many coding tasks.

But there is a catch the price tag hides. GLM 5.2 is what people call token hungry. It thinks out loud a lot, using around 43,000 output tokens to finish a single benchmark task, most of that internal reasoning. Because you pay per output token, a model that produces far more tokens to reach an answer can erase part of its low per token price. So the real cost depends heavily on your task. For long, complex jobs the savings are still large. For short ones the gap narrows.

Source: pricing from Artificial Analysis and OpenRouter, June 2026.

API Output Price per 1 Million Tokens
GLM 5.2$4.4· Open weight
Claude Opus 4.8$25· Premium closed
GPT 5.5$30· Premium closed
Get AI pricing updates biweekly
Verified pricing changes, new model launches, and cost-saving tips.

The Catches You Should Know

Beyond the token appetite, a few more honest caveats.

Running it yourself is hard. The full model is about 1.5 terabytes in size. You cannot simply run it on a normal laptop. People have squeezed shrunken versions onto high end desktop computers, but at reduced quality and slow speed. For most people, using it through a hosting provider is the only practical option, which means you are back to paying per use, just at a lower rate.

It only handles text. Unlike some rivals, GLM 5.2 does not understand images or video.

The benchmark debate is not fully settled. The top ranking is independently confirmed, but some respected voices argue the model is tuned to score well on tests in ways that may not fully carry over to messy real work. Treat single test results as a guide, not gospel.

And if you use Z.ai's own service rather than a Western host, check where your data goes and whether it can be used for training, the same caution that applies to any model.

Why It Landed at the Perfect Moment

Timing made GLM 5.2 a bigger story than its specs alone would suggest. It arrived just days after a United States export control directive forced Anthropic to switch off broad access to its newest top models, Fable 5 and Mythos 5, to prevent access by people outside approved groups.

So at the exact moment a leading closed model became harder to reach, a near top open model that anyone can download for free appeared. For organisations that were suddenly cut off, or that simply want tools nobody can take away, the appeal was obvious. This is the deeper shift GLM 5.2 represents: open models are now close enough to the best closed ones that, for many jobs, the freedom to own and control your tool outweighs the last bit of quality.

Should You Care?

If you build software or run AI heavy workloads, yes. GLM 5.2 is the first open download model that is a realistic first choice for serious coding and planning work, not just a cheap backup. Test it on your own tasks, watch the token usage, and mind your data privacy, and it may cut your costs sharply.

If you are a casual user who just chats with an assistant, it changes less for you directly, but it matters anyway, because this kind of cheap, capable, open competition is what keeps the price of every AI tool in check.

At AI Tools Mentor we track GLM 5.2 and every other major model with prices we re verify every Tuesday and Friday, so you can compare the true cost of open and closed options for whatever you are building.

Sources: Z.ai documentation, Artificial Analysis Intelligence Index v4.1, VentureBeat, and OpenRouter, all June 2026.

glmopen sourcez.aibenchmarkscoding
AT
AI Tools Mentor
We verify pricing for 300+ AI tools against official docs. No estimates — just the actual numbers. Updated weekly.
Share this article
Related Articles

AiToolsMentor.com · Verified AI tool pricing