Ground Truth.
AI, checked against the source.

News · 2026-07-02

China's GLM-5.2 Ships as the Top Open-Weight Model, Under MIT License

Z.ai has released GLM-5.2, a 753-billion-parameter mixture-of-experts model, as open weights under an MIT license, and independent benchmarking now ranks it the strongest open-weight model available. It lands just behind the leading closed models from Anthropic and OpenAI on a third-party intelligence index while costing roughly a fifth as much to run through a provider, tightening the gap between what anyone can download and what only the top labs can sell.

Key facts

For most of the modern AI era there has been a comfortable gap: the best models were closed, sold by a handful of American labs, and the open models you could download yourself trailed a generation or more behind. GLM-5.2, from the Chinese lab Z.ai (the company behind the GLM series), is the clearest sign yet that the gap is closing. This is not a research demo or a weights-coming-soon promise; the full model sits on Hugging Face right now under an MIT license, one of the most permissive terms in software, which lets anyone use it commercially with almost no strings.

Under the hood it is a mixture-of-experts model, meaning that although it has 753 billion parameters in total, only a fraction activate for any given token, which keeps it far cheaper to run than a dense model of the same size. It also handles a 1-million-token context, enough to hold an entire codebase or a stack of long documents in view at once.

The number that matters is the ranking. Artificial Analysis, a third-party benchmarking outfit with no stake in Z.ai, places GLM-5.2 at 51 on its combined intelligence index, the highest of any open-weight model it tracks, against 56 for Anthropic's Opus 4.8, 53 for GPT-5.5, and 50 for Gemini 3.5. In other words, the best model you can download is now within striking distance of the best models you can only rent, and it undercuts them dramatically on price: roughly $1.40 per million input tokens versus $5 for Opus 4.8. That is the story practitioners care about, and it echoes the recent milestone that you can now run a Claude-class model on your own hardware.

There are two honest caveats, and they matter. First, Z.ai's own marketing includes head-to-head coding-benchmark numbers that supposedly beat GPT-5.5; those specific figures trace to the company's own model card, not an independent reproduction, so treat them as vendor claims until an outside party confirms them. "Top open-weight model on an independent index" is verified; "beats GPT-5.5 at coding" is not. Second, open weights do not mean easy to run: at 753B parameters, actually self-hosting GLM-5.2 takes somewhere in the range of 800 to 1,600 gigabytes of GPU memory, well beyond a home rig, as practitioners on r/LocalLLaMA were quick to note. Most people will still access it through a hosting provider.

Why it matters goes beyond one model. On community forums the reaction has fixated less on raw capability than on control: self-hosting an open model means not handing your data to a foreign company, and, pointedly, insurance against a government or vendor cutting off access, a fresh worry after the recent export-control episode around Anthropic's top model. One widely-upvoted comment captured the mood: "the ability to self-host will be the determining factor for which models succeed in the long run." GLM-5.2 turns that argument from aspiration into a concrete, downloadable option.


Primary source, verified: read the paper →

Key questions

Is GLM-5.2 really open weights?

Yes. The weights are published on Hugging Face under an MIT license, one of the most permissive licenses available, meaning anyone can download, run, modify, and build on the model.

How does GLM-5.2 compare to closed frontier models?

On the independent Artificial Analysis Intelligence Index it scores 51, the highest of any open-weight model tracked, versus 56 for Anthropic's Opus 4.8 and 53 for GPT-5.5, so it is close behind rather than ahead.

How much does it cost to use versus Claude?

Provider pricing lands around $1.40 per million input tokens and $4.40 per million output, roughly a fifth of Claude Opus 4.8's $5 and $25, though running the full model yourself requires hundreds of gigabytes of GPU memory.
Cite this

APA

Ground Truth. (2026, July 2). China's GLM-5.2 Ships as the Top Open-Weight Model, Under MIT License. Ground Truth. https://groundtruth.day/news/glm-5-2-open-weights-top-the-class.html

BibTeX

@misc{groundtruth:glm-5-2-open-weights-top-the-class,
  title  = {China's GLM-5.2 Ships as the Top Open-Weight Model, Under MIT License},
  author = {{Ground Truth}},
  year   = {2026},
  month  = {jul},
  url    = {https://groundtruth.day/news/glm-5-2-open-weights-top-the-class.html}
}

Topics: open-weight-models · china · mixture-of-experts · model-release · inference-cost

Comments are replies to this story on Bluesky — reply with any Bluesky account to join in.