GLM 5.2

Better Stack · 35 Claims

Agree

The best open model in the world right now is GLM 5.2 from ZAI, a Chinese lab, not from OpenAI.

Author states 'The best open model in the world right now isn't from a company called OpenAI. It's of course from a Chinese lab. And this one is GLM 5.2 from ZAI.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 matches GPT 5.5 on certain benchmarks.

Author says 'matching GPT 5.5 on certain benchmarks.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 appears to beat Fable in at least one benchmark category.

Author says 'there's even a category where it appears to be beating Fable all while being MIT licensed open weight.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 made a leap on Artificial Analysis intelligence index, scoring 51, which is 11 points ahead of GLM 5.1.

Author says 'it's very impressive that they made such a leap... GLM 5.2 here got a score of 51, which is 11 ahead of its previous iteration.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 is the top open model by a healthy margin on the Artificial Analysis combined score, ahead of Qwen 3.7, MiniMax M3, and Kimi K2.6.

Author lists next models and says 'top open model by a pretty healthy margin.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2's intelligence score places it in the same range as Gemini 3.5 Flash and GPT 5.4 at max effort.

Author says 'places it in the same realm as Gemini 3.5 Flash and GPT 5.4 on a max effort, which is pretty insane.'

Source: GLM 5.2 is my new favorite model...

Agree

On the GDPval benchmark included in Artificial Analysis, GLM 5.2 outscores GPT 5.5.

Author says 'on a few of the benchmarks... like GDPval, actually outscores GPT 5.5.'

Source: GLM 5.2 is my new favorite model...

Agree

On the Deep S benchmark, GLM 5.2 outscores Opus 4.7 at medium effort, though not all models have been tested and the harness used was Claude Code.

Author says 'it actually outscores Opus 4.7 on a medium effort... It is worth noting... not every single model has been tested... harness used was actually Claude Code.'

Source: GLM 5.2 is my new favorite model...

Neutral

GLM 5.2 is MIT licensed with open weights.

Factual statement about license.

Source: GLM 5.2 is my new favorite model...

Neutral

GLM 5.2 is a 744 billion total parameter model with 40 billion active parameters.

Presented as a specification.

Source: GLM 5.2 is my new favorite model...
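
As a rough illustration of what the total versus active split means, the sketch below computes the share of parameters that are active at once. The function name `active_fraction` is hypothetical, and the reading that only the active parameters are used per token is an assumption, not something the source states.

```python
def active_fraction(total_params: float, active_params: float) -> float:
    """Return the share of parameters that are active (hypothetical helper)."""
    if total_params <= 0 or not 0 <= active_params <= total_params:
        raise ValueError("expected 0 <= active_params <= total_params, total_params > 0")
    return active_params / total_params


# Figures from the claim above: 744 billion total, 40 billion active.
share = active_fraction(744e9, 40e9)
print(f"Active share: {share:.1%}")
```

On the quoted figures, only a small fraction of the weights would be active at any one time, even though all of them must still be stored to run the model.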

Neutral

GLM 5.2 is the same size as its predecessor GLM 5.1.

Specification detail.

Source: GLM 5.2 is my new favorite model...

Agree

On the coding index, GLM 5.2 scores the same as Gemini 3.1 Pro, beats Sonnet 4.6, and is not far off the top frontier models.

Author states 'on the coding index, it scores the same as Gemini 3.1 Pro and actually beats Sonnet 4.6, and it isn't even that far off the top frontier models.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 is a fair bit ahead of Kimi K2.7 Code on the coding index.

Author says 'It's also a fair bit ahead of Kimi K2.7 Code, which is their newest model.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 took first place overall on Design Arena's single turn HTML web design leaderboard, becoming the first model ever to beat the Claude line including Fable 5.

Author says 'GLM 5.2 just took first place overall... becoming the first model ever to beat the Claude line, including Fable 5.'

Source: GLM 5.2 is my new favorite model...

Agree

Design Arena's investigation found that GLM 5.2 has strong expert templates that avoid common AI anti-patterns (fewer purple gradients) and that it works well with Chart.js, Three.js, and Tailwind.

Author cites Design Arena findings: 'a strong set of expert templates that avoid common AI anti-patterns, so you should get less purple gradients... works really well with common libraries like Chart.js, Three.js, and Tailwind.'

Source: GLM 5.2 is my new favorite model...

Neutral

GLM 5.2 ranks second on GameDev, data viz, and 3D, and fourth on UI components in Design Arena.

Author states 'It sits second on GameDev, data viz, and 3D and fourth when it comes to UI components.'

Source: GLM 5.2 is my new favorite model...

Neutral

GLM 5.2 is a bit slower compared to other top models, presenting a speed trade-off.

Author says 'It does come with a little trade-off that it's a bit slower, but I'll come back to that later.'

Source: GLM 5.2 is my new favorite model...

Agree

On speed, GLM 5.2 outperformed most open models near its intelligence level (DeepSeek V4, Kimi K2.7 Code, MiniMax) but was slower than Gemini 3.1 Pro.

Author says 'When it comes to speed, GLM 5.2 is actually not bad at all. It outperformed most of the open models... and it's a bit behind a frontier model like Gemini 3.1 Pro.'

Source: GLM 5.2 is my new favorite model...

Neutral

Design Arena reported that GLM 5.2 scored highest on user preference but was the slowest among top models, which were all frontier models.

Author says 'Design Arena... say that GLM 5.2 scores the highest on user preference... but it was also the slowest out of the top models... all of those top models are frontier ones and not open ones.'

Source: GLM 5.2 is my new favorite model...

Disagree

GLM 5.2 only accepts text modalities and cannot process image inputs like screenshots.

Author notes as an annoyance: 'it only accepts text modalities. So you can't upload a screenshot and say recreate this.'

Source: GLM 5.2 is my new favorite model...

Agree

When given a text prompt to recreate Linear's page, GLM 5.2 correctly captured the overall elements and UI, producing an impressive recreation.

Author says 'the results I got back were super impressive... got the overall elements right... recreated the UI, which I think was very cool.'

Source: GLM 5.2 is my new favorite model...

Disagree

Without design direction, GLM 5.2 produced a website with heavy purple gradients, contradicting Design Arena's claim that it avoids AI anti-patterns.

Author says 'I'm not sure that I can agree with Design Arena that this doesn't have the usual AI look. This is really using those purple gradients to the max.'

Source: GLM 5.2 is my new favorite model...

Neutral

In a Three.js F1 racing game test, GLM 5.2 took about 10 minutes, used 40,000 tokens, cost 32 cents, and produced a playable but imperfect game with inverted controls.

Author says 'this one got to work... took overall about 10 minutes. ...used 40,000 tokens and cost 32.' The author then describes the control issues.

Source: GLM 5.2 is my new favorite model...

Neutral

In the same F1 game test, Claude Opus 4.8 produced the most playable demo in a single prompt, while Kimi K2.7 Code required a second prompt and gave a less playable result.

Author compares models: 'Claude Opus 4.8... gave us the most playable demo in a single prompt'. Kimi needed a follow-up prompt to fix errors.

Source: GLM 5.2 is my new favorite model...

Agree

For a personal finance dashboard, GLM 5.2 built a functional Next.js application with a Prisma database and working navigation in a single prompt.

Author: 'Here's GLM 5.2's attempt... everything appears to be working... did a very good job from that single prompt... went with a Next.js application and it used Prisma for the database.'

Source: GLM 5.2 is my new favorite model...

Agree

Kimi K2.7 Code's finance dashboard was less polished and used React/Express/node SQL, while Claude Opus 4.8 used an in-memory store with no database, making GLM 5.2's backend the most robust of the three.

Author compares: Kimi gave almost the same app but was missing extras; Opus used an in-memory store. He says 'I actually think GLM 5.2 may have won this one.'

Source: GLM 5.2 is my new favorite model...

Agree

The author believes that for many tasks, GLM 5.2 could be secretly swapped in for Sonnet, or even Opus on simpler tasks, without him noticing the difference.

Author states 'I think for a lot of tasks, you could secretly swap GLM 5.2 in the place of Sonnet or even Opus for simpler tasks and I probably wouldn't notice.'

Source: GLM 5.2 is my new favorite model...

Agree

GLM 5.2 is one of the first open models that the author has not had to fight to use, and one of the first where he did not feel Claude could do the job better or faster.

Author says 'It's one of the first open models that I haven't felt like I'm fighting to use and also one of the first open models where using it I haven't had that feeling of I know Claude could do this better or faster.'

Source: GLM 5.2 is my new favorite model...

Agree

A person (implied to be an Anthropic critic) notes that open models may catch Fable on benchmarks but actual usefulness feels different, and the author agrees that GLM 5.2 is one of the first to bridge that gap.

Author says 'I do have to agree with that sentiment where actually using these models feels a little bit different. But I think GLM 5.2 is one of the first ones that's broken that cycle for me.'

Source: GLM 5.2 is my new favorite model...

Agree

A year ago, the author would not have believed that open models would be anywhere near as capable as GLM 5.2.

Author says 'If you told me a year ago that these open models would be anywhere near this good, I would have been absolutely shocked and probably not believed you.'

Source: GLM 5.2 is my new favorite model...

Disagree

GLM 5.2 is more token hungry than Kimi K2.6, MiniMax, and DeepSeek, averaging 43,000 tokens per task.

Author says 'One of the downsides... it's a little more token hungry when compared to other models... used an average of 43,000 tokens a task, which is more than...'

Source: GLM 5.2 is my new favorite model...

Neutral

GLM 5.2 API pricing is approximately $1.40 per million input tokens and $4.40 per million output tokens, resulting in about 50 cents per task on Artificial Analysis benchmarks.

Presented as factual price information: 'it's around $1.40 for a million input tokens and $4.40 for a million output tokens. ...it actually cost around 50 cents a task.'

Source: GLM 5.2 is my new favorite model...
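
The token counts and costs quoted in these claims can be turned into an effective rate per million tokens, which makes the F1 game test and the benchmark average easier to compare. This is a minimal sketch: the helper name `effective_cost_per_million` is hypothetical, and it assumes each quoted cost and token count describe the same run. The result is an effective rate rather than a list price, since the quoted token counts may not include every billed token.

```python
def effective_cost_per_million(cost_usd: float, tokens: int) -> float:
    """Return the effective price in USD per one million tokens (hypothetical helper)."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return cost_usd / tokens * 1_000_000


# Figures quoted in the claims: 32 cents for 40,000 tokens (F1 game test)
# and about 50 cents for an average of 43,000 tokens (benchmark tasks).
f1_game = effective_cost_per_million(0.32, 40_000)
benchmark_average = effective_cost_per_million(0.50, 43_000)

print(f"F1 game test:      ${f1_game:.2f} per million tokens")
print(f"Benchmark average: ${benchmark_average:.2f} per million tokens")
```

Both results land in the same order of magnitude, which suggests the two quoted cost figures are at least broadly consistent with each other.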

Agree

At its intelligence level, GLM 5.2 is the cheapest model according to the Artificial Analysis cost vs intelligence chart.

Author says 'at its intelligence level, GLM 5.2 is the cheapest model.'

Source: GLM 5.2 is my new favorite model...

Neutral

If you can accept lower intelligence, MiniMax and DeepSeek V4 offer very good value.

Author states 'if you can take a hit to the intelligence, I do think MiniMax and especially DeepSeek V4 are very good for that price.'

Source: GLM 5.2 is my new favorite model...

Agree

Open models are currently around 4 to 6 months behind closed frontier models, and ZAI is promising a Fable-level model by Q1 next year.

Author says 'it really feels like we're at a point where these open models are, let's say, 4 to 6 months behind... we could be looking at a Fable model by next year. And I mean, they themselves are actually promising by Q1.'

Source: GLM 5.2 is my new favorite model...