Skip to content
X · @teortaxesTex · X / Twitter

GLM 5.2 is the best Chinese model on ARC-AGI-2, at 22.8% (is that high or max?), on par with Opus 4.5 (16K). …Whereas Grok 4.20 is in the range of Op…

GLM 5.2 is the best Chinese model on ARC-AGI-2, at 22.8% (is that high or max?), on par with Opus 4.5 (16K). …Whereas Grok 4.20 is in the range of Opus 4.7, at 65%.Maybe the first time I seriously doubted ARC. Even mediocre Western labs are far ahead on hill-climbing it.ARC Prize: GLM-5.2 from @Zai_org on ARC-AGI (Veri