Z.ai's GLM-5.2 claims the open-weight coding crown with a usable 1M-token context
Z.ai released GLM-5.2, an agentic coding model with a reliable one-million-token context and top open-source scores on long-horizon software benchmarks, with an MIT-licensed weight release promised within weeks.
Orca proposes a single 'world latent space' to replace next-token, next-frame, and next-action prediction
Researchers introduced Orca, a world foundation model that learns one unified latent space from multimodal signals and predicts the next world state rather than the next token or frame, outperforming similar-sized specialists on text, image, and action tasks.
Meta caps employee AI token use after a 'Claudeonomics' leaderboard drove costs toward billions
Meta imposed centralized quotas on employee AI usage after staff burned an estimated 73.7 trillion tokens in about a month, gamifying consumption on an internal leaderboard, with costs projected to reach billions in 2026.
Oracle's own filing lays out how its hundreds-of-billions AI datacenter bet could go wrong
Oracle's regulatory filing candidly enumerates the risks of its massive AI datacenter buildout for clients like OpenAI, including customer non-payment, contract non-renewal, demand misjudgment, and constrained, volatile power supply.
'Dockerless' verifies AI code patches by reading the repo instead of running it
A new method called Dockerless judges whether an AI's code patch is correct by having an agent explore the repository for evidence rather than executing tests in a Docker container, enabling a fully environment-free training pipeline for coding agents.
Two new papers push 'on-policy distillation' to fix privileged teachers and merge specialist skills
DOPD and MOPD advance on-policy distillation -- training a student on its own outputs -- with DOPD routing supervision to avoid a 'privilege illusion' and MOPD merging multiple specialist RL teachers into one model without cross-domain interference.
Strix ships an open-source AI agent that hacks your app to find real vulnerabilities
Strix is an open-source security tool whose autonomous AI agents dynamically find and exploit vulnerabilities in applications, generating working proof-of-concepts and plugging into CI/CD to block insecure code before it ships.
'agency-agents' packages 150+ role-playing AI agents into one open-source 'AI agency'
The open-source agency-agents project defines more than 150 specialized AI agent personas across 13-plus professional divisions, from engineering to marketing to finance, designed to run full multi-agent workflows natively in Claude Code and other agentic coding tools.