2026-07-01 — Ground Truth

← 2026-06-30 2026-07-01later →

Z.ai's GLM-5.2 claims the open-weight coding crown with a usable 1M-token context

2026-07-01

Z.ai released GLM-5.2, an agentic coding model with a reliable one-million-token context and top open-source scores on long-horizon software benchmarks, with an MIT-licensed weight release promised within weeks.

glm · open-weight-models · coding-agents · long-context · z.ai

Orca proposes a single 'world latent space' to replace next-token, next-frame, and next-action prediction

2026-07-01

Researchers introduced Orca, a world foundation model that learns one unified latent space from multimodal signals and predicts the next world state rather than the next token or frame, outperforming similar-sized specialists on text, image, and action tasks.

world-models · multimodal · foundation-models · embodied-ai · representation-learning

Meta caps employee AI token use after a 'Claudeonomics' leaderboard drove costs toward billions

2026-07-01

Meta imposed centralized quotas on employee AI usage after staff burned an estimated 73.7 trillion tokens in about a month, gamifying consumption on an internal leaderboard, with costs projected to reach billions in 2026.

meta · ai-economics · enterprise-ai · coding-agents · cost

Oracle's own filing lays out how its hundreds-of-billions AI datacenter bet could go wrong

2026-07-01

Oracle's regulatory filing candidly enumerates the risks of its massive AI datacenter buildout for clients like OpenAI, including customer non-payment, contract non-renewal, demand misjudgment, and constrained, volatile power supply.

oracle · openai · ai-infrastructure · datacenters · ai-economics

'Dockerless' verifies AI code patches by reading the repo instead of running it

2026-07-01

A new method called Dockerless judges whether an AI's code patch is correct by having an agent explore the repository for evidence rather than executing tests in a Docker container, enabling a fully environment-free training pipeline for coding agents.

coding-agents · swe-bench · verification · rl-post-training · software-engineering

Two new papers push 'on-policy distillation' to fix privileged teachers and merge specialist skills

2026-07-01

DOPD and MOPD advance on-policy distillation -- training a student on its own outputs -- with DOPD routing supervision to avoid a 'privilege illusion' and MOPD merging multiple specialist RL teachers into one model without cross-domain interference.

distillation · rl-post-training · llm-training · on-policy · capability-integration

Strix ships an open-source AI agent that hacks your app to find real vulnerabilities

2026-07-01

Strix is an open-source security tool whose autonomous AI agents dynamically find and exploit vulnerabilities in applications, generating working proof-of-concepts and plugging into CI/CD to block insecure code before it ships.

ai-agents · security · pentesting · open-source · devsecops

'agency-agents' packages 150+ role-playing AI agents into one open-source 'AI agency'

2026-07-01

The open-source agency-agents project defines more than 150 specialized AI agent personas across 13-plus professional divisions, from engineering to marketing to finance, designed to run full multi-agent workflows natively in Claude Code and other agentic coding tools.

ai-agents · open-source · multi-agent · claude-code · workflows

← 2026-06-30 2026-07-01later →