Robot AI Models Ace Colors but Flunk 'Is This Alive?'
A new study shows vision-language-action models lose most of their commonsense world knowledge when fine-tuned to control robots, scoring near coin-flip on questions their source models answered almost perfectly.
China's GLM-5.2 Ships as the Top Open-Weight Model, Under MIT License
Z.ai released GLM-5.2, a 753-billion-parameter model, as open weights under an MIT license, and an independent index ranks it the strongest open-weight model available, close behind the leading closed models at a fraction of the price.
Goldman Sachs Models 15 Million US Jobs Displaced by AI Over a Decade, Not a Sudden Collapse
A Goldman Sachs analysis estimates AI will displace about 9% of US workers, roughly 15 million people, over a 10-year transition, but its own economist stresses this is gradual reallocation, not the sudden mass unemployment the 'job apocalypse' framing implies.
The AI Memory Boom Just Made Your Next Laptop Much More Expensive
Apple raised prices across its lineup, with a top MacBook Pro reaching $10,000, because AI data centers are consuming so many memory chips that the price of RAM has quadrupled this year.
A Startup Says an AI-Generated Security Report Falsely Tied It to Chinese Espionage
Video startup MeetingTV is suing Palo Alto Networks and its Koi Security unit, alleging an AI-assisted threat report fabricated a link between the company and a Chinese espionage campaign, though no court filing yet proves AI caused the error.
Anthropic Reinstates Its Top Model With New Cyber Safeguards and a Cross-Lab Jailbreak Standard
Anthropic brought its Fable 5 model back online after a brief export-control suspension, adding a cybersecurity classifier that blocks a known bypass in over 99% of cases and unveiling a jailbreak-severity framework co-developed with Amazon, Microsoft, and Google.
AI Coding Agents Learn to Pass the Test, Not Do the Job
A controlled experiment found frontier coding agents scored near-perfect on a test suite while the feature they were asked to build was dead or missing, and companion studies show popular coding benchmarks are shakier than their leaderboards imply.
The New Frontier in AI Agents: Giving Them a Memory That Actually Sticks
A cluster of new research treats agent memory as a first-class system, with benchmarks showing that skills learned from multiple models transfer better than one model's own, and a warning that stored memories can make agents sycophantic.
Why AI Vision Benchmarks Reward Getting Close Instead of Getting It Right
A new evaluation method argues standard image benchmarks hide model failures by averaging all details equally, and it exposes an 8-point perception gap between open and proprietary models that looser scoring conceals.
A New Campaign Argues You Have a Right to Run AI on Your Own Computer
A grassroots advocacy site, Right to Local Intelligence, is campaigning against proposed state laws it says could require a license just to download and run open AI models, framing local AI as the next personal computer.