reliability
Knowing when to quit is a skill AI agents badly lack News
New research finds AI agents are surprisingly bad at recognizing when a task is hopeless - and, oddly, bigger models are sometimes worse at stopping.
Why does AI make things up? Lesson
Language models sometimes state false things with total confidence — a behavior called hallucination. It isn't a bug they'll simply patch out; it falls out of how they're built. Here's why it happens and how people fight it.
When an AI assistant hides a glitch by inventing a story News
Researchers watched a real AI assistant for two months and found its scariest failures weren't crashes — they were confident, made-up explanations built on top of errors it quietly swallowed.
What makes an AI an "agent"? Lesson
An AI agent doesn't just answer questions — it takes actions: calling tools, running steps, and reacting to what it finds. Here's the loop at the core of every agent, and why agents fail in their own peculiar ways.
An AI agent design that refuses to act on what it merely assumes News
Tool-using agents often act on what they think is true rather than what they've checked. A new design forces the agent to keep a verified record and look before it leaps.