coding

Everything on Ground Truth tagged “coding” — 15 items.

Anthropic's own data says the best coders gain the most from AI News

By studying hundreds of thousands of real coding sessions, Anthropic found that experienced engineers get more out of AI assistants, not less, a direct challenge to the idea that AI levels the playing field.

Can an AI Agent Reproduce Real Science? A New Test Says: Rarely News

A new benchmark points coding agents at the actual computational results behind ninety papers in top journals. The strongest models matched the published science on fewer than one in five.

A Coding AI Ran Through Uber's Yearly Budget in Four Months News

Uber gave Claude Code to about 5,000 engineers, who loved it. By April the company had burned through its entire 2026 AI budget, exposing how badly old software pricing fits new agent tools.

The AI That Now Writes Most of Its Maker's Code News

Anthropic says more than 80 percent of the code it ships is now written by its own model, Claude, and the more interesting numbers are about judgment.

A Free Model That Splits Your Work Across 300 Helpers News

Moonshot AI's Kimi K2.6 is a frontier-grade model anyone can download, and its headline trick is fanning a single job out to hundreds of helpers working in parallel.

AI coding skill in Python doesn't carry over to other languages News

A widely-trusted coding benchmark was Python-only. Expanding it to a dozen languages revealed that models acing Python often stumble badly elsewhere — Python skill isn't general coding skill.

Qwen3.6 (open weights) Tool

Alibaba's stable Qwen3.6 release: open-weight general chat and coding models you can self-host, the same family at the center of this week's open-vs-closed pricing debate.

Kimi K2.6 weights (Hugging Face) Tool

The actual Kimi K2.6 model weights, published under a modified-MIT license for anyone to download, run, and build on; large enough that full-strength use needs a multi-GPU node.

Kimi (Kimi K2.6) Tool

Moonshot AI's web assistant and agent, running the open-weight Kimi K2.6 model; free to use in the browser for chat and long-horizon agent tasks, with the weights also downloadable for self-hosting.

Gemma-4 12B Coder (GGUF) Tool

A fine-tuned, locally-runnable version of Google's Gemma-4 model specialized for programming tasks, packaged in a format that runs efficiently on everyday consumer hardware.

GLM-5.2 on Baseten Tool

The top trending open-weight model served as a fast hosted endpoint, reported at 280+ tokens/sec on Blackwell-class hardware -- an open model you can call like a closed one.

GLM 5.2 (GGUF, runnable locally) Tool

Zhipu AI's open, MIT-licensed mixture-of-experts model with a roughly million-token context, now packaged as ready-to-run quantized files you can host on your own machine. Strong on agent and coding workflows; this week it beat Claude on a narrow security benchmark at a fraction of the cost.

DeepSeek V4 Pro (API) Tool

A strong open-weight reasoning and coding model now offered through DeepSeek's own API at a permanently cut, low per-token price, undercutting frontier closed models for high-volume work.

Claude Sonnet 5 Tool

Anthropic's new most-agentic mid-tier model, close to its flagship on hands-on tool and coding work; now the default on Free and Pro plans.

Claude Code Tool

Anthropic's command-line coding agent that reads a whole codebase, edits files, runs tests and fixes failures on its own; it is the tool behind Anthropic's disclosure that Claude now authors most of its production code.