News · 2026-07-03

ByteDance says AI agents double their learning speed every three months

ByteDance's Seed AI team published evidence this week for what could be a new scaling law: over roughly 38,000 hours of agent activity across 134 demanding tasks, the rate at which AI agents learn from real environments has roughly doubled every three months from late 2025 to mid 2026. The finding, released with a benchmark called EdgeBench, matters because it points to a growth axis that is not raw pretraining compute -- it is learning by doing.

Key facts

The number: Environment-learning performance follows a log-sigmoid curve with R-squared of 0.998, and the learning rate roughly doubled every three months over the measured period.
The benchmark: EdgeBench -- 134 ultra-long-horizon tasks, each run for 12 hours or more, across software engineering, scientific discovery, formal math, and knowledge work.
The scale: about 38,000 hours of agent-environment interaction analyzed; 51 tasks and the full framework released publicly.
Source: edge-bench.org, the paper, and the ByteDance-Seed GitHub repo.

Here is the background a non-expert needs. For years, the dominant story of AI progress was the scaling laws of pretraining: pour in more data and more compute, and the model gets predictably better. That worked spectacularly, but it is expensive and, many researchers now argue, showing diminishing returns. ByteDance's team asked a different question -- not how well a model does when it is first switched on, but how fast it improves once it is turned loose in a realistic, executable environment and allowed to iterate for hours, the way a human employee learns a job over weeks.

To measure that, they built EdgeBench. Most benchmarks hand a model a task and grade the single answer that comes back. EdgeBench instead drops an AI agent into a working environment -- a codebase, a research problem, a math proof -- with realistic, multi-level feedback, and lets it grind for 12 hours or more, tracking the entire arc of improvement rather than the final grade. Analyzing that mountain of activity, the team reports the first clean evidence that overall performance during this environment-learning process follows a log-sigmoid law with striking precision, hitting an R-squared of 0.998 -- statistically, almost a perfect fit.

The headline claim is the doubling. On tasks with matched starting performance, comparing model releases from late 2025 through mid 2026, the researchers found the speed of environment learning roughly doubled every three months. An analogy: imagine two new hires with identical resumes, one from last fall and one from this spring, thrown into the same job. The spring hire does not start out smarter, but climbs the learning curve about twice as fast. If that pace is real and holds, it is a Moore's-law-like cadence for how quickly agents get good at real work.

Why it matters: as South China Morning Post reported, a result like this reframes where the next round of AI investment should go -- away from ever-larger pretraining clusters and toward rich, executable environments where agents can practice. It also lands squarely in a live debate: prominent researchers have warned that brute-force data-and-compute scaling is running out of road, and EdgeBench offers a candidate replacement engine. If it becomes the long-horizon equivalent of the coding benchmarks that shaped the last two years, it could steer the field.

The honest caveat, which the authors themselves flag: the paper "leaves open whether this trend will continue or flatten once models saturate the current task set." A three-month doubling measured over roughly nine months is a real historical pattern, not a law of nature -- extrapolating it years forward would be exactly the kind of overreach the pretraining-scaling story got accused of. And because ByteDance both built the benchmark and reported the trend, independent replication on the released 51 tasks will be the real test. For now, it is a serious, unusually well-fit measurement that deserves scrutiny, not a settled new law.

Primary source, verified: read the paper →

Key questions

What is the new scaling law ByteDance found?

Across roughly 38,000 hours of agent activity, overall performance during environment learning followed a log-sigmoid curve with very high precision, and the rate of learning roughly doubled every three months from late 2025 to mid 2026.

What is EdgeBench?

EdgeBench is a benchmark of 134 ultra-long-horizon tasks -- spanning software engineering, science, math, and knowledge work -- where an agent works continuously for 12 hours or more per task, and its full improvement trajectory is tracked rather than just a final score.

Does the doubling trend definitely continue?

No -- the paper explicitly leaves open whether the trend will keep going or flatten once models saturate the current tasks, so the three-month doubling is a measured historical rate, not a guaranteed future one.

Cite this

APA

Ground Truth. (2026, July 3). ByteDance says AI agents double their learning speed every three months. Ground Truth. https://groundtruth.day/news/bytedance-edgebench-agents-double-their-learning-speed-every-three-months.html

BibTeX

@misc{groundtruth:bytedance-edgebench-agents-double-their-learning-speed-every-three-months,
  title  = {ByteDance says AI agents double their learning speed every three months},
  author = {{Ground Truth}},
  year   = {2026},
  month  = {jul},
  url    = {https://groundtruth.day/news/bytedance-edgebench-agents-double-their-learning-speed-every-three-months.html}
}

Topics: bytedance · scaling-laws · ai-agents · benchmarks · reinforcement-learning

Comments are replies to this story on Bluesky — reply with any Bluesky account to join in.