News · 2026-06-22

Suddenly, downloadable AI models look like an insurance policy

For most of the last few years, 'open' AI models -- the kind you can download and run on your own hardware -- were treated as the enthusiast's choice: cheaper, more private, fun to tinker with, but a step behind the polished hosted products from the big labs. This week that calculus changed, and not because of any single release. It changed because a top hosted model vanished overnight on a government order, and everyone who builds on AI suddenly asked the same question: what happens to my product if the model I depend on disappears? A model you've already downloaded can't be switched off by a memo. That's the new appeal, and the timing has lit a fire under an already crowded field.

The field is genuinely crowded. This cycle alone brought a fresh wave of heavyweight open models: a new top-tier release from DeepSeek (DeepSeek-V4-Pro) and a large multimodal model from MiniMax (MiniMax-M3), both racking up downloads near the very top of the charts within a day. They join GLM-5.2, whose recent arrival is now being judged not on its launch but on how it actually performs in real work.

That's where an important nuance comes in, and it's one the hype tends to flatten. An independent evaluation group, Artificial Analysis, ran these models through a test of practical knowledge-work tasks (AA-Briefcase) and the honest ranking is more interesting than the headlines. The leading open model holds its own -- it lands ahead of one of OpenAI's well-regarded models -- but it still sits behind the two Anthropic models at the top. So the accurate story is 'the best open model now beats a major closed competitor and is closing in on the frontier,' not 'open models have won.' Anyone telling you the open model simply beats everything is quoting half a leaderboard. For why benchmark comparisons need this kind of care, see our guide to how AI is benchmarked and the recent piece on why a leaderboard can mislead.

There's a second shift worth naming: speed stopped being the closed labs' advantage. One hosting company, Baseten, showed it could serve the leading open model at hundreds of tokens a second on the newest chips (how they built it). The practical meaning: 'open' no longer has to mean 'slow' or 'run it yourself on a sluggish home rig.' You can get frontier-class responsiveness from a model whose weights are public, which removes one of the last reasons businesses defaulted to closed providers.

Here's a simple way to think about why this all matters. Renting versus owning. A hosted model is renting: convenient, always maintained, but the landlord can change the locks. An open model is owning: more responsibility, more setup, but nobody can evict you. For years renting was clearly the better deal because the rentals were nicer. This week reminded everyone that you can be evicted with no notice -- and, separately, that the houses you can own have gotten very nice indeed. The combination is what's driving the surge of attention.

The honest caveats are real and worth stating plainly. First, the specifications these labs advertise -- how big the models are, how they're built -- are largely self-reported and haven't been independently verified, so treat the spec sheets as marketing until outside analysis catches up. Second, 'matches the frontier in one test of office tasks' is not 'matches the frontier everywhere'; these models can still trail on the hardest reasoning and the longest, messiest jobs. Third, the biggest of them demand serious, expensive hardware to run well, which means the 'insurance policy' is genuinely practical for a company with a server budget and mostly aspirational for an individual with a single graphics card. The shift is real, but it's a shift in the strategic logic of who depends on whom -- not a claim that open has already won.

Primary source, verified: read the paper →