open-weight-models
Ollama nearly doubles Gemma's speed on Macs by guessing ahead News
A free local-AI tool now runs Google's Gemma model far faster on Apple computers using a trick where a small model drafts words and the big one checks them in bulk.
Mistral releases a lean, open model built for formal math proofs News
Leanstral 1.5 is a free, open model specialized for writing machine-checked mathematical proofs, using a design that keeps only a small slice of itself active at a time.
A 35-billion-parameter agent that punches like a trillion-parameter model News
Shanghai AI Lab argues you can reach giant-model performance on long tasks not by adding parameters, but by training on much longer chains of real work.
An open model from China beat Claude on a security test -- at a sixth of the cost News
Semgrep ran GLM 5.2 against Claude on a narrow vulnerability-finding task and the free, open-weight model came out ahead for far less money.
A model that rivals the frontier now squeezes onto a single high-end desktop News
Aggressive compression shrinks GLM 5.2 by more than 80 percent while keeping most of its accuracy, putting a near-frontier model within reach of local hardware.
Quantization: Shrinking AI Models to Run on Modest Hardware Lesson
Storing a model's numbers with less precision - 8, 4, or even fewer bits instead of 16 - makes it dramatically smaller and faster, often with almost no loss in quality. It's why big models can run on a laptop or a single GPU.
Frontier AI is getting more expensive while open models keep getting cheaper News
Closed frontier models are raising prices and tightening access just as Chinese open-weight models slash theirs, a structural reversal with big consequences for who builds with AI.
The Model Ban Is Quietly Redrawing the AI Map News
Two weeks after the US pulled its top models off the market, a Chinese open model sits atop the global download charts and the community is busy rebuilding the banned capability in the open.
AI Agents Are Learning to Build the Worlds They Train In News
Three new open research projects point the same way: instead of only learning what to do, agents are learning to simulate the environment itself, so they can practice in their own imagination.
A Free Model That Splits Your Work Across 300 Helpers News
Moonshot AI's Kimi K2.6 is a frontier-grade model anyone can download, and its headline trick is fanning a single job out to hundreds of helpers working in parallel.
Qwen-AgentWorld Tool
Alibaba's open language world model that simulates agent environments -- browser, terminal, phone, coding workspace and more -- so other agents can be trained inside the simulation. Released with open weights and code in two sizes.
MiniMax-M3 Tool
A natively multimodal open model trained on text, image, and video from the first step, with a million-token context and a sparse-attention design built for speed; downloadable for self-hosting and also offered through MiniMax's own API and agent platform.
Kimi K2.6 weights (Hugging Face) Tool
The actual Kimi K2.6 model weights, published under a modified-MIT license for anyone to download, run, and build on; large enough that full-strength use needs a multi-GPU node.
Kimi (Kimi K2.6) Tool
Moonshot AI's web assistant and agent, running the open-weight Kimi K2.6 model; free to use in the browser for chat and long-horizon agent tasks, with the weights also downloadable for self-hosting.
GLM 5.2 (GGUF, runnable locally) Tool
Zhipu AI's open, MIT-licensed mixture-of-experts model with a roughly million-token context, now packaged as ready-to-run quantized files you can host on your own machine. Strong on agent and coding workflows; this week it beat Claude on a narrow security benchmark at a fraction of the cost.