open-weights
DeepSeek's new open models give everyone a million-word memory by default News
DeepSeek previewed two free-to-download V4 models that can read a million tokens at once, no longer as a premium add-on but as the standard setting.
Distillation: how a small AI learns from a big one Lesson
Distillation trains a smaller, cheaper model to imitate a larger, smarter one, the idea behind both efficient deployment and the 'copying' accusations now driving AI geopolitics.
Are closed AI models overpriced luxury goods? News
An essay argues open-weight models now undercut the big closed AIs by huge margins, and that 'China fears' are being used to protect those prices.
A language model that doesn't write left to right News
iLLaDA is an 8-billion-parameter model that generates text by refining a blurry whole rather than one word at a time, and it's catching up to the mainstream.
Qwen3.6 (open weights) Tool
Alibaba's stable Qwen3.6 release: open-weight general chat and coding models you can self-host, the same family at the center of this week's open-vs-closed pricing debate.
Qwen-Image-2.0-Pro Tool
Alibaba's latest open image-generation model in the Qwen family, downloadable and runnable locally, part of a broad open-weight release wave that also refreshed the Qwen3.6 chat models.
LLaDA / iLLaDA Tool
An openly released diffusion language model (weights and code) that generates text by refining a whole passage at once rather than one word at a time, useful for experimenting with non-autoregressive generation and infilling.
DeepSeek-V4 (Pro & Flash) Tool
Two newly previewed open-weight models with a 1-million-token context window on by default - a large mixture-of-experts flagship and a smaller, fast everyday model. Downloadable weights plus an API.
DeepSeek V4 Pro (API) Tool
A strong open-weight reasoning and coding model now offered through DeepSeek's own API at a permanently cut, low per-token price, undercutting frontier closed models for high-volume work.