Ground Truth.
AI, checked against the source.

open-weights

Everything on Ground Truth tagged “open-weights” — 9 items.

DeepSeek's new open models give everyone a million-word memory by default News

DeepSeek previewed two free-to-download V4 models that can read a million tokens at once, offered as the standard setting rather than a premium add-on.

Distillation: how a small AI learns from a big one Lesson

Distillation trains a smaller, cheaper model to imitate a larger, smarter one, the idea behind both efficient deployment and the 'copying' accusations now driving AI geopolitics.

Are closed AI models overpriced luxury goods? News

An essay argues open-weight models now undercut the big closed AIs by huge margins, and that 'China fears' are being used to protect those prices.

A language model that doesn't write left to right News

iLLaDA is an 8-billion-parameter model that generates text by refining a blurry whole rather than one word at a time, and it's catching up to the mainstream.

Qwen3.6 (open weights) Tool

Alibaba's stable Qwen3.6 release: open-weight general chat and coding models you can self-host, the same family at the center of this week's open-vs-closed pricing debate.

Qwen-Image-2.0-Pro Tool

Alibaba's latest open image-generation model in the Qwen family, downloadable and runnable locally, part of a broad open-weight release wave that also refreshed the Qwen3.6 chat models.

LLaDA / iLLaDA Tool

An openly released diffusion language model (weights and code) that generates text by refining a whole passage at once rather than one word at a time, useful for experimenting with non-autoregressive generation and infilling.

DeepSeek-V4 (Pro & Flash) Tool

Two newly previewed open-weight models with a 1-million-token context window on by default: a large mixture-of-experts flagship and a smaller, fast everyday model. Downloadable weights plus an API.

DeepSeek V4 Pro (API) Tool

A strong open-weight reasoning and coding model now offered through DeepSeek's own API at a permanently reduced per-token price, undercutting frontier closed models for high-volume work.