cost-optimization
Everything on Ground Truth tagged “cost-optimization” — 2 items.
Headroom Tool
A drop-in proxy that sits between your coding assistant and the AI model and automatically compresses bulky tool outputs, logs, and retrieved text before they reach the model — cutting token usage sharply without changing your code.
Doubleword (async + batch inference) Tool
Run the same models you already use, but on async and batch tiers that trade latency for a large cost cut on workloads that don't need an instant reply: long-running agents, evaluations, and bulk jobs.