fine-tuning
Everything on Ground Truth tagged “fine-tuning” — 3 items.
Fine-tuning and LoRA: teaching an old model a new job without retraining it Lesson
You almost never train an AI from scratch. You take one that already knows language and nudge it toward your specific task - and a trick called LoRA lets you do that by adding a tiny sticky note instead of rewriting the whole brain.
veRL Tool
The open RL post-training framework used by most research labs training reasoning models today. Run GRPO, PPO, and related reward-training methods on your own models.
Unsloth Tool
Toolkit and documentation for running and fine-tuning large open models faster and on smaller hardware, including aggressive dynamic quantization recipes that shrink models like GLM 5.2 by 80-plus percent while keeping most of their accuracy. The practical on-ramp to running near-frontier models privately.