credit-assignment
Everything on Ground Truth tagged “credit-assignment” — 1 item.
Crediting an AI for the right steps — without a second model to judge them News
When you reward an AI for a good final answer, it's hard to know which of its steps earned the credit. The usual fix is training a second 'judge' model. This skips that.