arXiv cs.LG
· Papers
Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
arXiv:2510.18874v3 Announce Type: replace Abstract: Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities -- a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating this phenomenon, we systematical