arXiv cs.LG · Papers

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting

arXiv:2510.18874v3 Announce Type: replace

Abstract: Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities, a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating this phenomenon, we systematical