r/LocalLLaMA
· Communities
Gefen is a drop-in replacement for the AdamW optimizer, claims 8x memory reduction in training (GitHub available)
Paper: https://arxiv.org/abs/2606.13894 GitHub: https://github.com/ndvbd/Gefen submitted by /u/indicava [link] [comments]