Skip to content
r/LocalLLaMA · Communities

Gefen is a drop-in replacement for the AdamW optimizer, claims 8x memory reduction in training (GitHub available)

Paper: https://arxiv.org/abs/2606.13894 GitHub: https://github.com/ndvbd/Gefen submitted by /u/indicava [link] [comments]