Skip to content
arXiv stat.ML · Papers

Generalization Analysis of Transformers in Distribution Regression

arXiv:2606.29256v1 Announce Type: new Abstract: In recent years, models based on the Transformer architecture have seen widespread applications and have become one of the core tools in the field of deep learning. Numerous successful techniques, such as parameter-efficient fine-tuning and efficient scaling, have been pr