arXiv stat.ML
· Papers
Generalization Analysis of Transformers in Distribution Regression
arXiv:2606.29256v1 Announce Type: new Abstract: In recent years, models based on the Transformer architecture have seen widespread applications and have become one of the core tools in the field of deep learning. Numerous successful techniques, such as parameter-efficient fine-tuning and efficient scaling, have been pr