X · @huggingface
RT NVIDIA AI: The rise of MoE models introduced new challenges in training, and @huggingface's Transformers v5 brought first-class support for solving them. Now, NeMo AutoModel builds on top of v5. Part of the NeMo framework for building models at scale, NeMo AutoModel brings optimizations to a broad set of model families