X · @huggingface · X / Twitter

RT NVIDIA AI: The rise of MoE models introduced new challenges in training, and @huggingface's Transformers v5 brought first-class support for solving them. Now, NeMo AutoModel builds on top of v5. Part of the NeMo framework for building models at scale, NeMo AutoModel brings optimizations to a broad set of model families