X · @togethercompute
· X / Twitter
MiniMax-M3 expands what agents can carry into context: long histories, images, video, documents, and tool outputs. Together’s inference work makes th…
MiniMax-M3 expands what agents can carry into context: long histories, images, video, documents, and tool outputs.Together’s inference work makes that practical at scale by improving token throughput across the serving path.More tokens per GPU means more work automated per dollar. We go deeper in this deep dive from @y