News from Together AI (via openrss)

Together AI (via openrss) Open Source 1 day ago

ParallelKernelBench: Frontier LLMs can't write fast multi-GPU kernels (yet)

ParallelKernelBench tests whether LLMs can write fast multi-GPU CUDA kernels across 87 real workloads. The best model solves under a third, but a few generated kernels…

Together AI (via openrss) Open Source June 17, 2026

Kimi K2.7 Code vs Claude Fable 5: Landing pages that cost 94% less

We generated 12 landing pages with Kimi K2.7 Code and Claude Fable 5. Kimi cost 94% less and scored within a few points on every page.…

Together AI (via openrss) Open Source June 10, 2026

Building trust in enterprise AI: Together AI earns ISO 27001:2022 certification

Together AI has earned ISO 27001:2022 certification, validating our commitment to enterprise-grade security for production AI workloads.

Together AI (via openrss) Open Source June 2, 2026

Serving MiniMax-M3 for efficient inference: Unlocking 1M-Token Context and Multimodality Without Regrets

How Together served MiniMax-M3 efficiently with KV-block-major sparse attention, paged MSA decode, optimized index scoring, and a Rust-based multimodal gateway.
