Merge pull request #6 from hnyls2002/fine-tune-sglang-command Finetune sglang launching command
GPUs are expensive and setting up the infrastructure to make GPUs work for you properly is complex, making experimentation on cutting-edge models challenging for researchers and…
LFM2-Audio defines a new class of audio foundation models: lightweight, multimodal, and real-time. By unifying audio understanding and generation in one compact system, it enables conversational…
How are artists using AI to make music? 🎶 That’s what our Audio Research team set out to understand when they analyzed 337 musical works. The research…
The race between human-centered work and infinite PowerPoints
RT Kai-Fu Lee: Starting your week with more slides to write? Our @popai team just dropped "Slide Agent" to make presentations simple with AI 👉 Prompt >…
Merge pull request #5 from SplittyDev/patch-1 Fix TileLang link in README
Merge pull request #3 from youkaichao/main Minor typo fix
Merge pull request #1 from zhyncs/main docs: Update README with HuggingFace and SGLang instructions
docs: Update README with HuggingFace and SGLang instructions Added instructions for using HuggingFace and SGLang with Docker.
We invested in improving Claude's ability to help defenders detect, analyze, and remediate vulnerabilities in code and deployed systems. This work allowed Claude Sonnet 4.5 to…
LoRA Without Regret by John Schulman in collaboration with others at Thinking Machines
Context is a critical but finite resource for AI agents. In this post, we explore strategies for effectively curating and managing the context that powers them.
Update README.md for blog links. Links for the blog have changed.
Looking through those little hidden gem stories in the footnote, you will find it so inspiring that researchers with interests in the same topic are able…
We’re launching Liquid Nanos — a family of 350M–2.6B parameter foundation models that deliver frontier‑model quality on specialized, agentic tasks while running directly on phones, laptops,…
Expanding ‘xAI For Government’ with more accessible AI tools for the Federal Government
Additional funding supports our growing global operations and development of frontier enterprise AI technology.
A new web search API is now available in Ollama. Ollama provides a generous free tier of web searches for individuals to use, and higher rate…
Using Cohere Grants to transform students into AI builders.
Here is the ultimate comparison post on all the latest image editing models.
We're excited to announce LFM2-2.6B, the newest and currently largest model in our Liquid Foundation Model 2 series. Building on our 350M, 700M, and 1.2B models,…
Ollama now includes a significantly improved model scheduling system, reducing crashes due to out-of-memory issues and maximizing GPU utilization and performance, especially on multi-GPU systems.
Introducing Remote MCP Support in Beta on GroqCloud