Securing AI supply chains: Cohere’s commitment to model signing
Cohere has implemented model signing for all Cohere Command models hosted on Hugging Face to improve integrity and authenticity efforts.
Cohere has implemented model signing for all Cohere Command models hosted on Hugging Face to improve integrity and authenticity efforts.
Today we announced a strategic alliance with @UMG to co-develop professional AI music creation tools, powered by responsibly trained generative AI and built to support the…
New Bulk Data Operations: Update, Delete, and Fetch by Metadata
SWE-1.5 is our latest frontier model, delivering near-SOTA coding performance at unprecedented speed.
Ollama is partnering with OpenAI and ROOST (Robust Open Online Safety Tools) to bring the latest gpt-oss-safeguard reasoning models to users for safety classification tasks. gpt-oss-safeguard…
Day Zero Support for OpenAI Open Safety Model
Today, we release LFM2-ColBERT-350M, a late interaction retriever with excellent multilingual performance. It allows you to store documents in one language (for example, a product description…
MiniMax M2 is now available on Ollama's cloud. It's a model built for coding and agentic workflows.
On-policy distillation provides an elegant way to use the teacher model as a process reward model to provide dense reward while preventing SFT style "OOD shock"…
Merge pull request #1694 from xTimeCrystal/main Qwen3-VL 32B Thinking Does Not Actually Link to Qwen3-VL 32B Thinking
On-Policy Distillation by Kevin Lu in collaboration with others at Thinking Machines
The Production AI Platform.
At PyTorch Conference 2025 in San Francisco, we unveiled five new projects spanning kernel languages, distributed systems, reinforcement learning, agentic frameworks, and edge AI deployment.
Today we announced that we’ve formed a strategic partnership with @EA to co-develop transformative generative AI models, tools, and workflows that empower EA’s artists, designers, and…
Together, Liquid AI, AMD, and Robotec.ai have deployed compact foundation models for autonomous agentic robotics: showcasing a specialized 3-billion parameter Liquid vision-language model (LFM2-VL-3B), running efficiently…
We ran performance tests on release day firmware and an updated Ollama version to see how Ollama performs.
The Hidden Cost of Building: Lessons from Aquant
We’re excited to release LFM2-VL-3B, the newest and most capable addition to our family of vision LFMs (450M and 1.6B). Built on the LFM2-2.6B backbone, this…
When we needed to deploy our hybrid LFM models on-device, we faced a critical challenge: existing inference engines couldn't handle the unique combination of attention and…
We joined our friends at @ComfyUI for a live chat about what's new with Stable Audio 2.5 🥁Check out CJ Carr from our Audio Research team…
Turn whole documents into markdown or grab line-level polygons with two new models from Datalab.
Claude Code's new sandboxing features, a bash tool and Claude Code on the web, reduce permission prompts and increase user safety by enabling two boundaries: filesystem…
What AI to use in late 2025
Based on what I've learned from role models and mentors in Amazon