Direct agents with visual prompts in Design Mode
Point, draw, or narrate UI changes in the browser while agents edit the code underneath.
Point, draw, or narrate UI changes in the browser while agents edit the code underneath.
Nexus in the Wild: Real Results from Our Early Access Customers
Most AI pipelines are only as good as the data we provide them with, and that usually means PDFs or other unstructured documents.Contracts, invoices, reports... All…
NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows.
At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems.…
Engram, Weaviate's managed memory and context service for agentic applications, is now generally available.
Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it’s…
Inside AskData: How We Slashed Token Consumption by Over 90%
Together AI built the fastest speech-to-text stack on Artificial Analysis by treating ASR as a full-path systems problem, not just a GPU inference problem.
OpenJarvis v1.0 is now available: an open-source framework for building personal AI agents that run on your own hardware, with Ollama support built-in.
Weaviate Cloud now supports more granular role-based access control with new Editor and Viewer roles for improved security and organizational management.
Turn Azure Data into an AI-Ready Knowledge Base
Use Weaviate's built-in MCP server to give Claude Code, Cursor, and VS Code hybrid search over your codebase and docs. No glue code.
Grok Imagine Video 1.5 is the most exciting video model release from xAI. You can generate realistic video with synchronized audio in a single pass, capable…
At this year’s Google I/O conference, NVIDIA and Google Cloud are accelerating the work of more than 100,000 developers in the companies’ joint developer community, which…
Real-world inference benchmarks for coding agents: 31% more TPS than TensorRT-LLM, 2× better TTFT at saturation, and 76% lower cost than Claude Opus 4.6.
Agentic AI inference at one-tenth the cost per token with NVIDIA Vera Rubin NVL72. Agent sandboxes run 50% faster on NVIDIA Vera than traditional CPUs —…
A substantial improvement in intelligence and behavior over Composer 2, particularly on long-horizon agentic tasks.
Together AI partners with Pearl Research Labs to launch a discounted Pearl-powered inference endpoint for Gemma-4-31B-it-pearl, using Proof of Useful Work to turn AI workloads into…
Violin is an open-source AI video translation tool that combines speech recognition, LLM translation, and text-to-speech to make video content accessible across languages.
Tokenization makes or breaks hybrid search. See how Weaviate's accent folding, custom stopwords, and /v1/tokenize endpoint power multilingual BM25.
Claude Opus 4.7 (fast mode) is now available in Windsurf with the full intelligence of Opus 4.7 and ~2.5x higher output speeds.
Voice finder helps developers search, match, filter, and audition 600+ voices across Together AI TTS models using natural-language prompts or uploaded audio samples.
DeepSeek-V4 makes million-token context a serving-systems problem. Together AI explores the inference work behind V4 on NVIDIA HGX B200, including compressed KV layouts, prefix caching, kernel…