Source March 19, 2025 · Daily Brief

AI Daily Brief — 19 March 2025

GTC Day 3 turned into NVIDIA partner day. The lab unspooled the open Nemotron reasoning family, Dynamo as the inference substrate, Spectrum-X / Quantum-X co-packaged optics, and a Boston quantum research center — while Oracle, Microsoft, Cisco, HPE and IBM all locked in deeper enterprise stacks. OpenAI counter-programmed with o1-pro at a record API price tag.

Top stories

OpenAI ships o1-pro in the API at $150/M input + $600/M output. Its most expensive model ever — 2x GPT-4.5 input cost and 10x regular o1 output. Limited initially to API customers who’d spent at least $5; supports vision, function calling, structured outputs, Batch API. Uses more compute per query than o1 for harder problems. via TechCrunch
NVIDIA Llama Nemotron open reasoning models go live. Nano (edge), Super (single-GPU) and Ultra (multi-GPU) tiers with toggleable reasoning. Post-training reportedly lifted accuracy up to 20% over base Llama and delivered ~5x faster inference than other leading open reasoning models. Released on build.nvidia.com and Hugging Face — NVIDIA’s reasoning answer to DeepSeek-R1. via NVIDIA Developer
NVIDIA Accelerated Quantum Research Center (NVAQC) — Boston. NVIDIA committed to a Boston-based research center integrating quantum hardware with AI supercomputing, with a GB200 NVL72-based system of 576 Blackwell GPUs over Quantum-2 InfiniBand. Partners: Harvard (HQI), MIT (EQuS), Quantinuum, QuEra and Quantum Machines. Notable because Jensen had said weeks earlier that useful quantum was decades away. via NVIDIA
Oracle + NVIDIA — 160+ AI tools and 100+ NIM microservices into OCI Console. No-code AI Blueprints deployment, vector-search acceleration in Oracle Database 23ai via cuVS, and NVIDIA AI Enterprise availability across OCI public regions, Government/sovereign clouds, OCI Dedicated Region, Alloy, Compute Cloud@Customer and Roving Edge. via NVIDIA
GM, Gatik and Torc deepen NVIDIA AV partnerships. GM plans NVIDIA-powered ADAS in consumer cars (not robotaxis) plus Omniverse + Cosmos for factory planning and robotics simulation. Gatik (autonomous trucking) and PACCAR-owned Torc Robotics deepen use of DRIVE AGX/Thor. via TechCrunch
Cisco Secure AI Factory with NVIDIA. Cisco announced a Secure AI Factory architecture co-designed with NVIDIA — Cisco Hypershield workload protection, Cisco AI Defense for model/app security, Spectrum-X Ethernet networking, with storage from Pure, Hitachi Vantara, NetApp and VAST Data. via Cisco

Who shipped

NVIDIA ran the entire enterprise stack — open reasoning models, inference framework, optics switches, quantum lab — under the GTC banner. OpenAI chose this day to put a $600/M output price tag on its frontier-tier reasoning, sharpening the price-performance argument. Microsoft, Oracle, Cisco, HPE and IBM announced deeper integrations across the day.

Open-source pulse

Llama Nemotron (open weights, on build.nvidia.com + Hugging Face) and Dynamo (open inference framework supporting SGLang, TensorRT-LLM, vLLM) made NVIDIA’s open-weights footprint as wide as it’s ever been.

Money, infra & hardware

Microsoft Azure made Container Apps serverless GPUs with NVIDIA NIM generally available (per-second billing, optimized cold start), and added Nemotron and Cosmos to Azure AI Foundry. HPE rolled out Blackwell Ultra servers and a Private Cloud AI developer kit.

Quiet corners

No major China-labs release dated Mar 19 — Tencent’s Hunyuan T1 was still two days out. No notable arxiv landmark.

By the numbers

$150 / $600 per million input/output tokens — o1-pro pricing
20% — accuracy lift Llama Nemotron over base Llama
5x — faster inference vs other open reasoning models
576 — Blackwell GPUs in the NVAQC quantum supercomputer
160+ / 100+ — NVIDIA AI tools / NIM microservices into Oracle OCI
Most-mentioned company: NVIDIA

Compiled by AI Feed’s editor from verified web sources for 19 March 2025.