🔜
🔜Alexandr Wang: the muse spark API will be coming soon!we have been thrilled with the amount of excitement amongst developers who want to try muse spark…
🔜Alexandr Wang: the muse spark API will be coming soon!we have been thrilled with the amount of excitement amongst developers who want to try muse spark…
New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a…
RT Artificial AnalysisMeta is back! Muse Spark scores 52 on the Artificial Analysis Intelligence Index, behind only Gemini 3.1 Pro, GPT-5.4, and Claude Opus 4.6. Muse…
RT Shengjia ZhaoExcited to share what we’ve been building at Meta Superintelligence Labs! We just released Muse Spark, our first AI model. It's a natively multimodal…
Farzapedia, personal wikipedia of Farza, good example following my Wiki LLM tweet.I really like this approach to personalization in a number of ways, compared to "status…
Something I've been thinking about - I am bullish on people (empowered by AI) increasing the visibility, legibility and accountability of their governments.Historically, it is the…
LLM Knowledge BasesSomething I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large…
The anti-AI coalition continues to maneuver to find arguments to slow down AI progress. If someone has a sincere concern about a specific effect of AI,…
New supply chain attack this time for npm axios, the most popular HTTP client library with 300M weekly downloads.Scanning my system I found a use imported…
RT John SchulmanModels that are great at calibrated predictions will be transformative for decision making. Excited about Mantic's work and proud they're using Tinker. Their new…
RT Brydon EastmanModel has good taste and so do the modellers (taste in training infra, that is)Tinker: tasty AND trained on Tinker >>>
Building technologies for better human-AI collaboration on next gen hardware at scale. Exciting.Thinking Machines: We are partnering with @nvidia to power our frontier model training and…
Grateful to Jensen and @nvidia team for their support. Together, we’re working to deploy at least 1GW of Vera Rubin systems, bringing adaptable collaborative AI to…
RT TinkerContextual AI used Tinker to post-train the planning behavior for a search agent. They land on a two-stage training recipe: On-Policy Distillation and GRPO with…
RT AnthropicA statement from Anthropic CEO Dario Amodei: https://www.anthropic.com/news/where-stand-department-war
RT AnthropicA statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War.https://www.anthropic.com/news/statement-department-of-war
RT TinkerSince Tinker launched, our community has used it to train state-of-the-art models, build infrastructure, and publish novel research. We will be highlighting this creative work…
I’ve been telling people this a lot today: I enjoy so much working with people who care about what they are building and craftsmanship. It is…
We have parted ways with Barret Zoph. Soumith Chintala will be the new CTO of Thinking Machines. He is a brilliant and seasoned leader who has…
RT John Schulmanjack-o-lora
On-policy distillation provides an elegant way to use the teacher model as a process reward model to provide dense reward while preventing SFT style "OOD shock"…
Today I met with PM @narendramodi to discuss Anthropic's expansion to India—where Claude Code use is up 5× since June. How India deploys AI across critical…
GPUs are expensive and setting up the infrastructure to make GPUs work for you properly is complex, making experimentation on cutting-edge models challenging for researchers and…
Looking through those little hidden gem stories in the footnote, you will find it so inspiring that researchers with interests on the same topic are able…