Feed · AI Feed

LessWrong AI Communities 3 hr ago

Refusal Is Complicated As Hell: An Update

TL;DR: It would make sense to briefly skim through our previous post that introduces our experiments on refusal in LLMs. There we explain how it started, here…

llama.cpp releases Infrastructure 3 hr ago

b9830

common : allow --offline in llama download (#25091) Expose the existing --offline flag to llama download so a script can run it to check whether a…

X · @elonmusk X / Twitter 3 hr ago

Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in supplemental training, is now in private beta at SpaceX & Tesla. Early eval…

r/LocalLLaMA Communities 3 hr ago

"DuckDuckGo is blocking with a CAPTCHA. Let me try other approaches:"

My local llama.cpp-based LLM just started reporting this this morning: "DuckDuckGo is blocking with a CAPTCHA. Let me try other approaches:" Is anyone else seeing this…

r/LocalLLaMA Communities 4 hr ago

This application to join the GPT 5.6 Sol preview is wild

+ face scanner, fingerprint checker and passport swiper. Submitted by /u/Complete-Sea6655.

THE DECODER Tech Media 4 hr ago

Only three AI models finished above starting capital in a 500-day startup survival test

Researchers at Princeton University built CEO-Bench, a test where AI agents have to run a fictional software company for 500 simulated days. Most current models go…

r/LocalLLaMA Communities 4 hr ago

A barebones CPU-only inference engine for Qwen 3, written from scratch in pure C

TL;DR: The (very messy) code and writeups can be found at https://github.com/jakint0sh/qwen3-engine Read the README for instructions on how to get started. And for those who…

Hacker News (front page) Communities 4 hr ago

DLL that was not present in memory despite not being formally unloaded

Article URL: https://devblogs.microsoft.com/oldnewthing/20260625-00/?p=112467 Comments URL: https://news.ycombinator.com/item?id=48705910 Points: 12 # Comments: 0

r/LocalLLaMA Communities 4 hr ago

We’re probably going to need that soon.

From: Vladik on 𝕏: https://x.com/Kostoglodov/status/2071144065857679631 and Shaw (spirit/acc) on 𝕏: https://x.com/shawmakesmagic/status/2070918006033817867. Submitted by /u/Nunki08.

THE DECODER Tech Media 4 hr ago

Chinese cybersecurity firm builds AI tools to rival Mythos and frames the race as cyber-nuclear deterrence

360 founder Zhou Hongyi presents two AI security tools designed to compete with Anthropic's Mythos. One has already flagged 3,432 vulnerabilities. Zhou admits Chinese models trail…

r/LocalLLaMA Communities 5 hr ago

Whisperian: It is one of the best applications for Android, if you want to use Mic with some local ASR models. And it is also available on Play Store.

Submitted by /u/9r4n4y.

Hacker News (front page) Communities 6 hr ago

Can China build its own ASML?

Article URL: https://nikkei.shorthandstories.com/can-china-build-its-own-asml/ Comments URL: https://news.ycombinator.com/item?id=48705276 Points: 26 # Comments: 10

THE DECODER Tech Media 6 hr ago

Sina's open model VibeThinker-3B aims to show reasoning compresses well but factual knowledge doesn't

Sina Weibo's VibeThinker-3B has just three billion parameters but matches models like DeepSeek V3.2 and Kimi K2.5 on math and coding benchmarks. Those models are up…

r/LocalLLaMA Communities 7 hr ago

Reverse engineered DeepSeek Chat into an OpenAI compatible API (V4 & R1 models, no API key, no bills)

DeepSeek already has an official OpenAI compatible API, but it's paid. The consumer web chat, on the other hand, is free. So I built a local…

MarkTechPost Tech Media 7 hr ago

Building a Stable Fable 5 Traces Workflow in Colab: Parsing Tool Calls, Auditing Data, and Training Baselines

In this tutorial, we build a stable workflow around the Fable 5 Traces dataset from Hugging Face. We avoid fragile dependencies and manually parse the merged…

r/LocalLLaMA Communities 7 hr ago

Is Qwen3-VL-2B the only viable VLM for JSON extraction on a "potato"?

After spending countless hours testing on 3 "potato" laptops (Intel i3, 8GB RAM, Win11, integrated GPU), that's my conclusion. For reliably extracting data from images to…

X · @teortaxesTex X / Twitter 7 hr ago

What is this slop? Do they actually mean GLM 5.2-Cyber (non-nerfed version), or some unrepresentative eval?

First Squawk: ZHIPU AI’S NEW MODEL REPORTEDLY MATCHES CLAUDE MYTHOS IN…

LessWrong AI Communities 7 hr ago

A partner who plays music

Drafted March 6 2025. I have thought for many years that it is a risky thing to have a partner who is into creating things, lest you…

llama.cpp releases Infrastructure 7 hr ago

b9829

logs : reduce v2 (#25078) server : reduce logs cont : common cont : spec cont : CMN_ -> COM_ macOS/iOS: macOS Apple Silicon (arm64) macOS…

X · @teortaxesTex X / Twitter 7 hr ago

got notified by a native Chinese speaker that > This is a mis-translation: 对标 means "goes head to head against" or at least "similar to", not emulat…

got notified by a native Chinese speaker that > This is a mis-translation: 对标 means "goes head to head against" or at least "similar to", not emulate. So…

X · @teortaxesTex X / Twitter 8 hr ago

Took a bit over three weeks Mustafa had overseen the first genuinely prosocial thing in his entire evil life, and got sidelined for it. As we say in R…

X · @elonmusk X / Twitter 8 hr ago

RT X Freeze: Happy Birthday Elon 🎂 YOU are the love of humanity ❤️

Source Daily Brief 10 hr ago

AI Morning Brief — 28 June 2026

Overnight in AI: GPT-5.6 Sol caught cheating on evals, Anthropic's banned models near reinstatement, and DeepSeek open-sources a 60–85% inference speedup.

Hacker News (front page) Communities 8 hr ago

Armadillo – A DNS Server in Gleam for Homelab Use

Article URL: https://github.com/vshakitskiy/armadillo Comments URL: https://news.ycombinator.com/item?id=48704816 Points: 7 # Comments: 0
