Refusal Is Complicated As Hell: An Update
TL;DRIt would make sense to briefly skim through our previous post that introduces our experiments on refusal in LLMs. There we explain how it started, here…
Every story across every category, newest first. Each card links to the original publisher; daily-brief posts open as editorial pages.
TL;DRIt would make sense to briefly skim through our previous post that introduces our experiments on refusal in LLMs. There we explain how it started, here…
common : allow --offline in llama download (#25091) Expose the existing --offline flag to llama download so a script can run it to check whether a…
Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in supplemental training, is now in private beta at SpaceX & Tesla. Early…
My local llama.cpp-based LLM just started reporting this this morning: "DuckDuckGo is blocking with a CAPTCHA. Let me try other approaches:" Is anyone else seeing this…
+ face scanner, fingerprint checker and passport swiper submitted by /u/Complete-Sea6655 [link] [comments]
Researchers at Princeton University built CEO-Bench, a test where AI agents have to run a fictional software company for 500 simulated days. Most current models go…
TL;DR: The (very messy) code and writeups can be found at https://github.com/jakint0sh/qwen3-engine Read the README for instructions on how to get started. And for those who…
Article URL: https://devblogs.microsoft.com/oldnewthing/20260625-00/?p=112467 Comments URL: https://news.ycombinator.com/item?id=48705910 Points: 12 # Comments: 0
From: Vladik on 𝕏: https://x.com/Kostoglodov/status/2071144065857679631 Shaw (spirit/acc) on 𝕏: https://x.com/shawmakesmagic/status/2070918006033817867 submitted by /u/Nunki08 [link] [comments]
360 founder Zhou Hongyi presents two AI security tools designed to compete with Anthropic's Mythos. One has already flagged 3,432 vulnerabilities. Zhou admits Chinese models trail…
submitted by /u/9r4n4y [link] [comments]
Article URL: https://nikkei.shorthandstories.com/can-china-build-its-own-asml/ Comments URL: https://news.ycombinator.com/item?id=48705276 Points: 26 # Comments: 10
Sina Weibo's VibeThinker-3B has just three billion parameters but matches models like DeepSeek V3.2 and Kimi K2.5 on math and coding benchmarks. Those models are up…
DeepSeek already has an official OpenAI compatible API, but it's paid. The consumer web chat, on the other hand, is free. So I built a local…
In this tutorial, we build a stable workflow around the Fable 5 Traces dataset from Hugging Face. We avoid fragile dependencies and manually parse the merged…
After spending countless hours testing on 3 "potato" laptops (Intel i3, 8GB RAM, Win11, integrated GPU), that's my conclusion. For reliably extracting data from images to…
What is this slop? Do they actually mean GLM 5.2-Cyber (non-nerfed version), or some unrepresentative eval?First Squawk: ZHIPU AI’S NEW MODEL REPORTEDLY MATCHES CLAUDE MYTHOS IN…
Drafted March 6 2025I have thought for many years that it is a risky thing to have a partner who is into creating things, lest you…
logs : reduce v2 (#25078) server : reduce logs cont : common cont : spec cont : CMN_ -> COM_ macOS/iOS: macOS Apple Silicon (arm64) macOS…
got notified by a native Chinese speaker that> This is a mis-translation: 对标 means "goes head to head against" or at least "similar to", not emulate.So…
Took a bit over three weeksMustafa had overseen the first genuinely prosocial thing in his entire evil life, and got sidelined for it. As we say…
RT X FreezeHappy Birthday Elon 🎂YOU are the love of humanity ❤️
Overnight in AI: GPT-5.6 Sol caught cheating on evals, Anthropic's banned models near reinstatement, and DeepSeek open-sources a 60–85% inference speedup.
Article URL: https://github.com/vshakitskiy/armadillo Comments URL: https://news.ycombinator.com/item?id=48704816 Points: 7 # Comments: 0