Apparently you can skip entire transformer blocks at load time with minimal performance impact
Following recent (very cool) papers, I implemented this as a --skip-layers flag to a llama.cpp fork, so it just never instantiates the blocks you tell it…
Every story across every category, newest first. Each card links to the original publisher; daily-brief posts open as editorial pages.
Following recent (very cool) papers, I implemented this as a --skip-layers flag to a llama.cpp fork, so it just never instantiates the blocks you tell it…
EverMind has open-sourced EverOS, a local-first memory runtime that stores AI agent memory as plain Markdown indexed by SQLite and LanceDB. It combines hybrid BM25 +…
Good multi-hop eval, hard to construct at scale though"NotSupportedByCitedSourceBench"Guive Assadi: I asked Claude this question and it said yes, actually, the Dhofar War. Based on the…
Dean really doesn't want *us all* to be fuckedbut I think the essay follows quite reasonably from that premise, even if I disagree on some bifurcation…
Nice work by the @Tesla_AI team!The AI3 computer only has ~15% of the effective memory bandwidth of AI4, so this was a tough challenge.K10✨: Drove Tesla…
> you got 20k days leftrealistic medianI hopeludwig: @r0ck3t23 retardslopto every poor soul who read thru this whole thing, protect yourself, retards will use and abuse…
RT Elon MuskRe Neuralink is needed to solve the super low & lossy output bandwidth of humans. Our input bandwidth, thanks to vision, is many orders…
DeepSeek V4 (#24162) convert: add dsv4 conversion add basic setup add llm_graph_input_dsv4 add save-load state add sinkhorn eps - correction by @fairydreaming add rope fix cleanup…
https://preview.redd.it/n7rwh262b7ah1.jpg?width=1024&format=pjpg&auto=webp&s=33d775b456843cd2dbd458de89384a6a7d6d87d1 Source: Email sent from deepseek (email only available for chinese user) used gpt image 2 translate image into english submitted by /u/External_Mood4719 [link] [comments]
I really have no clue what I'm supposed to see here in terms of the pace@Rupprecht_A: Latest update on the 004 aircraft carrier via kane72/SDF:
Article URL: https://www.cpushack.com/2026/06/03/sandia-national-labs-sa3000-8085-cpu/ Comments URL: https://news.ycombinator.com/item?id=48717287 Points: 3 # Comments: 0
Tesla FSD v14 Lite is rolling out to customers with our AI3 hardwareAshok Elluswamy: FSD v14 Lite is now rolling out to AI3 early-access customers. Based…
Grok Build daily updatesX Freeze: Got another update to Grok Build.....it’s receiving daily improvements at a rapid paceRelease Notes: v0.2.73Features:• Keep text selection highlight setting added…
RT Elon MuskRe @ryanwang In some cases, where we don’t have access to the source code, eg network switch software, we are also decompiling and modifying…
Google deployed an agentic AI peer-reviewer at two top CS conferences — reviewing ~10,000 papers with 30-minute turnaround — and the new formal research paper shows…
Security researchers at Mozilla's 0DIN platform have shown how a single compromised GitHub repo can take over a developer's machine the moment an AI coding tool…
Voice agents face a fundamental tension: the reasoning, retrieval, and tool use that make foundation models capable are iterative and slow, while conversational interaction demands responses…
The All England Lawn Tennis Club is adding new AI-powered features to Wimbledon’s digital platforms through its ongoing work with IBM. The updates will be available…
I recently replaced GPT-OSS 20B Q4 with Gemma 4 12B Q8 but i went from roughly 70 t/s to 10 t/s. Am I doing something wrong?…
tools/ui: restore Tailwind scanning in ignored worktrees (#24879) macOS/iOS: macOS Apple Silicon (arm64) macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED macOS Intel (x64) iOS XCFramework Linux:…
I am determined to buy a bunch of GPUs. However, I would like to test the performance of models such as GLM-5.2 at different quantisation levels…
FOR ONCE people are begging to have some Chinese Overcapacity, and they can't deliver. Shameful display… (if they weren't on the receiving side of the export…
> Reflectioncursed name I guessGDP: @teortaxesTex Reflection has been an disappointment. Met the representatives first time in Neurips San Diego. By this time they should have…
Article URL: https://blog.pragmaticengineer.com/pollen-tried-to-remove-my-article-about-callum-negus-fancey-and-google-is-assisting-to-it/ Comments URL: https://news.ycombinator.com/item?id=48716902 Points: 31 # Comments: 4