Gemma 4 26BA4B Surprisingly Usable at IQ3_S – Are small quants really this usable?
I've been experimenting with lower quants of Gemma 4 26B on my M3 16 GB MacBook Air. The quant runs at a solid 25 tokens per…
Abstract: We make the case for training AIs to be risk-averse in resources — specifically, to treat resources as having diminishing marginal utility. These AIs would (for…
Article URL: https://www.sciencenews.org/article/deadly-fungus-cats-people-spreading Comments URL: https://news.ycombinator.com/item?id=48658186 Points: 30 # Comments: 6
I've been comparing GPU/LLM providers for a side project and ended up with way too many browser tabs and spreadsheets. So I decided to pull the…
Article URL: https://haystack.deepset.ai/ Comments URL: https://news.ycombinator.com/item?id=48658095 Points: 3 # Comments: 1
https://i.redd.it/zjduf8zns79h1.gif Baidu released Unlimited-OCR 2 days ago, and they claim it can transcribe dozens of pages in one forward pass. I read the research paper, and…
RT Elise Stefanik: Gotham has fallen. The mass exodus out of the New York will continue apace. I call it the Blue Exodus. Last night’s sweep of…
Article URL: https://rworks.dev/posts/too-many-R-packages/ Comments URL: https://news.ycombinator.com/item?id=48657940 Points: 6 # Comments: 0
Language models may write cleaner prose than most humans, but ask one for 100 arguments on a topic and they'll all cluster together. Human reasoning is…
Hello, I recently bought a new RTX 5060Ti to pair with the RTX 5060Ti I already own, so I now have 32GB of VRAM. Up until now…
vulkan: fail the build when a shader fails to compile (#24450) vulkan-shaders-gen: fail the build when a shader fails to compile vulkan-shaders-gen did not detect shader-compile…
As retrieval systems scale, high-quality reranking becomes increasingly important. However, most existing rerankers, whether encoder-based or decoder-based, jointly encode the query and passage, tightly coupling their…
RT Konstantin Kisin: The strange thing about progress is that it only counts if you ignore where we started. Every civilisation practised slavery. The civilisation that ended it is…
Article URL: https://infoscience.epfl.ch/entities/publication/9a49779b-f9f8-448d-b3d1-737c78455309 Comments URL: https://news.ycombinator.com/item?id=48657481 Points: 5 # Comments: 0
The Kimi API is now live on AWS Marketplace. 🚀 If your team is already running on AWS, you can now access Kimi with consolidated billing. Plus,…
Supported Models: granite-speech-4.1-2b-plus by 24818 LFM2.5-ColBERT-350M & LFM2.5-Embedding-350M by 24913 Vulkan: vulkan: link ggml-cpu when GGML_VULKAN_CHECK_RESULTS / RUN_TESTS are enabled #24444 vulkan: make mul_mm ALIGNED a…
In this tutorial, we build a fully offline Graphify pipeline that turns a multi-module Python application into a knowledge graph. We install Graphify, generate a connected…
Claude Tag lets teams bring Anthropic's AI into Slack by tagging @Claude in any channel and assigning it tasks. Internally, the tool already generates 65 percent…
Mistral AI has released OCR 4, a new model that reads text from documents like PDFs, Word files, and PowerPoint presentations.
Nous Research has added /learn to the Hermes Agent Skills System. The command authors a standards-compliant SKILL.md from a local directory, a doc URL, a past…
As part of my ablations, I want to generate text with a medical-oriented LLM, and I was surprised to find no exposed APIs for this kind…
Article URL: https://dev.karltryggvason.com/you-cant-unit-test-for-taste/ Comments URL: https://news.ycombinator.com/item?id=48657049 Points: 208 # Comments: 86
Article URL: https://bunny.net/blog/were-making-bunny-dns-free/ Comments URL: https://news.ycombinator.com/item?id=48657030 Points: 277 # Comments: 89
RT Victor M: At Hugging Face we've been building our own agent that we use via Slack (Moon Bot). Honestly, building your own is quite simple and…