AI Feed

r/LocalLLaMA Communities 2 days ago

Gemma 4 26BA4B Surprisingly Usable at IQ3_S – Are small quants really this usable?

I've been experimenting with using lower quants of Gemma 4 26B on my M3 16gb MacBook Air. The Quant runs at a solid 25 tokens per…

LessWrong AI Communities 2 days ago

Risk-Averse AIs

AbstractWe make the case for training AIs to be risk-averse in resources — specifically, to treat resources as having diminishing marginal utility. These AIs would (for…

Hacker News (front page) Communities 2 days ago

A deadly fungus that can infect cats and people is spreading

Article URL: https://www.sciencenews.org/article/deadly-fungus-cats-people-spreading Comments URL: https://news.ycombinator.com/item?id=48658186 Points: 30 # Comments: 6

r/MachineLearning Communities 2 days ago

I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

I've been comparing GPU/LLM providers for a side project and ended up with way too many browser tabs and spreadsheets. So I decided to pull the…

Hacker News (front page) Communities 2 days ago

Haystack: Open-Source AI Framework for Production Ready Agents, RAG

Article URL: https://haystack.deepset.ai/ Comments URL: https://news.ycombinator.com/item?id=48658095 Points: 3 # Comments: 1

r/LocalLLaMA Communities 2 days ago

How Baidu’s newly released Unlimited-OCR transcribes dozens of pages in one forward pass

https://i.redd.it/zjduf8zns79h1.gif Baidu released Unlimited-OCR 2 days ago, and they claim it can transcribe dozens of pages in one forward pass. I read the research paper, and…

X · @elonmusk X / Twitter 2 days ago

RT Elise Stefanik: Gotham has fallen. The mass exodus out of the New York will continue apace. I call it the Blue Exodus. Last night’s sweep of Marxi…

RT Elise StefanikGotham has fallen. The mass exodus out of the New York will continue apace. I call it the Blue Exodus. Last night’s sweep of…

Hacker News (front page) Communities 2 days ago

Too many R packages: CRAN is inundated with submissions

Article URL: https://rworks.dev/posts/too-many-R-packages/ Comments URL: https://news.ycombinator.com/item?id=48657940 Points: 6 # Comments: 0

THE DECODER Tech Media 2 days ago

Pangram CEO says language models give themselves away by making the same arguments

Language models may write cleaner prose than most humans, but ask one for 100 arguments on a topic and they'll all cluster together. Human reasoning is…

r/LocalLLaMA Communities 2 days ago

Qwen3.6 27B more dumb in vLLM compared to llama.cpp

Hello, I recently bought a new RTX 5060Ti to pair with the RTX 5060Ti I already own, now I have 32GB of VRAM. Up until now…

llama.cpp releases Infrastructure 2 days ago

b9780

vulkan: fail the build when a shader fails to compile (#24450) vulkan-shaders-gen: fail the build when a shader fails to compile vulkan-shaders-gen did not detect shader-compile…

r/LocalLLaMA Communities 2 days ago

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

As retrieval systems scale, high-quality reranking becomes increasingly important. However, most existing rerankers, whether encoder-based or decoder-based, jointly encode the query and passage, tightly coupling their…

X · @elonmusk X / Twitter 2 days ago

RT Konstantin Kisin: The strange thing about progress is that it only counts if you ignore where we started. Every civilisation practised slavery. The…

RT Konstantin KisinThe strange thing about progress is that it only counts if you ignore where we started.Every civilisation practised slavery.The civilisation that ended it is…

Hacker News (front page) Communities 2 days ago

Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions?

Article URL: https://infoscience.epfl.ch/entities/publication/9a49779b-f9f8-448d-b3d1-737c78455309 Comments URL: https://news.ycombinator.com/item?id=48657481 Points: 5 # Comments: 0

X · @MoonshotAI China Labs 2 days ago

The Kimi API is now live on AWS Marketplace. 🚀 If your team is already running on AWS, you can now access Kimi with consolidated billing. Plus, eli…

The Kimi API is now live on AWS Marketplace. 🚀If your team is already running on AWS, you can now access Kimi with consolidated billing. Plus,…

r/LocalLLaMA Communities 2 days ago

llama.cpp updates – granite-speech-4.1-2b, LFM2.5-ColBERT/Embedding-350M, Vulkan backend related changes & Misc items

Supported Models: granite-speech-4.1-2b-plus by 24818 LFM2.5-ColBERT-350M & LFM2.5-Embedding-350M by 24913 Vulkan: vulkan: link ggml-cpu when GGML_VULKAN_CHECK_RESULTS / RUN_TESTS are enabled #24444 vulkan: make mul_mm ALIGNED a…

MarkTechPost Tech Media 2 days ago

Using Graphify and NetworkX to Map Python Codebase Structure with God Nodes, Communities, and Architecture Visualizations

In this tutorial, we build a fully offline Graphify pipeline that turns a multi-module Python application into a knowledge graph. We install Graphify, generate a connected…

THE DECODER Tech Media 2 days ago

Claude Tag embeds Anthropic's AI in Slack, already writes 65 percent of internal code, company says

Claude Tag lets teams bring Anthropic's AI into Slack by tagging @Claude in any channel and assigning it tasks. Internally, the tool already generates 65 percent…

THE DECODER Tech Media 2 days ago

Mistral's new OCR model beats competitors in 72 percent of blind test cases, company says

Mistral AI has released OCR 4, a new model that reads text from documents like PDFs, Word files, and PowerPoint presentations. The article Mistral's new OCR…

MarkTechPost Tech Media 2 days ago

Nous Research Adds /learn to Hermes Agent’s Skills System, Capturing Workflows as Slash Commands Without Hand-Writing SKILL.md

Nous Research has added /learn to the Hermes Agent Skills System. The command authors a standards-compliant SKILL.md from a local directory, a doc URL, a past…

r/MachineLearning Communities 2 days ago

Could it be that there aren’t really any medical LLM APIs available right now? [D]

As part of my ablations, I want to generate text with a medical-oriented LLM, and I was surprised to find no exposed APIs for this kind…

Hacker News (front page) Communities 2 days ago

You can't unit test for taste

Article URL: https://dev.karltryggvason.com/you-cant-unit-test-for-taste/ Comments URL: https://news.ycombinator.com/item?id=48657049 Points: 208 # Comments: 86

Hacker News (front page) Communities 2 days ago

We're making Bunny DNS free: because a faster internet won't build itself

Article URL: https://bunny.net/blog/were-making-bunny-dns-free/ Comments URL: https://news.ycombinator.com/item?id=48657030 Points: 277 # Comments: 89

X · @huggingface X / Twitter 2 days ago

RT Victor M: At Hugging Face we've been building our own agent that we use via Slack (Moon Bot). Honestly, building your own is quite simple and you'l…

RT Victor MAt Hugging Face we've been building our own agent that we use via Slack (Moon Bot). Honestly, building your own is quite simple and…

Latest

Gemma 4 26BA4B Surprisingly Usable at IQ3_S – Are small quants really this usable?

Risk-Averse AIs

A deadly fungus that can infect cats and people is spreading

I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

Haystack: Open-Source AI Framework for Production Ready Agents, RAG

How Baidu’s newly released Unlimited-OCR transcribes dozens of pages in one forward pass

RT Elise Stefanik: Gotham has fallen. The mass exodus out of the New York will continue apace. I call it the Blue Exodus. Last night’s sweep of Marxi…

Too many R packages: CRAN is inundated with submissions

Pangram CEO says language models give themselves away by making the same arguments

Qwen3.6 27B more dumb in vLLM compared to llama.cpp

b9780

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

RT Konstantin Kisin: The strange thing about progress is that it only counts if you ignore where we started. Every civilisation practised slavery. The…

Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions?

The Kimi API is now live on AWS Marketplace. 🚀 If your team is already running on AWS, you can now access Kimi with consolidated billing. Plus, eli…

llama.cpp updates – granite-speech-4.1-2b, LFM2.5-ColBERT/Embedding-350M, Vulkan backend related changes & Misc items

Using Graphify and NetworkX to Map Python Codebase Structure with God Nodes, Communities, and Architecture Visualizations

Claude Tag embeds Anthropic's AI in Slack, already writes 65 percent of internal code, company says

Mistral's new OCR model beats competitors in 72 percent of blind test cases, company says

Nous Research Adds /learn to Hermes Agent’s Skills System, Capturing Workflows as Slash Commands Without Hand-Writing SKILL.md

Could it be that there aren’t really any medical LLM APIs available right now? [D]

You can't unit test for taste

We're making Bunny DNS free: because a faster internet won't build itself

RT Victor M: At Hugging Face we've been building our own agent that we use via Slack (Moon Bot). Honestly, building your own is quite simple and you'l…

Browse by category