Communities news · AI Feed

r/LocalLLaMA Communities 11 hr ago

Qwen3.6 27B more dumb in vLLM compared to llama.cpp

Hello, I recently bought a new RTX 5060Ti to pair with the RTX 5060Ti I already own, now I have 32GB of VRAM. Up until now…

r/LocalLLaMA Communities 11 hr ago

KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking

As retrieval systems scale, high-quality reranking becomes increasingly important. However, most existing rerankers, whether encoder-based or decoder-based, jointly encode the query and passage, tightly coupling their…

r/LocalLLaMA Communities 12 hr ago

llama.cpp updates – granite-speech-4.1-2b, LFM2.5-ColBERT/Embedding-350M, Vulkan backend related changes & Misc items

Supported Models: granite-speech-4.1-2b-plus by 24818; LFM2.5-ColBERT-350M & LFM2.5-Embedding-350M by 24913. Vulkan: vulkan: link ggml-cpu when GGML_VULKAN_CHECK_RESULTS / RUN_TESTS are enabled #24444; vulkan: make mul_mm ALIGNED a…

r/MachineLearning Communities 12 hr ago

Could it be that there aren’t really any medical LLM APIs available right now? [D]

As part of my ablations, I want to generate text with a medical-oriented LLM, and I was surprised to find no exposed APIs for this kind…

Hacker News (front page) Communities 12 hr ago

We're making Bunny DNS free: because a faster internet won't build itself

Article URL: https://bunny.net/blog/were-making-bunny-dns-free/ Comments URL: https://news.ycombinator.com/item?id=48657030 Points: 277 # Comments: 89

r/LocalLLaMA Communities 13 hr ago

New EU model (Domyn) will be 400b.

The source is in Italian, but it is a well-respected newspaper (like the Financial Times): https://www.ilsole24ore.com/art/frontier-grand-challenge-domyn-guidera-progetto-dell-ai-sovrana-AIgNTNoD?refresh_ce=1 They are a startup that has already created a closed 260b model…

Hacker News (front page) Communities 14 hr ago

Ashby (YC W19) Is Hiring EMEA Engineers Who Can Design

Article URL: https://www.ashbyhq.com/careers?ashby_jid=87b96eef-edc1-4de4-adb6-d460126d02f8&utm_source=hn Comments URL: https://news.ycombinator.com/item?id=48656219 Points: 0 # Comments: 0

r/LocalLLaMA Communities 15 hr ago

PCIE 5.0 16x split into 2×8 with riser cable

Hey guys, thanks in advance for your help and knowledge! My setup is born out of the parts I had at hand. Wanting to maximise VRAM…

r/LocalLLaMA Communities 15 hr ago

Qwen-AgentWorld-397B-A17B

It looks like a new model, mentioned on https://huggingface.co/Qwen/Qwen-AgentWorld-35B-A3B and on https://qwen.ai/blog?id=qwen-agentworld submitted by /u/Shoddy_Bed3240

r/LocalLLaMA Communities 15 hr ago

Unlimited-OCR is now on ModelScope! A 3.3B multilingual OCR model for one-shot parsing across single images, multi-page documents, and PDFs. License: MIT

Full-document parsing instead of cropped-region OCR; 32K output length for long OCR sequences; Base and gundam image modes for different document layouts; Transformers inference + SGLang…

r/LocalLLaMA Communities 15 hr ago

Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments

Qwen just released Qwen-AgentWorld-35B-A3B — a 35B-parameter MoE with only ~3B active parameters per token. The interesting part: this is not positioned as a standard chat/instruction…

r/LocalLLaMA Communities 16 hr ago

GitHub – QwenLM/Qwen-AgentWorld: Qwen-AgentWorld: Language World Models for General Agents

submitted by /u/dan945

LessWrong AI Communities 17 hr ago

Can weak AI watch strong AI?

The more capabilities new frontier models gain, the more sharply the question arises: how will we know when the model is doing something it shouldn't? Today,…

LessWrong AI Communities 17 hr ago

Reasoning and learning about injected concepts in language models

This work was done as a part of SPAR, under the mentorship of Mirko Bronzi and Damiano Fornasiere. TL;DR: We test models' ability to recover information about…

LessWrong AI Communities 17 hr ago

Toy transformers may represent belief-state geometry optimally but not minimally

Methods note: The code used for the experiments and related open-source repo were built with Claude. The experimental design and writeup is my own, with minimal…

r/LocalLLaMA Communities 17 hr ago

Speaking of those Chinese chips… "Chinese supercomputer displaces US machines as world’s fastest for first time since 2017"

submitted by /u/johnnyApplePRNG

LessWrong AI Communities 17 hr ago

We Should Train Frontier AIs on a Synthetic World, Not Ours

Epistemic status: I think the core idea could actually be built. My real doubt is whether anyone with the compute will ever bother to try it.…

r/LocalLLaMA Communities 18 hr ago

Seems this community might have missed it: Bill that would mandate AI chip location tracking gains industry support | Half a dozen companies have come out in support of the Chip Security Act, which would require location-tracking mechanisms for America’s most advanced computing chips.

Web and Reddit searches did not turn up this story in this sub, even though the news is several days old, so I am posting it. Related links:…

Hacker News (front page) Communities 18 hr ago

Raspberry Pi Pico W as USB Wi-Fi Adapter

Article URL: https://gitlab.com/baiyibai/pico-usb-wifi Comments URL: https://news.ycombinator.com/item?id=48654676 Points: 164 # Comments: 72

LessWrong AI Communities 18 hr ago

Can You Hide From a Natural Language Autoencoder?

TLDR: NLAs are a recent black box mech interp method for verbalizing model internals. I will be focusing on one of two components, the Activation Verbalizer…

LessWrong AI Communities 18 hr ago

Tree Transformers: A step towards generalizing the transformer architecture

After a billion architectures and a trillion variations, I finally found a transformer architecture that intrigued me. And this essay is step one towards the theory…

Hacker News (front page) Communities 19 hr ago

"Fix" MacBook Neo Cursor Lag: Record 1 Pixel of the Screen Every 10 Seconds

Article URL: https://gist.github.com/retroplasma/ec21767d0a8380c7ea9c2fbee1c7d6bf Comments URL: https://news.ycombinator.com/item?id=48654465 Points: 127 # Comments: 52

Hacker News (front page) Communities 19 hr ago

The Teensy Executable Revisited

Article URL: https://www.muppetlabs.com/~breadbox/software/tiny/revisit.html Comments URL: https://news.ycombinator.com/item?id=48654411 Points: 45 # Comments: 2

Hacker News (front page) Communities 19 hr ago

Qwen-AgentWorld: Language World Models for General Agents

Article URL: https://arxiv.org/abs/2606.24597 Comments URL: https://news.ycombinator.com/item?id=48654351 Points: 134 # Comments: 42
