Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000)
For agentic work fast high context is king, OpenCode fills the window quickly and most models that feel snappy at 8k context turn into dial-up ADSL…
For agentic work fast high context is king, OpenCode fills the window quickly and most models that feel snappy at 8k context turn into dial-up ADSL…
This post has me intrigued ... but not to buy, I want to rent/lease one of these FRANKNVIDIA GPUs. I'll learn Chinese. I'll VPN in through…
MiniMax 2.7 REAP Q4 on 96GB VRAM and 192 GB DDR5 udimm ram on a b840 MSI board and 9900X cpu. 1250W PSU and all cards…
Hey everyone! OpenMythos benchmarks are finally here sorry it took about a week to post these. The delay was mainly because SWE-bench results weren't matching up…
Over here: https://huggingface.co/papers/2606.21906 Kudos to xyzblaz for asking. submitted by /u/Kodix [link] [comments]
Hey all, I have a piece of hardware laying around which is pretty fast from a traditional (non-GPU) server viewpoint. The hardware is the following: Dell…
I ran a small benchmark on LLMs for medical scribing. Reason: most discussion around AI scribe safety focuses on hallucinations. That matters, but in notes I…
Three dragons, four snakes, and the silicon nobody outside China can name. For the past few months, many peoples in my timeline has been arguing about…
+ turbo: https://huggingface.co/krea/Krea-2-Turbo submitted by /u/paf1138 [link] [comments]
A megathread that is overdue! Let's discuss and debate on what the best local agents available today are Prologue First a note on terminology: While most…