Qwen3.6 27B more dumb in vLLM compared to llama.cpp
Hello, I recently bought a new RTX 5060Ti to pair with the RTX 5060Ti I already own, now I have 32GB of VRAM. Up until now…
Crazy model! It actually uses the old Qwen2.5-Coder-3B stack and gets really great performance with their post-training stack. Need to use it in the next few days…
📢 Qwen3.7-Max just hit #3 on ITbench-AA, a fresh benchmark testing how well models handle real-world enterprise IT tasks, agentic-style. 🔧 Agentic era, go with Qwen. 🏃🏃 Artificial Analysis: Artificial…
Fast, faster, Qwen. 🚀 Thrilled to see Qwen3.5 reaching a record-breaking 580 tps for agentic workloads on the TokenSpeed engine! This milestone wouldn't be possible without our…
RT Nathan Lambert: Gemma 4 adoption numbers outpacing Qwen 3.5/3.6 for the same sized models is a big shift in the international balance of influence via open…
RT Garry Tan: Thinking Machines is impressive. In a couple of hours I just fine-tuned my own Qwen3.5-397B model this afternoon. Fast usable multimodal is also going…
ChatGPT’s new Images 2.0 model is surprisingly good at generating text, Alibaba Drops Qwen 3.6 Max Preview, SpaceX is working with Cursor
Merge pull request #575 from SHUMKASHUN/patch-1 Update bibtex
The Batch AI News and Insights: I’ve been hearing from people at all levels of seniority about a feeling of job insecurity.
Update bibtex Update bibtex of Qwen3-Coder-Next technical report.
Merge pull request #558 from Keytoyze/main Update tech report
Update tech report
Update README.md
Merge pull request #557 from QwenLM/cyente-patch-4 Update README.md
Update README.md
Merge pull request #556 from QwenLM/add_qwen3-coder-next-info qwen3-coder-next released
qwen3-coder-next released
Merge pull request #555 from wenting-zhao/main add qwen3 coder next tech report
Add files via upload
Merge pull request #1971 from 2003jiahang/patch-1 Remove unseeded shuffle for DDP consistency
Merge pull request #2000 from Zhaohai-Li/main Add VideoMME evaluation benchmark
Add VideoMME evaluation benchmark - Add VideoMME dataset evaluation pipeline with vLLM inference - Support short/medium/long video duration types - Include subtitle integration capability - Implement…
Add LM Studio md file and add link into navigation (#1800) * Add LM Studio md file and add link into navigation * Minor phrasing revisions…
Merge pull request #1928 from Keyvanhardani/main Add German Document OCR cookbook