r/LocalLLaMA
· Communities
How many of you do use Q1 or Q2 of Big models(100-250B)? How’s it?
Sharing popular(also recent) models for reference: 151-250B : DeepSeek-V4-Flash Step-3.X-Flash Command-a-plus-05-2026 Laguna-M.1 MiniMax-M2.X Qwen3-235B-A22B 100-150B : GLM-4.5-Air Qwen3.5-122B-A10B NVIDIA-Nemotron-3-Super-120B-A12B Mistral-Small-4-119B-2603 Devstral-2-123B-Instruct-2512 Mistral-Medium-3.5-128B Llama-4