r/LocalLLaMA
What’s everyone using to estimate VRAM/RAM (weights + KV cache) before spinning up a local model?
Hi all, I typically check the model's file size to estimate whether it will fit, but I figured there should be a better way. Hugging Face has an option to log your hardware and see estimates, but if you have multiple hardware setups it's not that usable, and the estimates don't always show up. I did some light research and only found these: hf-