arXiv cs.CV June 25, 2026 · Papers

Neural Network Quantization by Learning Low-Loss Subspaces

arXiv:2606.25087v1 Announce Type: new Abstract: Neural network quantization aims to find a discrete representation of parameters that preserves the performance of a full-precision (FP) model as faithfully as possible. Enforcing discrete constraints perturbs parameters away from a well-optimized minimum, generally resul

Read original