Skip to content
arXiv cs.CV · Papers

Neural Network Quantization by Learning Low-Loss Subspaces

arXiv:2606.25087v1 Announce Type: new Abstract: Neural network quantization aims to find a discrete representation of parameters that preserves the performance of a full-precision (FP) model as faithfully as possible. Enforcing discrete constraints perturbs parameters away from a well-optimized minimum, generally resul