Technical Terms

Weight Quantization (Machine Learning)

The process of reducing the numerical precision of neural network parameters (weights) to decrease memory requirements and increase inference speed. Quantization has a theoretical floor set by information theory: below a certain bit-width, information loss becomes unrecoverable. TurboQuant is reported to have approached this floor.

— defined in 154th Edition, Mar 29, 2026

1editions defined

Mar 2026first defined

Mar 2026most recent

Technical Termscategory

Defined in (1)

154th EditionW14 · Mar 29, 2026