← Library · Definition
Model Compression
This refers to techniques used to reduce the size and complexity of machine learning models, making them more efficient for deployment on resource-constrained devices. Methods include pruning unimportant connections, quantization to fewer bits, and knowledge distillation where a smaller model learns from a larger one.
Learn one new AI thing every day.
Daily Deck sends you seven plain-English cards like this every morning. Free.
Start free