← Library · Core concept

Data Normalization

Data normalization is the process of scaling numeric input values to a standard range, typically between 0 and 1 or -1 and 1. This prevents features with larger numerical ranges from disproportionately influencing a model's learning process. Common methods include Min-Max scaling and Z-score standardization, which transform data based on its minimum-maximum values or mean and standard deviation respectively, ensuring all features contribute more equally during training.

In plain terms

It's like making sure all ingredients in a recipe are measured in the same unit, so one ingredient doesn't overpower the others just because its quantity was given in a larger unit.

Why it matters

Proper data scaling can significantly improve the performance and convergence speed of many machine learning algorithms, especially those sensitive to feature magnitudes like neural networks and support vector machines.

Learn one new AI thing every day.

Daily Deck sends you seven plain-English cards like this every morning. Free.

Start free