← Library · Core concept

Data Augmentation

Data augmentation is a technique used to expand the diversity of training data by creating modified versions of existing data points. For images, this might involve rotations, flips, or color adjustments. For text, it could involve synonyms or rephrasing. This process helps prevent overfitting and improves the model's ability to generalize.

In plain terms

Data augmentation is like preparing for a cooking competition by practicing with slightly different ingredient quantities or presentation styles for the same dish.

Why it matters

It significantly reduces the need for vast amounts of original labeled data and makes models more robust to variations in real-world input.

Learn one new AI thing every day.

Daily Deck sends you seven plain-English cards like this every morning. Free.

Start free