← Library · Core concept
Data Augmentation
Data augmentation is a technique used to expand the diversity of training data by creating modified versions of existing data points. For images, this might involve rotations, flips, or color adjustments. For text, it could involve synonyms or rephrasing. This process helps prevent overfitting and improves the model's ability to generalize.
In plain terms
Data augmentation is like preparing for a cooking competition by practicing with slightly different ingredient quantities or presentation styles for the same dish.
Why it matters
It significantly reduces the need for vast amounts of original labeled data and makes models more robust to variations in real-world input.
Learn one new AI thing every day.
Daily Deck sends you seven plain-English cards like this every morning. Free.
Start free