← Library · Core concept

Semi-Supervised Learning

Semi-supervised learning is a machine learning approach that leverages both labeled and unlabeled data during training. It is particularly useful when labeling data is expensive or time-consuming, allowing the model to learn patterns from a large amount of raw, untagged information while still benefiting from a smaller, precisely categorized dataset. Techniques often involve training an initial model on labeled data, then using that model to pseudo-label the unlabeled data, and finally retraining on the combined dataset.

In plain terms

Imagine teaching a child to recognize animals by showing them a few labeled pictures of cats and dogs, and then giving them many more unlabeled pictures, asking them to guess, and subtly correcting them.

Why it matters

It allows AI systems to make effective use of vast amounts of readily available unlabeled data, reducing the burden of manual annotation and enabling learning in data-scarce domains.

Learn one new AI thing every day.

Daily Deck sends you seven plain-English cards like this every morning. Free.

Start free