Fact-checked May 20, 2026
Also called: SGD
SGD is a popular optimization algorithm used to train machine learning models, especially neural networks. It helps models learn by making small, iterative adjustments to their internal settings based on training data.
Stochastic Gradient Descent, often simply called SGD, is a core algorithm in the world of machine learning. Imagine you're trying to find the lowest point in a bumpy landscape, but you can only see a tiny area around your feet. SGD works similarly: it takes one, or a small batch of data points at a time, calculates how far off the model's prediction is, and then nudges the model's parameters (the internal settings) in the direction that would reduce that error. It does this over and over again, gradually moving the model towards better performance.
Because SGD only looks at a small portion of the data at each step, it's very efficient, especially for huge datasets. This 'stochastic' (random) nature also helps it avoid getting stuck in local minimums, which are points that look optimal but aren't the absolute best solution. It's a fundamental building block for training complex models like those used in image recognition, natural language processing, and much more, allowing them to learn from vast amounts of information.
Daily Deck explains terms like Stochastic Gradient Descent as part of a free seven-card daily brief. No jargon. No fluff.
Start free