raindeer2 | 4 months ago

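The first bit is why it is called stochastic gradient descent: at each step you follow the gradient of a randomly chosen minibatch instead of the gradient of the full dataset. Each minibatch gradient is only a noisy estimate of the true one, so you basically "vibrate" your way downhill, roughly along the negative gradient.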
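To make the "vibrate" picture concrete, here is a minimal sketch of minibatch SGD on a toy least-squares problem. Everything in it is my own assumption for illustration (the function names `make_data` and `sgd_linear_regression`, the synthetic data, the learning rate and batch size); it is not taken from any particular library or paper.

```python
import numpy as np


def make_data(n=200, seed=1):
    """Hypothetical toy data: y = X @ true_w + small Gaussian noise."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, 2))
    true_w = np.array([3.0, -2.0])
    y = X @ true_w + 0.1 * rng.normal(size=n)
    return X, y, true_w


def sgd_linear_regression(X, y, lr=0.05, batch_size=8, steps=500, seed=0):
    """Minibatch SGD on mean squared error; returns weights and loss history."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    history = []
    for _ in range(steps):
        # The "stochastic" part: pick a random minibatch at every step.
        idx = rng.choice(n, size=batch_size, replace=False)
        Xb, yb = X[idx], y[idx]
        # Gradient of the minibatch MSE, a noisy estimate of the full gradient.
        grad = 2.0 * Xb.T @ (Xb @ w - yb) / batch_size
        w = w - lr * grad
        # Track the loss on the full dataset to see the overall trend.
        history.append(np.mean((X @ w - y) ** 2))
    return w, np.array(history)


if __name__ == "__main__":
    X, y, true_w = make_data()
    w, history = sgd_linear_regression(X, y)
    print("estimated weights:", w)
    print("first and last full-data loss:", history[0], history[-1])
```

If you plot `history`, you would typically see the full-data loss trend downward while jittering from step to step, since each update only sees a handful of samples. Raising `batch_size` should reduce the jitter, at the cost of more work per step.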
