top | item 45711759

(no title)

gugagore | 4 months ago

This is the first time I see "SGD" to mean "standard gradient descent" and not "stochastic gradient descent".

discuss

order

tavianator|4 months ago

Presumably that's just a mistake. The author calls it "stochastic gradient descent" correctly elsewhere in the article

elanapearl|4 months ago

haha oops yeah the other comment is correct- that was just a mistake

I originally wrote "vanilla" there but didn't want to repeat that word twice in a row so swapped it for "standard" without realizing it now looked like the SGD acronym

just fixed that to avoid confusion- thanks for pointing it out!