top | item 39287529

(no title)

shenberg | 2 years ago

I suspect that weight initializations are geared towards inputs being normal random variables with mean 0 and variance 1. Deviating from that makes the learning process unhappy.

discuss

order

No comments yet.