top | item 39287529 (no title) shenberg | 2 years ago I suspect that weight initializations are geared towards inputs being normal random variables with mean 0 and variance 1. Deviating from that makes the learning process unhappy. discuss order hn newest No comments yet.
No comments yet.