top | item 41735113

(no title)

robertsdionne | 1 year ago

These are real RNNs, they still depend upon the prior hidden state, it’s just that the gating does not. The basic RNN equation can be parallelized with parallel prefix scan algorithms.

discuss

order

No comments yet.