top | item 46909046 (no title) yobbo | 24 days ago I think a latent embedding is almost equivalent to the article's hypernetwork, which I assume as y = (Wh + c)v + b, where h is a dataset-specific trainable vector. (The article uses multiple layers ...) discuss order hn newest No comments yet.
No comments yet.