top | item 46191332

(no title)

heavymemory | 2 months ago

This still seems like gradient descent wrapped in new terminology. If all learning happens through weight updates, its just rearranging where the forgetting happens

discuss

order

No comments yet.