(no title)
profchemai | 9 months ago
The same argument could be made for the transformer paper: hijacking a nostalgia pop-culture name to name a deep learning bi-linear operator. Many papers are guilty of this, some just become very influencial.
profchemai | 9 months ago
The same argument could be made for the transformer paper: hijacking a nostalgia pop-culture name to name a deep learning bi-linear operator. Many papers are guilty of this, some just become very influencial.
No comments yet.