top | item 41949679 (no title) SEGyges | 1 year ago it is not necessarily 16x if you, e.g., decrease model width by a factor of 4 or so also, but yeah naively the RAM and FLOPs scale up by n^2 discuss order hn newest No comments yet.
No comments yet.