top | item 41949679

(no title)

SEGyges | 1 year ago

it is not necessarily 16x if you, e.g., decrease model width by a factor of 4 or so also, but yeah naively the RAM and FLOPs scale up by n^2

discuss

order

No comments yet.