top | item 36719758

(no title)

namjh | 2 years ago

Might be related: https://arxiv.org/abs/2211.16421 It's a paper about directly learning via JPEG encodings which works well with visusal transformers' patch mechanism.

discuss

No comments yet.