(no title)
mayukhdeb | 1 year ago
This is true. The features closer together now have much stronger semantic overlap. You can watch how the weights self-organize in a GPT here: https://toponets.github.io/webpage_assets/banner_video.mp4
We're already studying the effects of topographic structure on polysemanticity.
No comments yet.