mayukhdeb | 1 year ago
> it’s not scientifically useful
Having structured weights in GPTs lets us localize and control various concepts and study things like polysemanticity and superposition. Other scientific directions include sparse inference (already proven to work) and better model editing. It turns out that topographic structure also helps these models better predict neural data, which is yet another direction we're exploring in computational neuroscience.
photon_lines|1 year ago
mayukhdeb|1 year ago
brrrrrm|1 year ago