Did you investigate other search processes besides SGD? I'm thinking of those often termed "biologically plausible" (e.g. forward-forward, FA). Are their internal representations closer to the fractured or unified representations?
This is an interesting line of research but missing a key aspect: there's (almost) no references to the linear representation hypothesis. Much work on neural network interpretability lately has shown individual neurons are polysemantic, and therefore practically useless for explainability. My hypothesis is fitting linear probes (or a sparse autoencoder) would reveal linearly semantic attributes.
It is unfortunate because they briefly mention Neel Nanda's Othello experiments, but not the wide array of experiments like the NeurIPS Oral "Linear Representation Hypothesis in Language Models" or even golden gate Claude.
I am glad they evaluated this hypothesis using weight decay which is primarily thought of to induce a structured representation. My first thought was that the entire paper was useless if they didn't do this experiment.
I find it rather interesting that the structured representations go from sparse to full to sparse as a function of layer depth. I have noticed that applying weight decay penalty as an exponential function of layer depth gives improved results over using a global weight decay.
> Much of the excitement in modern AI is driven by the observation that scaling up existing systems leads to better performance.
Scaling up almost always leads to better performance. If you're only getting linear gains though then there is absolutely nothing to be excited about. You are in a dead end.
"I looked at the representations of a network and I don't like them".
Great! There's no mathematical definition of what a fractured representation is. It's whatever art preferences you have.
Our personal preferences aren't a good predictor of which network will work well. We wasted decades with classical AI and graphical models encoding our aesthetic into models. Just to find out that the results are totally worthless.
Can we stop please? I get it. I too like beautiful things. But we can't hold on to things that don't work. Entire fields like linguistics are dying because they refuse to abandon this nonsense.
I am interested in doing research like this. Is there any way I can be a part of it or a similar group? I have been fighting for funding from DoD for many years but to no avail so I largely have to do this research on my own time or solve my current grant's problems so that i can work on this. In my mind, this kind of research is the most interesting and important right now in the deep learning field. I am a hard worker and a high-throughput thinking... how can i get connected to otherwise with a similar mindset?
unknown|9 months ago
[deleted]
scarmig|9 months ago
goldemerald|9 months ago
It is unfortunate because they briefly mention Neel Nanda's Othello experiments, but not the wide array of experiments like the NeurIPS Oral "Linear Representation Hypothesis in Language Models" or even golden gate Claude.
akarshkumar0101|9 months ago
ipunchghosts|9 months ago
ipunchghosts|9 months ago
I find it rather interesting that the structured representations go from sparse to full to sparse as a function of layer depth. I have noticed that applying weight decay penalty as an exponential function of layer depth gives improved results over using a global weight decay.
timewizard|9 months ago
Scaling up almost always leads to better performance. If you're only getting linear gains though then there is absolutely nothing to be excited about. You are in a dead end.
unknown|9 months ago
[deleted]
cwmoore|9 months ago
light_hue_1|9 months ago
Great! There's no mathematical definition of what a fractured representation is. It's whatever art preferences you have.
Our personal preferences aren't a good predictor of which network will work well. We wasted decades with classical AI and graphical models encoding our aesthetic into models. Just to find out that the results are totally worthless.
Can we stop please? I get it. I too like beautiful things. But we can't hold on to things that don't work. Entire fields like linguistics are dying because they refuse to abandon this nonsense.
akarshkumar0101|9 months ago
pvg|9 months ago
ipunchghosts|9 months ago
unknown|9 months ago
[deleted]