The example sentences generated “only from neural data” at the top of this article seem surprisingly accurate to me, like, not exact matches but much better than what I would expect even from 10k hours:
“the room seemed colder” -> “ there was a breeze even a gentle gust”
Tangential to your point, if you collect 10,000 hours of brain scanning in exactly one damp basement, I wonder if perhaps the model would become very, very specialized for all of the flavors of "this room seems colder."
For the record, it was two basements -- we moved office in the middle -- and a bigger issue was actually overheating. But your point is basically right! The model is a lot better at certain kinds of ideas than others. Particularly concerning was the fact that the first cluster I noticed getting good was all the different variations of 'the headset is uncomfortable/heavy' etc. But this makes sense -- what participants talk about has a lot to do with what kinds of ideas the model can pick up, and this was more or less what we expected
CobrastanJorji|2 months ago
rio-popper|2 months ago
jcims|2 months ago
Very interesting!
ninapanickssery|2 months ago