(no title)
plopz
|
11 days ago
I would imagine a trajectory similar to AlphaGo, it starts out trying to replicate humans and then at a certain point pivots to entirely self-play. I think the main hurdle with llms, is that there isn't a strong reward target to go after. It seems like the current target is to simply replicate humans, but to go beyond that they will need a different target.
c22|11 days ago
I think real intelligences are working with myriad such targets, but an adversarial environment seems essential for developing intelligence along this axis.
I do think if there's a path to AGI from current efforts it will be through game play, but that could just be the impressionable kid who watched Wargames in the 80s speaking through me.