top | item 45707951

(no title)

ramanvarma | 4 months ago

do you have benchmarks on tasks with sparse rewards or partial observability? i feel like thats where most "train any agent" claims tend to break down

discuss

PaulRobinson|4 months ago

It doesn't replace core algorithms. It plumbs things together. It means you're not having to write the framework to connect things, your algos are still going to have the same problems as they had before.