top | item 45707951 (no title) ramanvarma | 4 months ago do you have benchmarks on tasks with sparse rewards or partial observability? i feel like thats where most "train any agent" claims tend to break down discuss order hn newest PaulRobinson|4 months ago It doesn't replace core algorithms. It plumbs things together. It means you're not having to write the framework to connect things, your algos are still going to have the same problems as they had before.
PaulRobinson|4 months ago It doesn't replace core algorithms. It plumbs things together. It means you're not having to write the framework to connect things, your algos are still going to have the same problems as they had before.
PaulRobinson|4 months ago