user: monadicmonad
-1 karma | created 2 years ago
recent submissions
Experimenting with policy gradient methods in Jax
(github.com)
2 pts|7 months ago|discuss
Policy Evaluation in Grid World
(github.com)
1 pts|1 year ago|discuss