top | item 45156146 (no title) rhaps0dy | 5 months ago The easiest thing to do is probably use A* to find the solution, then imitation learning on the NN to learn it. (The immediate feedback gets rid of the vanishing/exploding gradients problem). discuss order hn newest No comments yet.
No comments yet.