holden_nelson | 8 months ago

They’re saying that a linter can check the output of a model being trained with reinforcement learning, and the model gets rewarded when that output is correct.
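To make the idea concrete, here is a rough sketch of a linter acting as a reward signal. Everything in it is an assumption for illustration: `lint` is a toy stand-in built on Python's `ast` module (syntax check plus unused-import check), not a real linter, and `lint_reward` is a hypothetical name with a simple pass/fail reward. A real setup would likely call an actual linter and might scale the reward by the number of issues.

```python
import ast


def lint(source: str) -> list[str]:
    """Toy stand-in for a real linter: return a list of issue messages."""
    try:
        tree = ast.parse(source)
    except SyntaxError as exc:
        # Code that does not parse gets a single syntax issue.
        return [f"syntax error: {exc.msg}"]

    imported = set()
    used = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # "import a.b" binds the name "a"; "import a.b as c" binds "c".
                imported.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            for alias in node.names:
                if alias.name != "*":  # star imports bind no single name
                    imported.add(alias.asname or alias.name)
        elif isinstance(node, ast.Name):
            used.add(node.id)

    return [f"unused import: {name}" for name in sorted(imported - used)]


def lint_reward(source: str) -> float:
    """Hypothetical reward: 1.0 if the output is lint-clean, else 0.0."""
    return 1.0 if not lint(source) else 0.0


if __name__ == "__main__":
    # Pretend these strings are samples produced by the model.
    samples = [
        "import os\nprint(os.getcwd())\n",  # clean
        "import os\nprint(1)\n",            # unused import
        "def f(:\n",                        # does not parse
    ]
    for sample in samples:
        print(lint_reward(sample), lint(sample))
```

The appeal is that the check is automatic and cheap, so every sampled output can be scored without a human in the loop. The obvious caveat is that lint-clean only means the code is well-formed, not that it does what was asked.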
