top | item 42033070

(no title)

Tier3r | 1 year ago

That seems like a good idea. I am puzzled by what benefit the RL has in OP. It seems like a well defined constraint optimisation problem that could be done without RL, for example in the way you mentioned.

discuss

order

No comments yet.