ertdfgcvb | 1 year ago

Isn't that the point of testing: not to maximize reward, but to wait and collect data? It sounds like maximizing reward during the experiment period can bias the results.

LPisGood | 1 year ago

The great thing is that you can do both.