top | item 45612722

(no title)

This is referred to as “online reinforcement learning” and is already something done by, for example Cursor for their tab prediction model.

discuss

tinodb|4 months ago

Not sure that’s the same. They just very frequently retrain and “deploy a new model”.