top | item 45612722 (no title) stevenpetryk | 4 months ago This is referred to as “online reinforcement learning” and is already something done by, for example Cursor for their tab prediction model.https://cursor.com/blog/tab-rl discuss order hn newest tinodb|4 months ago Not sure that’s the same. They just very frequently retrain and “deploy a new model”.
tinodb|4 months ago Not sure that’s the same. They just very frequently retrain and “deploy a new model”.
tinodb|4 months ago