top | item 43197044

(no title)

ericye16 | 1 year ago

This was the basis of a project I did for my deep reinforcement learning class!

https://ericye16.com/stanford-cs224r

We were able to make some improvements by tuning how the reward is distributed and also by first pretraining the agent on scales before fine-tuning them on the final pieces.

Thanks to Kevin Zakka for helping us get started with the RL environment!

discuss

order

plaguuuuuu|1 year ago

did you guys ever try having the agents play the song slower at first?

ericye16|1 year ago

We definitely tried extending the lookahead, but I don't think we tried having a curriculum-style thing where we gradually increased the speed of the song. Great idea though!