ericye16 | 1 year ago

This was the basis of a project I did for my deep reinforcement learning class!

We were able to make some improvements by tuning how the reward is distributed, and by first pretraining the agent on scales before fine-tuning it on the final pieces.

Thanks to Kevin Zakka for helping us get started with the RL environment!

plaguuuuuu|1 year ago

Did you guys ever try having the agent play the song slower at first?

ericye16|1 year ago

We definitely tried extending the lookahead, but I don't think we tried having a curriculum-style thing where we gradually increased the speed of the song. Great idea though!
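
For illustration only, the speed-curriculum idea could be sketched roughly as below. Nothing here comes from the project: the `TempoCurriculum` class, the starting tempo of 0.5, the 0.1 step, and the 0.9 score threshold are all hypothetical choices, and the "score" is assumed to be some per-evaluation success metric in [0, 1].

```python
from dataclasses import dataclass


@dataclass
class TempoCurriculum:
    """Hypothetical schedule that raises song speed from slow to full tempo."""

    start_scale: float = 0.5  # assumed starting speed (half tempo)
    end_scale: float = 1.0    # full tempo
    step: float = 0.1         # assumed tempo increase per promotion
    threshold: float = 0.9    # assumed score needed before speeding up

    def __post_init__(self) -> None:
        # Current tempo multiplier; training starts at the slowest speed.
        self.scale = self.start_scale

    def update(self, score: float) -> float:
        """Raise the tempo by one step if the score clears the threshold."""
        if score >= self.threshold:
            # Rounding keeps repeated float additions from drifting.
            self.scale = min(self.end_scale, round(self.scale + self.step, 10))
        return self.scale


# Usage: call update() after each evaluation and feed the returned
# multiplier to whatever controls playback speed in the environment.
curriculum = TempoCurriculum()
for eval_score in [0.5, 0.95, 0.92, 0.3]:
    tempo = curriculum.update(eval_score)
```

Gating on a score threshold, rather than on a fixed number of training steps, means the tempo only increases once the agent can handle the current speed; a step-based schedule would be simpler but could speed up too early.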