top | item 42378545

(no title)

abdljasser2 | 1 year ago

Good question. In my experience combining generic descriptors is what works best. This is probably due to the text captions used during training mostly consist of generic instrument names, genre names and adjectives.

discuss

order

No comments yet.