abdljasser2 | 1 year ago
Good question. In my experience, combining generic descriptors works best. This is probably because the text captions used during training consist mostly of generic instrument names, genre names, and adjectives.