n7ck|2 months ago
1. The predictions get better with more data - and we don't seem to be anywhere near diminishing returns.
2. The thing we care about is generalization between people. For this, less data from more people is much better.
richardfeynman|2 months ago
For a given amount of data, is it better to have more people with less data per person or fewer people with more data per person?
clemvonstengel|2 months ago
For a given amount of data, whether you want more or less data per person really depends on what you're trying to do. What we want is for it to be good at zero-shot, that is, to decode well on people who have zero hours in the training set. For that, we want less data per person, spread across more people. If instead we wanted it to do as well as possible on one individual, we'd want way more data from that one person. (So, e.g., when we first make it into a product, we'll probably finetune on each user for a while.)
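To make "zero hours in the training set" concrete, here is a rough sketch of a person-level split, where held-out people contribute nothing to training. It is not the actual pipeline: the function name `split_by_person`, the `(person_id, sample)` record shape, and the toy data are all assumptions made up for illustration.

```python
import random
from collections import defaultdict


def split_by_person(samples, held_out_fraction=0.2, seed=0):
    """Split (person_id, sample) pairs so held-out people never appear in training.

    Hypothetical helper for illustration; the record format is an assumption.
    """
    # Group every sample under the person it came from.
    by_person = defaultdict(list)
    for person_id, sample in samples:
        by_person[person_id].append(sample)

    # Shuffle people (not samples), then hold out whole people.
    people = sorted(by_person)
    random.Random(seed).shuffle(people)
    n_held_out = max(1, round(len(people) * held_out_fraction))
    held_out = set(people[:n_held_out])

    train = [(p, s) for p in people if p not in held_out for s in by_person[p]]
    test = [(p, s) for p in people if p in held_out for s in by_person[p]]
    return train, test


# Made-up toy data: 10 people with 3 recordings each.
samples = [(f"person_{i}", f"recording_{i}_{j}") for i in range(10) for j in range(3)]
train, test = split_by_person(samples)
train_people = {p for p, _ in train}
test_people = {p for p, _ in test}
```

Scoring only on `test` measures the zero-shot case, since nobody in it was seen during training. A split at the sample level would instead leak each person into both sets and measure something closer to the per-user finetuning case.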