(no title)
solresol | 8 months ago
That said, fine tuning small models because you have to power through vast amounts of data where a larger model might be cost ineffective -- that's completely sensible, and not really mentioned in the article.
solresol | 8 months ago
That said, fine tuning small models because you have to power through vast amounts of data where a larger model might be cost ineffective -- that's completely sensible, and not really mentioned in the article.
lyu07282|8 months ago
Mostly referred to as model distillation, but I give the author the benefit of the doubt that they didn't mean that.
sota_pop|8 months ago
cbsmith|8 months ago
...which I thought was arguably the most popular use case for fine tuning these days.