The details on how exactly they may have used it to train their model is vague. I believe transfer learning or knowledge distillation are valid techniques based on the inference from other models.
You store the output from ChatGPT, you don't run it again every time you do a training step. Generating millions of examples to add to your own training wont cost much at all relatively.
triyambakam|2 years ago
kevsim|2 years ago
ReptileMan|2 years ago
Jensson|2 years ago