top | item 41703812 (no title) johnsutor | 1 year ago Seems like this is already being answered:https://arxiv.org/abs/2407.10930 https://arxiv.org/abs/2006.04439 discuss order hn newest valine|1 year ago Not really the first paper is just fine-tuning on synthetic data. The second paper doesn’t optimize the model weights.
valine|1 year ago Not really the first paper is just fine-tuning on synthetic data. The second paper doesn’t optimize the model weights.
valine|1 year ago