(no title)
abossy | 1 year ago
Could you say more about this? What was the entirety of your training data, exactly, and how did the sketch of changes and git patch play into that?
abossy | 1 year ago
Could you say more about this? What was the entirety of your training data, exactly, and how did the sketch of changes and git patch play into that?
jahooma|1 year ago
This is all the data I need: the old file, the sketch of how Claude would update it, and the ground truth diff that should be produced. I compiled this into the ideal conversation where the assistant responds with the perfect patch, and that became the training set. I think I had on the order of ~300 of these conversations for the first run, and it worked pretty well.
I came up with more improvements too, like replacing all the variant placeholder comments like "// ... existing code ..." or "# ... (keep the rest of the function)" with one [[*REPLACE_WITH_EXISITNG_CODE*]] symbol, and that made it more accurate
abossy|1 year ago