adchurch | 1 year ago
Evaluated on a proprietary dataset of manually labelled PRs.
> Is this generally just sniffing surface quality and quantity of written code...
Somewhere in between the two :) A PR with a quickly and poorly implemented login will have a lower output score than a PR with a robust, well-designed, and tested login, simply because the latter takes more effort. But there isn't (yet!) a metric to quantify the relative quality. So our metric doesn't tell the full story, but it gives more info than was previously available.
captn3m0 | 1 year ago
Some such changes have been my most impactful ones.