top | item 39978647

wsdookadr | 1 year ago

What makes you say it's not representative?

skywhopper|1 year ago

SWE-bench Lite is a subset of extremely simple issues drawn from SWE-bench, which is itself a cherry-picked set of issues from a handful of large (presumably well-run) Python-only projects.

Here are some of the rules they used to trim SWE-bench down to the SWE-bench Lite problems:

* We remove instances with images, external hyperlinks, references to specific commit shas and references to other pull requests or issues.

* We remove instances that have fewer than 40 words in the problem statement.

* We remove instances that edit more than 1 file.

* We remove instances where the gold patch has more than 3 edit hunks (see patch).

See https://www.swebench.com/lite.html
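
To make those rules concrete, here is a rough Python sketch of what a filter like that could look like. This is not the SWE-bench authors' code: the field names `problem_statement` and `patch`, the regexes, and the function names are all assumptions for illustration, and the real pipeline may detect images, links, and SHAs differently.

```python
import re

# Heuristic patterns (assumptions, not the official SWE-bench Lite logic).
URL_RE = re.compile(r"https?://\S+")
IMAGE_RE = re.compile(r"!\[[^\]]*\]\([^)]*\)|<img\b", re.IGNORECASE)
# 7-40 hex characters containing at least one letter and one digit.
SHA_RE = re.compile(r"\b(?=[0-9a-f]*[a-f])(?=[0-9a-f]*[0-9])[0-9a-f]{7,40}\b")
# GitHub-style references to other issues or pull requests, e.g. "#123".
ISSUE_REF_RE = re.compile(r"(?<!\w)#\d+\b")


def count_files(patch: str) -> int:
    """Count files touched by a git-style unified diff."""
    return sum(1 for line in patch.splitlines() if line.startswith("diff --git "))


def count_hunks(patch: str) -> int:
    """Count edit hunks ("@@ ... @@" headers) in a unified diff."""
    return sum(1 for line in patch.splitlines() if line.startswith("@@ "))


def keeps_instance(instance: dict) -> bool:
    """Return True if the instance survives the four rules quoted above."""
    text = instance["problem_statement"]  # hypothetical field name
    patch = instance["patch"]             # hypothetical field name

    # Rule 1: no images, external links, commit SHAs, or issue/PR references.
    if IMAGE_RE.search(text) or URL_RE.search(text):
        return False
    if SHA_RE.search(text) or ISSUE_REF_RE.search(text):
        return False
    # Rule 2: problem statement must have at least 40 words.
    if len(text.split()) < 40:
        return False
    # Rule 3: the gold patch may edit only one file.
    if count_files(patch) > 1:
        return False
    # Rule 4: the gold patch may have at most 3 edit hunks.
    if count_hunks(patch) > 3:
        return False
    return True
```

Every rule drops instances rather than adding any, so whatever survives is by construction short, self-contained, and fixable with a small single-file patch.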

kevindamm|1 year ago

That's... rather limiting.

arp242|1 year ago

Look at the data. Does that seem like the average bug report to you?

falcor84|1 year ago

It would help if you were to provide a specific example or two.