Bullshit Benchmark Explorer (petergpt.github.io)
8 points | smusamashah | 5 days ago | 3 comments | item 47148095

fragebogen | 4 days ago
Such a great project that could hopefully automate a lot of vibes testing! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.

smusamashah | 5 days ago
https://github.com/petergpt/bullshit-benchmark

drsalt | 4 days ago
this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.