Bullshit Benchmark Explorer

8 points | smusamashah | 5 days ago | petergpt.github.io

3 comments

fragebogen|4 days ago

Such a great project that could hopefully automate a lot of vibes testing! A pity that the dataset only contains 55 questions. I'd like to see this number in the thousands.

drsalt|4 days ago

this isn't really bullshit, it's just nonsense. bullshit can only be understood in proper context. i swear i'm not bullshitting you.