Recently, the people of the city of Kolkata witnessed the earthquake for a short time; individuals ran out of high-rise offices to the streets in panic. Thanks to the internet, we’ve all seen it.
BullshitBench tests whether AI models can detect nonsensical questions—or if they'll confidently answer them anyway. The ...