top | item 43301918

(no title)

surferbayarea | 11 months ago

Here are 2 examples from the Black Spatula project where we were able to detect major errors: - https://github.com/The-Black-Spatula-Project/black-spatula-p... - https://github.com/The-Black-Spatula-Project/black-spatula-p...

Some things to note : this didn't even require a complex multi-agent pipeline. A single shot prompting was able to detect these errors.

discuss

order

fph|11 months ago

This black spatula case was pretty famous and was all over the internet. Is it possible that the AI is merely detecting something that was already in its training data?

surferbayarea|11 months ago

This is the original work that detected the problem.