polygamous_bat | 1 year ago

This is an interesting question. Is there perhaps a "controversy benchmark" to measure this?

pennomi | 1 year ago

In the same vein, what about over-alignment benchmarks? For example, LLMs refusing to tell you how to destroy all children of a Unity GameObject, which is a routine cleanup task that only sounds alarming.