polygamous_bat | 1 year ago
This is an interesting question. Is there a “controversy-benchmark” perhaps, to measure this?
pennomi | 1 year ago
In that same light, what about over-alignment benchmarks? Things like LLMs refusing to tell you how to destroy all children of a Unity GameObject.