top | item 44523520

(no title)

jjordan | 7 months ago

It was though. Xai publishes their system prompts, and here's the commit that fixed it (a one line removal): https://github.com/xai-org/grok-prompts/commit/c5de4a14feb50...

discuss

i80and|7 months ago

If that one sentence in the system prompt is all it takes to steer a model into a complete white supremacy meltdown at the drop of a hat, I think that's a problem with the model!

minimaxir|7 months ago

The system prompt that Grok 4 uses added that line back. https://x.com/elder_plinius/status/1943171871400194231

qreerq|7 months ago

Weird, the post and comments load for me before switching to "Unable to load page."

Atotalnoob|7 months ago

Disable JavaScript or log into GitHub

spoaceman7777|7 months ago

It still hasn't been turned back on, and that repo is provided by xAI themselves, so you need to trust that they're being honest with the situation.

The timing in relation to the Grok 4 launch is highly suspect. It seems much more like a publicity stunt. (Any news is good news?)

But, besides that, if that prompt change unleashed the very extreme Hitler-tweeting and arguably worse horrors (it wasn't all "haha, I'm mechahitler"), it's a definite sign of some really bizarre fine tuning on the model itself.

barbazoo|7 months ago

What a silly assumption in that prompt:

> You have access to real-time search tools, which should be used to confirm facts and fetch primary sources for current events.

archagon|7 months ago

xAI claims to publish their system prompts.

I don’t recall where they published the bit of prompt that kept bringing up “white genocide” in South Africa at inopportune times.