top | item 45820980

(no title)

tifa2up | 3 months ago

For large context (up to 100K tokens in some cases). We found that GPT-5: a) has worse instruction following; doesn't follow the system prompt b) produces very long answers which resulted in a bad ux c) has 125K context window so extreme cases resulted in an error

discuss

order

Shank|3 months ago

ChatGPT when using 5 or 5-Thinking doesn’t even follow my “custom instructions” on the web version. It’s a serious downgrade compared to the prior generation of models.

cj|3 months ago

It does “follow” custom instructions. But more as a suggestion rather than a requirement (compared to other models)

Xmd5a|3 months ago

Ah, 100k/125K this is what poses problems I believe. GPT-5 scores should go up should you process contexts that are 10 times shorter.