Sammi | 7 days ago
1. Don't implement too much at a time.
2. Have the agent review whether it followed the plan and relevant skills accurately.

irthomasthomas | 7 days ago
the first link was from a simple request with fewer than 1000 tokens total in the context window, just a short shell script.
here is another one which had about 200 tokens and opus decided to change the model name i requested.
https://x.com/xundecidability/status/2005647216741105962?s=2...
opus is bad at instruction following now.