top | item 40967978

(no title)

silentdanni | 1 year ago

Claude managed to write code successfully.

```

DISCOVER HOW TO square_root WITH x, iterations RUMOR HAS IT EXPERTS CLAIM guess TO BE x DIVIDED BY 2 DISCOVER HOW TO improve_guess WITH current_guess RUMOR HAS IT SHOCKING DEVELOPMENT (current_guess PLUS (x DIVIDED BY current_guess)) DIVIDED BY 2 END OF STORY

    DISCOVER HOW TO iterate WITH current_guess, remaining_iterations
    RUMOR HAS IT
        WHAT IF remaining_iterations SMALLER THAN 1
            SHOCKING DEVELOPMENT current_guess
        LIES! RUMOR HAS IT
            EXPERTS CLAIM new_guess TO BE improve_guess OF current_guess
            SHOCKING DEVELOPMENT
                iterate OF new_guess, remaining_iterations MINUS 1
        END OF STORY
    END OF STORY
    
    SHOCKING DEVELOPMENT iterate OF guess, iterations
END OF STORY

EXPERTS CLAIM number TO BE 16 EXPERTS CLAIM num_iterations TO BE 5

YOU WON'T WANT TO MISS 'The square root of' YOU WON'T WANT TO MISS number YOU WON'T WANT TO MISS 'is approximately' YOU WON'T WANT TO MISS square_root OF number, num_iterations

PLEASE LIKE AND SUBSCRIBE

```

discuss

order

CapeTheory|1 year ago

This is consistent with my own experience that Claude is just downright better than ChatGPT.

cowsaymoo|1 year ago

Same, I've been pretty impressed as well and typically give Claude a shot. Sometimes I even pass their results back and forth in an LLM collab so they generate more diverse perspectives. However, this paper from 4 days ago shows that Claude can fall apart quickly in out of distribution tasks. If you ask opposite day questions, GPT-4 is weirdly strong at it (figure 2).

https://arxiv.org/pdf/2307.02477

cowsaymoo|1 year ago

Ah bravo! What was the prompt and Claude model?