
(no title)

Also seems very impractical to embed this into a deployed product. How can you possibly hope to control and estimate costs? I guess this is strictly meant for R&D purposes.


sebzim4500|1 year ago

You can specify the max length of the response, which presumably includes the hidden tokens.

I don't see why, from a cost perspective, this is qualitatively different from using CoT prompting on existing models.
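To make the cost argument concrete, here is a rough sketch of the worst-case bill for a single request. It assumes the response cap also covers the hidden tokens and that they are billed at the output rate, which is a presumption rather than something confirmed in this thread. The function name and the prices in the example are placeholders, not real pricing.

```python
def worst_case_cost(input_tokens, max_output_tokens,
                    input_price_per_1k, output_price_per_1k):
    """Upper bound on the cost of one request.

    Assumption: hidden reasoning tokens count against max_output_tokens
    and are billed at the output rate. Prices are per 1,000 tokens.
    """
    input_cost = (input_tokens / 1000) * input_price_per_1k
    # The request can never produce more than max_output_tokens
    # (visible + hidden), so this is the most it can cost.
    output_cost = (max_output_tokens / 1000) * output_price_per_1k
    return input_cost + output_cost


# Placeholder numbers, purely illustrative.
bound = worst_case_cost(input_tokens=2000, max_output_tokens=4000,
                        input_price_per_1k=1.0, output_price_per_1k=2.0)
print(bound)
```

The bound holds whether or not the hidden tokens are visible; what it can't tell you is how much of the cap a typical request actually uses.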

BoorishBears|1 year ago

For one, you don't get to see any output at all if you run out of tokens during thinking.

If you set a limit, once it's hit you just get a failed request, with no insight into where or why the CoT went off the rails.

dartos|1 year ago

You can’t verify that you’re paying what you should be if you can’t see the hidden tokens.

HarHarVeryFunny|1 year ago

They are currently trying to raise money (there's talk of a new $150B valuation), so that may have something to do with it.