top | item 46211612

(no title)

cgorlla | 2 months ago

I checked with the team and it may have been some temporary rate-limiting issue. We've rectified the results, it seems to be an isolated case.

https://www.ctgt.ai/benchmarks

discuss

order

rancar2|2 months ago

Thanks for the thoroughness! I look forward to the next steps as you all apply this approach in other unique ways to have even better results.

SomaticPirate|2 months ago

Are these benchmarks correct that adding Anthropic's Constitutional AI system prompt lowered results across all the models?