(no title)
degrews | 6 months ago
That eval has also become a lot less relevant (it's considered not very indicative of real-world performance), so it's unlikely Anthropic will prioritize optimizing for it in future models.
kmacdough | 6 months ago
Meanwhile, Meta and xAI are behind the curve and largely marketing-focused.
ttroyr | 6 months ago