gizmodo59 | 2 months ago

Is that your website or something? You keep promoting it

Mkengin|2 months ago

No, I am not affiliated with the website. I just want to see more discussion based on uncontaminated benchmarks, and I feel that people rely too much on benchmarks that companies can run themselves. When a company can run the benchmark on its own models, I don't feel I can trust the results. For general LLM capabilities, for example, I would also tend to rely on dubesor [1] rather than Artificial Analysis or similar leaderboards.

[1] https://dubesor.de/benchtable