top | item 42136061 (no title) Ldorigo | 1 year ago Do we a know whether the current SOTA foundation models (Gemini, gpt4o, Claude, etc) are actually all GPT-based (as in, causal models)? discuss order hn newest falcor84|1 year ago I feel a bit bad bringing this up, but should Gemini actually be considered SOTA?They make impressive demos, but I can't recall any of their released models being at the top of any leaderboard.EDIT: Sorry, looking into it a bit more now, they still seem to be at the top in term of the context window, so they got that going for them. azinman2|1 year ago Leaderboards are misleading. Try diff models for YOUR task and you’ll see a wide variety of outputs compared to “official” rankings. load replies (1) dartos|1 year ago GPT-based isn’t really a thing outside of openai (it’s just the commercial name for their models)But I believe we’re confident that all major models are causal transformer models right now.No reason to believe otherwise. If one of them was doing something different, they’d let us know in order to stand out. Tostino|1 year ago No, they didn't get to co-opt that word. load replies (1)
falcor84|1 year ago I feel a bit bad bringing this up, but should Gemini actually be considered SOTA?They make impressive demos, but I can't recall any of their released models being at the top of any leaderboard.EDIT: Sorry, looking into it a bit more now, they still seem to be at the top in term of the context window, so they got that going for them. azinman2|1 year ago Leaderboards are misleading. Try diff models for YOUR task and you’ll see a wide variety of outputs compared to “official” rankings. load replies (1)
azinman2|1 year ago Leaderboards are misleading. Try diff models for YOUR task and you’ll see a wide variety of outputs compared to “official” rankings. load replies (1)
dartos|1 year ago GPT-based isn’t really a thing outside of openai (it’s just the commercial name for their models)But I believe we’re confident that all major models are causal transformer models right now.No reason to believe otherwise. If one of them was doing something different, they’d let us know in order to stand out. Tostino|1 year ago No, they didn't get to co-opt that word. load replies (1)
falcor84|1 year ago
They make impressive demos, but I can't recall any of their released models being at the top of any leaderboard.
EDIT: Sorry, looking into it a bit more now, they still seem to be at the top in term of the context window, so they got that going for them.
azinman2|1 year ago
dartos|1 year ago
But I believe we’re confident that all major models are causal transformer models right now.
No reason to believe otherwise. If one of them was doing something different, they’d let us know in order to stand out.
Tostino|1 year ago