(no title)
xendipity | 1 year ago
> Priced at 15 cents per million input tokens and 60 cents per million output tokens, the GPT-4o mini is more than 60% cheaper than GPT-3.5 Turbo, OpenAI said. It currently outperforms the GPT-4 model on chat preferences and scored 82% on Massive Multitask Language Understanding (MMLU), OpenAI said.
...
> The GPT-4o mini model's score compared with 77.9% for Google's Gemini Flash and 73.8% for Anthropic's Claude Haiku, according to OpenAI.
For some more context: We don't know the size of 4o-mini but Mistral's just released NeMo 12B scores 68% on the MMLU. [2]
[1]: https://www.reuters.com/technology/artificial-intelligence/o...
pzo|1 year ago
Gemma 2 27B scored: 75.2 in MMLU
LLama 3 70B scored: 79.5 in MMLU
Haiku scored: 75.2 in MMLU
GPT 3.5 scored: 70.0 in MMLU
Based on pricing I see in openrouter.ai across different providers this seems like the cheapest model for this kind of performance.
ref: [0] https://www-cdn.anthropic.com/de8ba9b01c9ab7cbabf5c33b80b7bb...
[1] https://blog.google/technology/developers/google-gemma-2/