(no title)
python273 | 8 months ago
There's not much incentive to subsidize prices for OpenRouter providers for example, and the prices are much lower than the $6.37/M estimate from the article.
https://openrouter.ai/meta-llama/llama-3.3-70b-instruct
avg $0.37/M input tokens, $0.73/M output tokens (21 providers)
Llama is not even a good example, as the recent models are more optimized using Mixture Of Experts and KV cache compression.
No comments yet.