top | item 44551095

(no title)

tompark | 7 months ago

More context at: "Moonshot's Kimi K2 uses a 1T-parameter MoE architecture with 32B active parameters and outperforms models like GPT-4.1 and DeepSeek-V3 on key benchmarks" <https://www.techmeme.com/250712/p11#a250712p11>

discuss

order

No comments yet.