I suspect that the OpenRouter result originates from a quantized hosting provider. The difference compared to the direct API call from Moonshot is striking, almost like night and day. It creates a peculiar user and developer experience since OpenRouter enforces quantization restrictions only at the API level, rather than at the account settings level.
simonw|3 months ago
irthomasthomas|3 months ago
-o provider '{ "only": ["moonshotai"] }'