The margins on inference definetly aren’t negative. An easy way to check this is by looking at the costs of using cloud hosted open source models, which necessarily are served at a positive margin, and are much lower $/token than what you get from the labs.
unknown|13 days ago
[deleted]