top | item 46175375

(no title)

xfalcox | 2 months ago

We have vLLM for running text LLMs in production. What is the equivalent for this model?

discuss

order

mh-|2 months ago

I would say there's isn't an equivalent. Some people will probably tell you ComfyUI - you can expose workflows via API endpoints and parameterize them. This is how e.g. Krita AI Diffusion uses a ComfyUI backend.

For various reasons, I doubt there are any large scale SaaS-style providers operating this in production today.

salty_frog|2 months ago

I'm intrigued by the various reasons why you think there are not any large scale SAAS operating this in production?