top | item 46812474 (no title) tarruda | 1 month ago I'm only interested in the local, single user use case. Plus I use a Mac studio for inference, so vLLM is not an option for me. discuss order hn newest mycall|1 month ago You can get concurrency gains [0] as local/single user (multi-agent) use case with vLLM with your Mac Studio.[0] https://youtu.be/Ze5XLooTt6g?t=658
mycall|1 month ago You can get concurrency gains [0] as local/single user (multi-agent) use case with vLLM with your Mac Studio.[0] https://youtu.be/Ze5XLooTt6g?t=658
mycall|1 month ago
[0] https://youtu.be/Ze5XLooTt6g?t=658