androiddrew|8 days ago
Yeah, there is a lot of advantage to having this machine, because the CUDA stack is still king. My two AMD GPUs are suffering when it comes to working with the ROCm stack. I have forks of Ollama and vLLM that took many weekends to figure out.

Zetaphor|8 days ago
https://github.com/kyuz0/amd-strix-halo-toolboxes
It takes all the work out of it: you just start llama-server in the container context and you're off doing inference, without having to figure out dependencies.
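For anyone wanting to try it, a rough sketch of what the client side looks like once you're inside one of the toolboxes and have llama-server running: the server exposes an OpenAI-compatible /v1/chat/completions endpoint (port 8080 is its default), so a stdlib-only Python script is enough to do inference. The model path and launch flags in the comments are placeholders; check the repo README for the actual image tags and recommended flags.

    # Query a llama-server instance running inside the toolbox container.
    # Assumes the server was started with something like:
    #   llama-server -m model.gguf --port 8080 -ngl 99
    # (model path and flags are placeholders; see the toolboxes README)
    import json
    import urllib.request

    payload = {
        "messages": [{"role": "user", "content": "Say hello from Strix Halo."}],
        "max_tokens": 64,
    }
    req = urllib.request.Request(
        "http://127.0.0.1:8080/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # The response follows the OpenAI chat-completions shape.
    print(body["choices"][0]["message"]["content"])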