banderwidthdk | 11 months ago | on: The Nvidia DGX Spark Is a Tiny 128GB AI Mini PC Made for Scale-Out Clustering
From my experience LLM inference really, really likes memory bandwidth, which at 1.79 TB/s the 5090 has quite the lead on the APU's 273GB/s