Digits will be $3k and have 128GB of unified memory, so don't we already know that it wouldn't compare well this this rig? 128 won't be enough to fit the model in memory.
Check out the power draw metrics. Following the CPU+GPU power consumption, it seems like it averaged 22W for about a minute. Unless I'm missing something, the inference for this example consumed at most .0004 kWh.
That's almost nothing. If these models are capable/functional enough for most day-to-day uses, then useful LLM-based GenAI is already at the "too cheap to meter" stage.
This is amazing!! What kind of applications are you considering for this? A part from saving variable costs, fine tuning extensively and security… I’m curious to evaluate this in a financial perspective, as variable costs can be daunting, but not too much “yet”.
I’m hoping NVIDIA comes up with their new consumer computer soon!
Fascinating to read the thinking process of a flush vs a straight in poker. It's circular nonsense that is not at all grounded in reason — it's grounded in the factual memory of the rules of Poker, repeated over and over as it continues to doubt itself and double-check. What nonsense!
How many additional nuclear power plants will need to be built because even these incredibly technical achievements are, under the hood, morons? XD
mythz|1 year ago
Will be interesting to see the value/performance compared to next gen M4 Ultra's (or Extreme?) vs NVIDIA's new DIGITS [2] when they're released.
[1] https://x.com/carrigmat/status/1884244369907278106
[2] https://www.nvidia.com/en-us/project-digits/
CamperBob2|1 year ago
mrcwinn|1 year ago
As for Apple, we'll see.
rahimnathwani|1 year ago
6 to 8 tokens per second.
And less than a tenth of the cost of a GPU setup.
phonon|1 year ago
danans|1 year ago
That's almost nothing. If these models are capable/functional enough for most day-to-day uses, then useful LLM-based GenAI is already at the "too cheap to meter" stage.
danans|1 year ago
teruakohatu|1 year ago
I don't think they specified what they were using for networking, but it was probably Thunderbolt/USB4 networking which can reach 40Gbps.
shihab|1 year ago
doctoboggan|1 year ago
rashidae|1 year ago
I’m hoping NVIDIA comes up with their new consumer computer soon!
iFred|1 year ago
CharlesW|1 year ago
creativenolo|1 year ago
DrNosferatu|1 year ago
Still interesting though.
mrcwinn|1 year ago
How many additional nuclear power plants will need to be built because even these incredibly technical achievements are, under the hood, morons? XD
talldayo|1 year ago
[deleted]
epistasis|1 year ago
And cheaper than a lot hobbyists' bicycles!
mrbungie|1 year ago
But you're right, just let's keep waiting for the town-sized Data Centers + Power Plants kindly served by our big tech overlords.
PS: If you refer to it being a Mac, obviously you can build a more cost efficient but difficult to cool rig.
sitkack|1 year ago
cma|1 year ago