top | item 44333729

(no title)

treesciencebot | 8 months ago

the main question is going to be software stack. NVIDIA is already shipping NVFP4 kernels and perf is looking good. It took a really long time after MI300X's that the FP8 kernels were OK (not even good, compared to almost perfect FP8 support in NVIDIA side of things).

I will doubt that they will be able to reach %60-70 of the FLOPs in majority of the workloads (unless they hand craft and tune a specific GEMM kernel for their benchmark shape). But would be happy to be proven wrong, and go buy a bunch of them

discuss

order

pella|8 months ago

(related)

Tinygrad:

  "We've been negotiating a $2M contract to get AMD on MLPerf, but one of the sticking points has been confidentiality. Perhaps posting the deliverables on X will help legal to get in the spirit of open source!"

   "Contract is signed! No confidentiality, AMD has leadership that's capable of acting. Let's make this training run happen, we work in public on our Discord.
" https://x.com/__tinygrad__/status/1935364905949110532

LeonM|8 months ago

It still amazes me that George/Tinycorp somehow seems to get AMD on board every time, and being blissfully unaware that they are a very small player. See for example top comment here [0].

Don't get me wrong, I think it's impressive what he achieved so far, and I hope tiny can stay competitive in this market.

[0] https://news.ycombinator.com/item?id=36193625