There's a lot of people in this thread that don't seem to have caught up with the fact that AMD has worked very hard on their cuda translation layer and for the most part it just works now, you can build cuda projects on amd just fine on modern hardware/software.
numbers_guy|11 days ago
If you want portablitiy you need a machine learning compiler ala TorchInductor or TinyGrad or OpenXLA.
jillesvangurp|12 days ago