nikonyrh | 6 months ago
Making something merely CUDA-compatible is non-trivial, and because Nvidia controls CUDA's direction and new features, any alternative would always lag behind. There are also major hardware differences between Nvidia and AMD today, which can make highly optimized CUDA code inefficient or even buggy on AMD hardware.
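To make the "even buggy" part concrete, here is a rough sketch of one well-known difference: Nvidia warps are 32 lanes wide, while AMD's GCN/CDNA wavefronts are 64 lanes wide. The snippet is a plain-Python simulation of a shuffle-down tree reduction, not real GPU code; the function name `shuffle_down_reduce` and its parameters are hypothetical, and the out-of-range behavior (a lane reads its own value) is modeled on how CUDA's shuffle-down behaves, as an assumption of this sketch.

```python
def shuffle_down_reduce(lane_values, assumed_width):
    """Simulate a shuffle-down tree reduction and return lane 0's result.

    len(lane_values) plays the role of the real hardware width;
    assumed_width is the width the kernel author hardcoded.
    """
    hardware_width = len(lane_values)
    vals = list(lane_values)
    offset = assumed_width // 2
    while offset > 0:
        # Each lane adds the value held by lane (i + offset).
        # Lanes whose source would be out of range read their own value.
        vals = [
            vals[i] + vals[i + offset if i + offset < hardware_width else i]
            for i in range(hardware_width)
        ]
        offset //= 2
    return vals[0]


ones_32 = [1] * 32  # stand-in for a 32-lane warp
ones_64 = [1] * 64  # stand-in for a 64-lane wavefront

print(shuffle_down_reduce(ones_32, assumed_width=32))  # 32: full sum
print(shuffle_down_reduce(ones_64, assumed_width=32))  # 32: half the lanes are silently dropped
print(shuffle_down_reduce(ones_64, assumed_width=64))  # 64: full sum
```

The middle case is the problematic one: code tuned around a hardcoded width of 32 still runs on a 64-lane wavefront but quietly produces a partial sum. Real kernels would typically query the width (for example via the built-in `warpSize`) rather than hardcode it, though heavily optimized code often bakes such assumptions in.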
[1] https://github.com/ROCm/composable_kernel?tab=readme-ov-file#composable-kernel
[2] https://github.com/triton-lang/triton?tab=readme-ov-file#triton
[3] https://github.com/ROCm/HIP?tab=readme-ov-file#what-is-this-repository-for