top | item 45138088

Alexander-Barth | 5 months ago

Actually, in Julia you can write kernels in a subset of the Julia language:

https://cuda.juliagpu.org/stable/tutorials/introduction/#Wri...
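For reference, a minimal vector-add kernel in the style of that tutorial (a sketch, assuming CUDA.jl and a CUDA-capable GPU):

```julia
using CUDA

# Each GPU thread handles one element of the arrays.
function gpu_add!(y, x)
    i = (blockIdx().x - 1) * blockDim().x + threadIdx().x
    if i <= length(y)
        @inbounds y[i] += x[i]
    end
    return nothing
end

x = CUDA.fill(1.0f0, 1024)
y = CUDA.fill(2.0f0, 1024)

# Launch with enough blocks of 256 threads to cover all elements.
@cuda threads=256 blocks=cld(length(y), 256) gpu_add!(y, x)
```

The kernel body is plain Julia; CUDA.jl compiles it to PTX via the same compiler infrastructure as the rest of the language.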

With KernelAbstractions.jl you can target both CUDA and ROCm:

https://juliagpu.github.io/KernelAbstractions.jl/stable/kern...
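The same vector add written against KernelAbstractions.jl looks like this (a sketch; `CUDABackend()` and `ROCBackend()` come from CUDA.jl and AMDGPU.jl respectively):

```julia
using KernelAbstractions

# Backend-agnostic kernel: the same code runs on CPU, CUDA, or ROCm,
# depending on which backend object it is instantiated with.
@kernel function add!(y, x)
    i = @index(Global)
    @inbounds y[i] += x[i]
end

backend = CPU()  # swap in CUDABackend() or ROCBackend() for GPUs
x = ones(Float32, 1024)
y = ones(Float32, 1024)

# Instantiate for the backend (workgroup size 64), then launch.
add!(backend, 64)(y, x; ndrange = length(y))
KernelAbstractions.synchronize(backend)
```

Because the backend is just a value passed at instantiation time, the same kernel source can be dispatched to whichever vendor's hardware is available.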

For Python (or rather, a Python-like DSL), there is also Triton (and probably others):

https://pytorch.org/blog/triton-kernel-compilation-stages/


davidatbu | 5 months ago

Chris's claim (at least with regard to Triton) is that it delivers about 80% of the performance, and they're aiming for closer to 100%.