This work is four years old (with development happening before that), so the Julia GPU capabilities probably weren't good enough at the time. If you wanted to do it today, that'd probably be the way to do it, but would need some benchmarking.
A lot of modern supercomputer use/have GPUs. But most GPUs had very bad fp64 compute capabilities, so they were not really used for anything requiring precision for a long time.
nevi-me|5 years ago
Interesting that they did this with Julia, with 83% of instructions being AVX-512 (if I'm reading it correctly).
Does anyone know if Julia's GPU capabilities could have been leveraged on say a cluster of NVIDIA A100/V100?
KenoFischer|5 years ago
maeln|5 years ago