top | item 39973700 (no title) geph2021 | 1 year ago As far as I can tell, its optional dependency is Open MP, not CUDA. Doesn't seem directly dependent on CUDA. discuss order hn newest dnlzro|1 year ago The plan is to eventually implement with CUDA:"Currently, I am working on [...] direct CUDA implementation, which will be significantly faster and probably come close to PyTorch." gpderetta|1 year ago Yes, a quick skim of the code only shows openmp dependency. The C/CUDA reference might have meant to be C/OMP .Although I wonder if it would work well with GCC PTX OMP offloading.
dnlzro|1 year ago The plan is to eventually implement with CUDA:"Currently, I am working on [...] direct CUDA implementation, which will be significantly faster and probably come close to PyTorch."
gpderetta|1 year ago Yes, a quick skim of the code only shows openmp dependency. The C/CUDA reference might have meant to be C/OMP .Although I wonder if it would work well with GCC PTX OMP offloading.
dnlzro|1 year ago
"Currently, I am working on [...] direct CUDA implementation, which will be significantly faster and probably come close to PyTorch."
gpderetta|1 year ago
Although I wonder if it would work well with GCC PTX OMP offloading.