top | item 46217728

CUDA to cuTile transpiler for Nvidia's CUDA 13.1

1 points| rncode | 2 months ago |github.com

1 comment

order

rncode|2 months ago

We just dropped a new open source project: CUDA to cuTile transpiler for NVIDIA's CUDA 13.1

We built a transpiler that converts your CUDA kernels to cuTile automatically. It figures out what your kernel does (flash attention, matrix multiplication, RoPE) and writes the cuTile version

Zero AI involved! It's pure pattern matching and code analysis

Currently supports 18 kernel patterns: - Core: GEMM, Reduction, Scan, Stencil, Elementwise, FFT - ML/DL: Convolution (1D/2D/3D), Pooling, Normalization - LLM: Flash Attention, RoPE, KV Cache, Quantization (INT8/FP8) - Specialized: Sparse matrices, Histogram, Sorting

Contributions we need:More kernel pattern templates