Show HN: Pure CUDA C Inference for Qwen3 0.6B in One File, No Dependencies (github.com) 1 pts|7 months ago|discuss