spps11 | 1 year ago | on: Introduction to CUDA programming for Python developers
spps11's comments
spps11 | 1 year ago | on: Introduction to CUDA programming for Python developers
spps11 | 1 year ago | on: Introduction to CUDA programming for Python developers
spps11 | 1 year ago | on: Introduction to CUDA programming for Python developers
> with better AI models and tools like Cursor, we will move to a world where you can mold code ever more specific to your use case to make it more performant
what do you think the value of having the right abstraction will be in such a world?
spps11 | 1 year ago | on: How browsers really load web pages [video]
expecting current gen SWEs to talk about network layer protocols while answering this is kinda the same as expecting 1990s SWEs to include wire physics and dispersion statistics in their answer to this question.
Depth alone isn't always a good indicator. We have to move on from some of the low level stuff at some point and it is okay for engineers to know in detail about things that have been solved long back.
spps11 | 1 year ago | on: Show HN: Immersive Gaussian Splat experience of Sutro Tower, San Francisco
spps11 | 1 year ago | on: Introduction to CUDA programming for Python developers
I have a slightly tangential question: Do you have any insights into what exactly DeepSeek did by bypassing CUDA that made their run more efficient?
I always found it surprising that a core library like Cuda, developed over such a long time, still had room for improvement—especially to the extent that a seemingly new team of developers could bridge the gap on their own.
spps11 | 1 year ago | on: DeepSeek's multi-head latent attention and other KV cache tricks
Thanks for the post, it was an excellent read!
spps11 | 1 year ago | on: Cryptoscammers Impersonated and Hacked Us – Now What?