nravic's comments

nravic | 9 months ago | on: Compiling LLMs into a MegaKernel: A path to low-latency inference

This is super interesting! We do something similar I think by taking a checkpoint after model initialization. I'm curious what you think about our approach, here's some benchmarks: https://docs.cedana.ai/articles/performance-of-cedanas-gpu-i...

We do some on-the-fly optimizations as well (like compiling into CUDA graphs or fusing together calls) which ends up resulting (for some inference engines) faster token throughput too.

nravic | 3 years ago | on: Ask HN: Who is hiring? (July 2022)

Hey! I'd love to learn more about the position - the hook @ jpl email on your website is bouncing though. Do you mind sending me contact details so we can chat?

Thanks :)

nravic | 5 years ago | on: The death of corporate research labs

If you're looking at aerospace/defense startups, this is usually the case.

It's not unheard of for them to employ people who's primary role is grant writing to try and get (for example) SBIR funding

page 1