top | item 44964118

(no title)

jafioti | 6 months ago

yep! currently we're emitting cuda / metal but once the search is better, i want to directly emit ptx / low-level asm on other hardwares.

discuss

order

Lerc|6 months ago

I don't suppose you have an eye towards verilog in the long term?

I'm curious as to the breadth of possibilities that could be searched. I would imagine something like this could invent flash attention if it cast its net wide enough, but that is a pretty broad net. [Edit: I scrolled back and saw flash attention was explicitly mentioned, cool stuff]

bojle|6 months ago

Equality saturation (something that luminal uses at its core) is a topic for hardware synthesis and verification too. Something like dynamic hardware generation (instead of kernel generation). For example, see this thesis [1] by Samuel Coward of Imperial.

[1] https://samuelcoward.co.uk/assets/pdf/Thesis_Imperial.pdf

jafioti|6 months ago

you suppose correctly ;)