top | item 44112689

(no title)

Stem0037 | 9 months ago

I wonder how much of this overhead (like the 250µs for activations/consistency on B200) could be further chipped away with even finer-grained control or different sync primitives.

discuss

order

No comments yet.