Stem0037 | 9 months ago

I wonder how much of this overhead (like the 250µs for activations/consistency on B200) could be further chipped away with even finer-grained control or different sync primitives.