(no title)
shihab | 1 month ago
Something I recently learnt: the actual number of physical registers in modern x86 CPUs are significantly larger, even for 512-bit SIMD. Zen 5 CPUs actually have 384 vectors registers, 384*512b = 24KB!
shihab | 1 month ago
Something I recently learnt: the actual number of physical registers in modern x86 CPUs are significantly larger, even for 512-bit SIMD. Zen 5 CPUs actually have 384 vectors registers, 384*512b = 24KB!
cmovq|1 month ago
zeusk|1 month ago
justifa|1 month ago
[deleted]
dapperdrake|1 month ago
And the critical matrix tiling size is often SRAM, so L3 unified cache.