jra101 | 3 years ago
Instead of computing 8 independent values, compute one with 8x more iterations:
  for (int i = 0; i < count * 8; i++) {
    v0 += acc * v0;
  }
That, plus hardcoding the iteration count as a compile-time constant so the compiler can unroll the loop, might help get closer to SOL (speed of light, i.e. the theoretical peak throughput).
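To make the difference concrete, here is a rough sketch in plain C of the two variants: one with a runtime trip count and one with the count baked in. The function names and the ITERATIONS value (128 * 8) are made up for illustration, and the benchmark under discussion is presumably a GPU kernel, so treat this as a CPU-side approximation of the idea rather than the actual test code.

```c
#include <assert.h>

/* Hypothetical fixed trip count; the comment above only says "count * 8". */
#define ITERATIONS (128 * 8)

/* Runtime trip count: the compiler does not know the bound,
   so it has less freedom to unroll the loop. */
static float fma_chain_runtime(float v0, float acc, int count) {
    assert(count >= 0);
    for (int i = 0; i < count * 8; i++) {
        v0 += acc * v0;   /* single dependent multiply-add chain */
    }
    return v0;
}

/* Compile-time trip count: the bound is a constant,
   so the compiler is free to unroll the loop. */
static float fma_chain_fixed(float v0, float acc) {
    for (int i = 0; i < ITERATIONS; i++) {
        v0 += acc * v0;   /* same chain, constant bound */
    }
    return v0;
}
```

Both functions compute the same chain; the only difference is whether the loop bound is visible at compile time. Whether the compiler actually unrolls, and whether that helps, depends on the compiler and the target, so it is worth checking the generated code.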
clamchowder|3 years ago
WithinReason|3 years ago