Yes, but the part that got me was the halving of the result followed by the clamping. SIMD generally makes sense, but for an instruction this complex to exist, there is usually a very specific workload (a particular video codec, for example) that benefits greatly from it.
ekelsen|1 year ago
See, for example, https://math.mit.edu/~stevenj/18.335/newton-sqrt.pdf
The initial guess is an approximate square root, and refining it involves a halving as part of the calculation.
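To make the halving concrete, here is a minimal sketch of the textbook Newton iteration for square roots, x_next = (x + a/x) / 2. The function name `newton_sqrt` and the `steps` parameter are made up for illustration, and this is plain scalar Python rather than whatever the SIMD instruction under discussion actually computes.

```python
def newton_sqrt(a, x0, steps=6):
    """Refine an approximate square root x0 of a using Newton's method.

    Hypothetical helper for illustration only; assumes a > 0 and x0 > 0.
    """
    x = x0
    for _ in range(steps):
        # Average the current guess with a / guess.
        # The multiply by 0.5 is the halving step.
        x = 0.5 * (x + a / x)
    return x


if __name__ == "__main__":
    # Start from a rough guess and let the iteration refine it.
    print(newton_sqrt(2.0, 1.0))
```

The halving is on every pass through the loop, so hardware that folds it into the same instruction as the add saves a separate operation per step.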
creato|1 year ago
epcoa|1 year ago