There are many ways to compute the same matrix multiplication that apply the sum reduction in different orders, and those orders can produce different answers with floating point values. Floating point addition is not truly associative, since every intermediate sum gets rounded.
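A minimal sketch of the effect in plain Python, assuming IEEE 754 doubles (which is what CPython floats are on common platforms). Each output element of a matmul is a dot product, so here is one dot product reduced two ways. The function names `dot_sequential` and `dot_pairwise` and the input values are made up for illustration, not taken from any particular BLAS or GPU kernel.

```python
import math

def dot_sequential(a, b):
    # Accumulate products strictly left to right, like a simple loop would.
    total = 0.0
    for x, y in zip(a, b):
        total += x * y
    return total

def dot_pairwise(a, b):
    # Tree-style reduction: sum each half separately, then combine.
    # Parallel kernels often reduce in a shape like this.
    products = [x * y for x, y in zip(a, b)]

    def reduce(values):
        if len(values) == 1:
            return values[0]
        mid = len(values) // 2
        return reduce(values[:mid]) + reduce(values[mid:])

    return reduce(products)

# Hypothetical inputs chosen so that rounding is visible.
a = [1e16, 1.0, -1e16, 1.0]
b = [1.0, 1.0, 1.0, 1.0]

sequential = dot_sequential(a, b)
pairwise = dot_pairwise(a, b)
# math.fsum tracks the lost low-order bits and returns the correctly rounded sum.
exact = math.fsum(x * y for x, y in zip(a, b))

print(sequential, pairwise, exact)
```

Same inputs, same math on paper, but the two reduction orders disagree with each other and with the correctly rounded sum, because the 1.0 terms are rounded away at different points.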
spwa4|9 days ago
Also, if the inference provider guarantees specific hardware, this shouldn't happen.
nomel|9 days ago