You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is kind of a retry at #898. One of the
problems there was that it would have added overhead and regressed
performance for typical inputs.
Unlike that PR, this doesn't aim for sub-linear scaling; the cost of
evaluating `fmod(x, y)` is still roughly proportional to `log2(|x/y|)`.
However, the constant factor is much better. Running the
`random`-benchmarks locally, I got walltime reductions of
fmodf16: -56.9%
fmodf: -85.0%
fmod: -95.4%
fmodf128: -98.7%
0 commit comments