ispc/ctx.cpp at 1dedd881327dc7b5609e49533b7e1aae1ef15e81

aaron/ispc


Matt Pharr 1dedd88132 Improve implementation of 'are both masks equal' check for AVX.

Previously, we did a vector equal compare and then a movmsk, the
result of which we checked to see if it was on for all lanes.
Because masks are vectors of i32s, under AVX, the vector equal
compare required two 4-wide SSE compares and some shuffling.
Now, we do a movmsk of both masks first and then a scalar
equality comparison of those two values, which seems to generate
overall better code.
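
The two strategies described in this commit message can be illustrated with a small, portable sketch. It is a scalar emulation only, not the code ispc actually emits: the `Mask` type and the function names `movmsk`, `masksEqualVectorCompare`, and `masksEqualMovmskFirst` are hypothetical, and it assumes every mask lane is either all zeros or all ones (0 or -1 as an i32).

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for an 8-wide AVX mask: one i32 per lane,
// assumed to be either 0 (lane off) or -1 (all bits set, lane on).
using Mask = std::array<int32_t, 8>;

// Scalar emulation of a movmsk operation: pack the sign bit of each
// lane into the low bits of a single integer.
inline uint32_t movmsk(const Mask &m) {
    uint32_t bits = 0;
    for (std::size_t i = 0; i < m.size(); ++i) {
        assert(m[i] == 0 || m[i] == -1);  // assumed mask invariant
        if (m[i] < 0)
            bits |= (1u << i);
    }
    return bits;
}

// Earlier approach: lane-wise equality compare, then movmsk of the
// result, then check that the bit is on for all eight lanes.
inline bool masksEqualVectorCompare(const Mask &a, const Mask &b) {
    Mask eq{};
    for (std::size_t i = 0; i < eq.size(); ++i)
        eq[i] = (a[i] == b[i]) ? -1 : 0;
    return movmsk(eq) == 0xFFu;
}

// Newer approach: movmsk each mask first, then a single scalar
// equality comparison of the two packed values.
inline bool masksEqualMovmskFirst(const Mask &a, const Mask &b) {
    return movmsk(a) == movmsk(b);
}
```

Under the stated lane invariant both functions return the same answer. The second avoids the vector compare entirely, which is the step the message says needed two 4-wide SSE compares plus shuffling under AVX.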

2011-09-15 06:25:02 -07:00

75 KiB
