This reverts commit d9c38b5c1f.
d9c38b5c1f
This path was poorly maintained and wasn't actually available on most targets.
Like SSE4-8 and SSE4-16, these use 8-bit and 16-bit values for mask elements, respectively, and thus should generate the best code when used for computation with datatypes of those sizes.