Available on x86-64 and target feature 
sse2 only.Expand description
Shuffles 16-bit integers in the low 64 bits of a using the control in
IMM8.
Put the results in the low 64 bits of the returned vector, with the high 64
bits being copied from from a.