Available on x86 and target feature avx512f,avx512vbmi2,avx512vl,avx,sse only.
Load contiguous active 16-bit integers from unaligned memory at mem_addr (those with their respective bit set in mask k), and store the results in dst using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
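
The lane-by-lane behaviour described above can be sketched as a portable scalar model. This is an illustration only, not the intrinsic itself: `maskz_expandload_epi16_model` is a hypothetical helper, the lane count `N` is an assumed parameter (the text does not say which vector width this entry covers), and a slice stands in for the raw `mem_addr` pointer.

```rust
/// Scalar model of a zero-masked expand-load over `N` 16-bit lanes.
/// Hypothetical helper for illustration; it is not part of `core::arch`.
/// Assumes `N <= 32` so every lane has a corresponding bit in `k`.
pub fn maskz_expandload_epi16_model<const N: usize>(k: u32, mem: &[i16]) -> [i16; N] {
    // Lanes whose mask bit is clear keep this initial zero (the "zeromask" part).
    let mut dst = [0i16; N];
    // Source elements are consumed contiguously, one per set mask bit.
    let mut src = mem.iter().copied();
    for (j, lane) in dst.iter_mut().enumerate() {
        if (k >> j) & 1 == 1 {
            *lane = src
                .next()
                .expect("mem must hold one element per set mask bit");
        }
    }
    dst
}
```

For example, with `N = 8`, mask `0b1010_0101` and memory `[10, 20, 30, 40]`, the model places the four values in lanes 0, 2, 5 and 7 and returns `[10, 0, 20, 0, 0, 30, 0, 40]`. Only as many elements as there are set mask bits are read, which is why a four-element slice suffices here.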