Available on x86 and target feature 
avx512vbmi2 only.Expand description
Contiguously store the active 8-bit integers in a (those with their respective bit set in writemask k) to dst, and pass through the remaining elements from src.