Available on x86 and target feature 
avx,sse only.Expand description
Stores the high and low 128-bit halves (each composed of 4 packed
single-precision (32-bit) floating-point elements) from a into memory two
different 128-bit locations.
hiaddr and loaddr do not need to be aligned on any particular boundary.