Available on x86 and target feature 
avx only.Expand description
Stores 256-bits (composed of 4 packed double-precision (64-bit)
floating-point elements) from a into memory.
mem_addr does not need to be aligned on any particular boundary.