Available on x86 and target feature avx only.
Loads 256 bits (composed of 4 packed double-precision (64-bit)
floating-point elements) from memory into result.
mem_addr must be aligned on a 32-byte boundary or a
general-protection exception may be generated.
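
The alignment requirement above can be met by placing the data in a type with an explicit 32-byte alignment. The following is a minimal sketch, not an official example: `Aligned32` and `load_aligned` are hypothetical names introduced here for illustration, and the runtime check assumes the standard `is_x86_feature_detected!` macro is used to guard the call.

```rust
#[cfg(target_arch = "x86")]
use std::arch::x86::*;
#[cfg(target_arch = "x86_64")]
use std::arch::x86_64::*;

/// Hypothetical wrapper (not part of std) that forces 32-byte alignment,
/// so a pointer to the inner array satisfies the mem_addr requirement.
#[repr(C, align(32))]
struct Aligned32([f64; 4]);

/// Hypothetical helper: loads the four doubles with `_mm256_load_pd`
/// and copies them back out. Returns `None` if AVX is not detected.
#[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
fn load_aligned(src: &Aligned32) -> Option<[f64; 4]> {
    // Only call the intrinsic when the CPU reports AVX support.
    if !is_x86_feature_detected!("avx") {
        return None;
    }

    #[target_feature(enable = "avx")]
    unsafe fn inner(src: &Aligned32) -> [f64; 4] {
        let mut out = [0.0f64; 4];
        unsafe {
            // Aligned load: the pointer is 32-byte aligned via `Aligned32`.
            let v = _mm256_load_pd(src.0.as_ptr());
            // Unaligned store, since `out` has no alignment guarantee
            // beyond that of f64.
            _mm256_storeu_pd(out.as_mut_ptr(), v);
        }
        out
    }

    // Safety: AVX was detected above and `src` is 32-byte aligned.
    Some(unsafe { inner(src) })
}

/// Fallback for other architectures so callers still compile.
#[cfg(not(any(target_arch = "x86", target_arch = "x86_64")))]
fn load_aligned(_src: &Aligned32) -> Option<[f64; 4]> {
    None
}
```

Calling `load_aligned(&Aligned32([1.0, 2.0, 3.0, 4.0]))` on a machine with AVX should hand back the same four values in memory order. When alignment cannot be guaranteed, the unaligned variant `_mm256_loadu_pd` is the usual alternative.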