Function core::arch::x86::_mm256_hadd_ps   
1.27.0 · Available on (x86 or x86-64) and target feature
avx and x86 only.
Horizontally adds adjacent pairs of elements in the two packed vectors
a and b, each holding eight 32-bit floating-point values.
In the result, sums of elements from a are returned in locations of
indices 0, 1, 4, 5, while sums of elements from b are returned in
locations 2, 3, 6, 7.
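
The lane layout described above can be illustrated with a minimal sketch. The helper names hadd_ps_reference, hadd_ps_avx and hadd_ps are hypothetical and not part of core::arch. The scalar function is only a model of the documented index mapping, and the wrapper assumes std is available for run-time feature detection.

```rust
#[cfg(target_arch = "x86")]
use core::arch::x86::*;
#[cfg(target_arch = "x86_64")]
use core::arch::x86_64::*;

/// Hypothetical scalar model of the documented layout: within each
/// 128-bit half, the two pair sums from `a` come first, then the two
/// pair sums from `b`.
fn hadd_ps_reference(a: [f32; 8], b: [f32; 8]) -> [f32; 8] {
    [
        a[0] + a[1], a[2] + a[3], // indices 0, 1: from a (low half)
        b[0] + b[1], b[2] + b[3], // indices 2, 3: from b (low half)
        a[4] + a[5], a[6] + a[7], // indices 4, 5: from a (high half)
        b[4] + b[5], b[6] + b[7], // indices 6, 7: from b (high half)
    ]
}

/// Hypothetical helper that calls the intrinsic on plain arrays.
/// Callers must ensure the CPU supports AVX.
#[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
#[target_feature(enable = "avx")]
unsafe fn hadd_ps_avx(a: [f32; 8], b: [f32; 8]) -> [f32; 8] {
    let mut out = [0.0f32; 8];
    unsafe {
        // Unaligned loads and stores, since arrays of f32 are not
        // guaranteed to be 32-byte aligned.
        let va = _mm256_loadu_ps(a.as_ptr());
        let vb = _mm256_loadu_ps(b.as_ptr());
        let sum = _mm256_hadd_ps(va, vb);
        _mm256_storeu_ps(out.as_mut_ptr(), sum);
    }
    out
}

/// Hypothetical safe wrapper: uses the intrinsic when AVX is detected
/// at run time and falls back to the scalar model otherwise.
fn hadd_ps(a: [f32; 8], b: [f32; 8]) -> [f32; 8] {
    #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
    {
        if is_x86_feature_detected!("avx") {
            // SAFETY: AVX support was confirmed just above.
            return unsafe { hadd_ps_avx(a, b) };
        }
    }
    hadd_ps_reference(a, b)
}
```

With a = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0] and b = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0, 80.0], both paths should yield [3.0, 7.0, 30.0, 70.0, 11.0, 15.0, 110.0, 150.0].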