Available on x86 and target feature fma only.
Multiplies packed single-precision (32-bit) floating-point elements in a
and b, and subtracts packed elements in c from the negated intermediate
result. Each lane therefore computes -(a * b) - c.
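
The per-lane arithmetic can be illustrated with a portable scalar model. This is only a sketch, not the intrinsic itself: the function name `fnmsub_lanes` is hypothetical, and the four-lane width is an assumption, since this section does not state the vector width. It uses `f32::mul_add` so that each lane is rounded once, as a fused operation would be.

```rust
/// Hypothetical scalar model of the per-lane operation: -(a * b) - c.
/// The lane count of 4 is an assumption made for illustration.
fn fnmsub_lanes(a: [f32; 4], b: [f32; 4], c: [f32; 4]) -> [f32; 4] {
    let mut out = [0.0f32; 4];
    for i in 0..4 {
        // (-a) * b + (-c), computed with a single rounding step.
        out[i] = (-a[i]).mul_add(b[i], -c[i]);
    }
    out
}

fn main() {
    let a = [1.0, 2.0, 3.0, 4.0];
    let b = [5.0, 6.0, 7.0, 8.0];
    let c = [1.0, 1.0, 1.0, 1.0];
    // Lane 0: -(1 * 5) - 1 = -6, and so on for the other lanes.
    println!("{:?}", fnmsub_lanes(a, b, c));
}
```

On real hardware the intrinsic performs this for all lanes in one instruction and should only be called after confirming FMA support, for example with `is_x86_feature_detected!("fma")`.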