Available on x86-64 and target feature fma only.
Multiplies the lower double-precision (64-bit) floating-point elements in
a and b, and adds the negated intermediate result to the lower element
in c. Stores the result in the lower element of the returned value, and
copies the upper element from a to the upper element of the result.
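
As a rough illustration of the behavior described above, the sketch below assumes the intrinsic in question is `_mm_fnmadd_sd` from `std::arch::x86_64`. The helper names `fnmadd_sd_demo` and `fnmadd_sd_checked` are hypothetical and exist only for this example. The lower lane should come out as `-(a[0] * b[0]) + c[0]`, and the upper lane should be copied from `a`.

```rust
#[cfg(target_arch = "x86_64")]
use std::arch::x86_64::*;

/// Hypothetical helper: applies `_mm_fnmadd_sd` to two-element arrays,
/// where index 0 is the lower lane and index 1 is the upper lane.
///
/// Safety: the caller must ensure the CPU supports the `fma` feature.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "fma")]
unsafe fn fnmadd_sd_demo(a: [f64; 2], b: [f64; 2], c: [f64; 2]) -> [f64; 2] {
    unsafe {
        // Unaligned loads: element 0 of each array lands in the lower lane.
        let va = _mm_loadu_pd(a.as_ptr());
        let vb = _mm_loadu_pd(b.as_ptr());
        let vc = _mm_loadu_pd(c.as_ptr());

        // Lower lane: -(a[0] * b[0]) + c[0]. Upper lane: copied from `a`.
        let r = _mm_fnmadd_sd(va, vb, vc);

        let mut out = [0.0_f64; 2];
        _mm_storeu_pd(out.as_mut_ptr(), r);
        out
    }
}

/// Hypothetical safe wrapper: returns `None` when `fma` is not available
/// at run time, so the intrinsic is never executed on unsupported CPUs.
#[cfg(target_arch = "x86_64")]
pub fn fnmadd_sd_checked(a: [f64; 2], b: [f64; 2], c: [f64; 2]) -> Option<[f64; 2]> {
    if is_x86_feature_detected!("fma") {
        // Sound because the feature check above succeeded.
        Some(unsafe { fnmadd_sd_demo(a, b, c) })
    } else {
        None
    }
}

/// Fallback for other architectures, where the intrinsic does not exist.
#[cfg(not(target_arch = "x86_64"))]
pub fn fnmadd_sd_checked(_a: [f64; 2], _b: [f64; 2], _c: [f64; 2]) -> Option<[f64; 2]> {
    None
}
```

For example, calling `fnmadd_sd_checked([2.0, 10.0], [3.0, 20.0], [10.0, 30.0])` on a CPU with FMA support should give `Some([4.0, 10.0])`: the lower lane is `-(2.0 * 3.0) + 10.0`, and the upper lane is taken from `a`. The run-time check is one possible way to satisfy the target-feature requirement; compiling with the feature enabled is another.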