Hello
I have been trying to compile code for AVX512 with -O2 and all the other optimizations. However stumbled upon a small problem.
When using the zmm registers before each arithmetic instruction such as vaddpd, vsubpd, vmulpd, fmaddpd move instructions pop out of the blue. Each time the data from the high zmm registers (16 - 31) is first moved to the low zmm registers and the the operation is being done...
Is this a hardware problem where the instructions only use the low registers or is it a compiler bug?
Best,
Thom