hpc/roofline/report/inputs/discussion.tex
2016-06-23 21:22:30 +02:00

10 lines
589 B
TeX

According to the definition used the arithmetic intensity is measured by operations per byte. This might not be adequat for haswell processors (and later). Due to the fused multiply-add\footnote{although called multiply-add there are 36 different slightly instructions} extension two floating point operations can be performed with a single instruction.
- worse results for 4 threads @ NUMA-STREAM not necessarily expected
- better results for triad possibly due to combined storage in FMA
- striding for arrays
%%% Local Variables:
%%% mode: latex
%%% TeX-master: "../report"
%%% End: