hpc/roofline/report/inputs/discussion.tex

11 lines
589 B
TeX
Raw Normal View History

According to the definition used the arithmetic intensity is measured by operations per byte. This might not be adequat for haswell processors (and later). Due to the fused multiply-add\footnote{although called multiply-add there are 36 different slightly instructions} extension two floating point operations can be performed with a single instruction.
2016-06-23 00:40:48 +00:00
- worse results for 4 threads @ NUMA-STREAM not necessarily expected
2016-06-23 19:22:30 +00:00
- better results for triad possibly due to combined storage in FMA
- striding for arrays
2016-06-23 00:40:48 +00:00
%%% Local Variables:
%%% mode: latex
%%% TeX-master: "../report"
%%% End: