Quantcast
Channel: Intel® Software - Intel® C++ Compiler
Viewing all articles
Browse latest Browse all 1175

OpenMP programming for multiple sockets server ccNUMA

$
0
0

Hi,

I am facing a problem while programming in OpenMP. I am running my code on a server which has 48 cores and two sockets (each socket has 24 cores). The scalability of my program is good upto 24 OpenMP threads scheduled as static. But after that running time increases when I scale over 24 cores and it continues unto 48. I assume this problem is due to bandwidth limitation due to ccNUMA architecture. My question is how can I solve this problem in comparatively better way? It would be great to know and also thanks in advance.

The running time of my code is O(n^2). I am just putting openmp for loop directives before executing loop like following:

#pragma omp parallel for schedule(static)


Viewing all articles
Browse latest Browse all 1175

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>