hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
<ste||ar-github>
hpx/khuck-patch-2 6c362d7 Kevin Huck: Merge branch 'master' into khuck-patch-2
ste||ar-github has left #ste||ar [#ste||ar]
hkaiser has quit [Quit: bye]
nanashi55 has quit [Ping timeout: 245 seconds]
nanashi55 has joined #ste||ar
ste||ar-github has joined #ste||ar
<ste||ar-github>
[hpx] sithhell reopened pull request #3439: Use correct MPI CXX libraries for MPI parcelport (master...fix-ubuntu-bionic-mpi) https://github.com/STEllAR-GROUP/hpx/pull/3439
ste||ar-github has left #ste||ar [#ste||ar]
wash[m] has quit [Ping timeout: 260 seconds]
wash[m] has joined #ste||ar
jaafar has quit [Ping timeout: 250 seconds]
david_pfander has joined #ste||ar
daissgr has joined #ste||ar
<_diers_>
Hello, I have a problem with NUMA on a dual-socket system (2x12 cores + HT). I'm testing a 2D stencil based on the HPX tutorial. Once the two NUMA domains are used, the performance breaks down completely. Maybe someone can give me support / help to investigate the problem?
<heller>
hi _diers_
<heller>
first, it would help if you can share the code
<heller>
second, how you run it
<_diers_>
hi @heller
<_diers_>
I think i can reproduce it with the unchanged tuturial code stencil_parallel_1.cpp . I will compile it.
<heller>
stencil_parallel_1.cpp is not NUMA aware ;)
<heller>
crap, it should be ... my mistake
<_diers_>
Yes, I can reproduce it. Where should I paste the results and the call?
<heller>
may I ask what "lstopo" is giving you for your allocation?
<heller>
the other thing is, that the problem size seems to be very small
<_diers_>
likwid is also available
<_diers_>
i added a comment
<heller>
lstopo would have been more interesting ;)
<heller>
let me check with my local system
<_diers_>
yes, but I also get almost the same behavior with a 25 point stencil.
<_diers_>
It is also possible that something is wrongly configured on the system.
<heller>
no, I can reproduce the results
<zao>
Ooh.
<heller>
it gets better once you increase the number of elements in the y direction
<heller>
so, the tutorial code only parallelizes in y direction, IIRC
<_diers_>
is the output of lstopo-no-graphics enough?
<heller>
so it looks like there just isn't enough parallelism for more than 2 NUMA domains
<heller>
yes
nikunj has joined #ste||ar
<_diers_>
added the output of lstopo
<_diers_>
Should I switch Nx,Ny or increase it?
<_diers_>
...for more than 1 NUMA domain?
<heller>
if you increase Ny, you should see the expected performance behavior
<_diers_>
I added a comment with an increased Ny
<heller>
_diers_: that's better ;D
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<_diers_>
ok, but that's not a solution for me ;-)
<jbjnr>
_diers_: I have been working on improved numa support and I will take a look at the stencil example. Could you please file an issue on github that describes the problem and your findings.
<heller>
_diers_: ok, I guess you need the fixed size there? May I ask you to run stencil_parallel_4.cpp and see if that exhibits the same behavior?
<_diers_>
@jbjnr ok i will create an issue tomorrow
<_diers_>
@heller ok, i test it
aserio has joined #ste||ar
<_diers_>
@heller Behave the same with stencil_parallel_4.cpp. Added the results to the existing gist.
<heller>
hmn, ok
eschnett has joined #ste||ar
hkaiser has quit [Quit: bye]
bibek has quit [Quit: Konversation terminated!]
bibek has joined #ste||ar
eschnett has quit [Quit: eschnett]
Anushi1998 has joined #ste||ar
Anushi1998 has quit [Quit: Bye]
Anushi1998 has joined #ste||ar
anushi has joined #ste||ar
Anushi1998 has quit [Ping timeout: 245 seconds]
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
eschnett has joined #ste||ar
eschnett has quit [Ping timeout: 260 seconds]
aserio1 has joined #ste||ar
aserio has quit [Ping timeout: 260 seconds]
anushi_ has joined #ste||ar
aserio1 has quit [Ping timeout: 240 seconds]
anushi has quit [Ping timeout: 260 seconds]
aserio has joined #ste||ar
anushi has joined #ste||ar
anushi_ has quit [Ping timeout: 260 seconds]
anushi_ has joined #ste||ar
eschnett has joined #ste||ar
anushi has quit [Ping timeout: 246 seconds]
david_pfander has quit [Ping timeout: 246 seconds]
aserio has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
parsa[[w]] has quit [Read error: Connection reset by peer]
parsa[w] has joined #ste||ar
daissgr has quit [Quit: WeeChat 1.9.1]
aserio has joined #ste||ar
aserio has quit [Remote host closed the connection]