hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
eschnett has joined #ste||ar
Anushi1998 has joined #ste||ar
eschnett has quit [Quit: eschnett]
K-ballo has quit [Quit: K-ballo]
Anushi1998 has quit [Quit: Bye]
ste||ar-github has joined #ste||ar
<ste||ar-github> [hpx] khuck pushed 1 new commit to khuck-patch-2: https://github.com/STEllAR-GROUP/hpx/commit/6c362d7c1417fab101ec76a476fedcfe93de17f2
<ste||ar-github> hpx/khuck-patch-2 6c362d7 Kevin Huck: Merge branch 'master' into khuck-patch-2
ste||ar-github has left #ste||ar [#ste||ar]
hkaiser has quit [Quit: bye]
nanashi55 has quit [Ping timeout: 245 seconds]
nanashi55 has joined #ste||ar
ste||ar-github has joined #ste||ar
<ste||ar-github> [hpx] sithhell reopened pull request #3439: Use correct MPI CXX libraries for MPI parcelport (master...fix-ubuntu-bionic-mpi) https://github.com/STEllAR-GROUP/hpx/pull/3439
ste||ar-github has left #ste||ar [#ste||ar]
wash[m] has quit [Ping timeout: 260 seconds]
wash[m] has joined #ste||ar
jaafar has quit [Ping timeout: 250 seconds]
david_pfander has joined #ste||ar
daissgr has joined #ste||ar
<_diers_> Hello, I have a problem with NUMA on a dual-socket system (2x12 cores + HT). I'm testing a 2D stencil based on the HPX tutorial. Once the two NUMA domains are used, the performance breaks down completely. Maybe someone can give me support / help to investigate the problem?
<heller> hi _diers_
<heller> first, it would help if you can share the code
<heller> second, how you run it
<_diers_> hi @heller
<_diers_> I think i can reproduce it with the unchanged tuturial code stencil_parallel_1.cpp . I will compile it.
<heller> stencil_parallel_1.cpp is not NUMA aware ;)
<heller> crap, it should be ... my mistake
<_diers_> Yes, I can reproduce it. Where should I paste the results and the call?
<heller> gist.github.com maybe?
<heller> interesting...
<heller> may I ask what "lstopo" is giving you for your allocation?
<heller> the other thing is, that the problem size seems to be very small
<_diers_> likwid is also available
<_diers_> i added a comment
<heller> lstopo would have been more interesting ;)
<heller> let me check with my local system
<_diers_> yes, but I also get almost the same behavior with a 25 point stencil.
<_diers_> It is also possible that something is wrongly configured on the system.
<heller> no, I can reproduce the results
<zao> Ooh.
<heller> it gets better once you increase the number of elements in the y direction
<heller> so, the tutorial code only parallelizes in y direction, IIRC
<_diers_> is the output of lstopo-no-graphics enough?
<heller> so it looks like there just isn't enough parallelism for more than 2 NUMA domains
<heller> yes
nikunj has joined #ste||ar
<_diers_> added the output of lstopo
<_diers_> Should I switch Nx,Ny or increase it?
<_diers_> ...for more than 1 NUMA domain?
<heller> if you increase Ny, you should see the expected performance behavior
<_diers_> I added a comment with an increased Ny
<heller> _diers_: that's better ;D
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<_diers_> ok, but that's not a solution for me ;-)
<jbjnr> _diers_: I have been working on improved numa support and I will take a look at the stencil example. Could you please file an issue on github that describes the problem and your findings.
<heller> _diers_: ok, I guess you need the fixed size there? May I ask you to run stencil_parallel_4.cpp and see if that exhibits the same behavior?
<_diers_> @jbjnr ok i will create an issue tomorrow
<_diers_> @heller ok, i test it
aserio has joined #ste||ar
<_diers_> @heller Behave the same with stencil_parallel_4.cpp. Added the results to the existing gist.
<heller> hmn, ok
eschnett has joined #ste||ar
hkaiser has quit [Quit: bye]
bibek has quit [Quit: Konversation terminated!]
bibek has joined #ste||ar
eschnett has quit [Quit: eschnett]
Anushi1998 has joined #ste||ar
Anushi1998 has quit [Quit: Bye]
Anushi1998 has joined #ste||ar
anushi has joined #ste||ar
Anushi1998 has quit [Ping timeout: 245 seconds]
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
eschnett has joined #ste||ar
eschnett has quit [Ping timeout: 260 seconds]
aserio1 has joined #ste||ar
aserio has quit [Ping timeout: 260 seconds]
anushi_ has joined #ste||ar
aserio1 has quit [Ping timeout: 240 seconds]
anushi has quit [Ping timeout: 260 seconds]
aserio has joined #ste||ar
anushi has joined #ste||ar
anushi_ has quit [Ping timeout: 260 seconds]
anushi_ has joined #ste||ar
eschnett has joined #ste||ar
anushi has quit [Ping timeout: 246 seconds]
david_pfander has quit [Ping timeout: 246 seconds]
aserio has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
parsa[[w]] has quit [Read error: Connection reset by peer]
parsa[w] has joined #ste||ar
daissgr has quit [Quit: WeeChat 1.9.1]
aserio has joined #ste||ar
aserio has quit [Remote host closed the connection]
<hkaiser> helyt?
<hkaiser> heller: yt?
eschnett has quit [Quit: eschnett]
jaafar has joined #ste||ar
eschnett has joined #ste||ar
aserio has joined #ste||ar
hkaiser has quit [Quit: bye]
eschnett has quit [Quit: eschnett]
hkaiser has joined #ste||ar
aserio has quit [Quit: aserio]
nikunj has quit [Quit: Bye]