hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu
<hkaiser> PatrickDiehl[m]: +1
diehlpk has joined #ste||ar
K-ballo has quit [Quit: K-ballo]
hkaiser has quit [Quit: Bye!]
diehlpk has quit [Quit: Leaving.]
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<jedi18[m]> hkaiser: What chunk sizes do you recommend I test it with? I tried chunk sizes of size/48 and size/96, but those perform the same as or worse than the default
<hkaiser> jedi18[m]: I think reducing the number of chunks might help
<hkaiser> i.e. one chunk per core
<hkaiser> especially for small sequence sizes
<jedi18[m]> Ok so since there are 48 cores, size/48 should do that right?
<jedi18[m]> https://github.com/Jedi18/scan_benchmarks/tree/main/varying_chunk_size the default still seems to perform better
<hkaiser> jedi18[m]: ok
<hkaiser> jedi18[m]: can you try running with --hpx:threads=24 or so?
<hkaiser> btw, the default is cores*4 chunks
<hkaiser> also, could you add the L1/L2/L3 cache sizes as vertical lines on the graph? that might help explain the drops in scaling
<jedi18[m]> Oh ok sure
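For context, a minimal sketch of what the "one chunk per core" suggestion could look like, assuming the scan benchmark uses hpx::inclusive_scan with a parallel execution policy; the sequence size, the data, and the hard-coded 48-core count are placeholders taken from the conversation, not from the actual benchmark code:

// Hedged sketch only: overrides the default chunking (roughly cores*4 chunks,
// as noted above) with one chunk per core via static_chunk_size.
#include <hpx/algorithm.hpp>
#include <hpx/execution.hpp>
#include <hpx/init.hpp>
#include <hpx/numeric.hpp>

#include <cstddef>
#include <vector>

int hpx_main(int, char**)
{
    std::size_t const size = std::size_t(1) << 24;    // placeholder sequence size
    std::size_t const cores = 48;                      // node discussed above

    std::vector<int> in(size, 1), out(size);

    // One chunk per core (size/48) instead of the default ~cores*4 chunks.
    auto policy = hpx::execution::par.with(
        hpx::execution::static_chunk_size(size / cores));

    hpx::inclusive_scan(policy, in.begin(), in.end(), out.begin());

    return hpx::finalize();
}

int main(int argc, char* argv[])
{
    return hpx::init(argc, argv);    // worker-thread count set via --hpx:threads=N
}

Rerunning with fewer worker threads, as suggested above, is then only a command-line change (e.g. passing --hpx:threads=24 to the benchmark binary); no rebuild is needed.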
Yorlik has joined #ste||ar
<gnikunj[m]> hkaiser yes, I'm glad my university recognized our work! I'll get down to Louisiana this December. So let me treat you to a beer ;)
<hkaiser> gnikunj[m]: +1
tufei has joined #ste||ar