#ste||ar on 2022-03-26 — irc logs at irclog.cct.lsu.edu

2021-08-06 22:55 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu

02:22 K-ballo has quit [Quit: K-ballo]

02:31 hkaiser has quit [Quit: Bye!]

03:20 chaitanya710 has joined #ste||ar

08:55 chaitanya710 has quit [Ping timeout: 256 seconds]

09:39 chaitanya710 has joined #ste||ar

10:22 Ahmedehabb has joined #ste||ar

10:23 <Ahmedehabb> hello , i have made a tentative proposal for the Implement a Faster Associative Container for GIDs project.

10:23 <Ahmedehabb> should i send it here or privately ?

10:28 Ahmedehabb has quit [Quit: Client closed]

10:29 Ahmedehabb has joined #ste||ar

12:31 K-ballo has joined #ste||ar

13:44 Ahmedehabb has quit [Quit: Client closed]

13:47 hkaiser has joined #ste||ar

14:17 <srinivasyadav227> hkaiser: is `--hpx:numa-sensitive`(https://hpx-docs.stellar-group.org/latest/html/manual/hpx_runtime_and_resources.html#hpx-runtime-and-resources) the one you told about work stealing related to NUMA in the meeting ?

14:17 <hkaiser> srinivasyadav227: not sure anymore ;-)

14:18 <hkaiser> this options is on by default anyways, so will not have any measurable effect

14:18 <srinivasyadav227> i tried using this, for the mandelbrot, but it didnt work

14:18 <srinivasyadav227> yeah, there was no effect

14:18 <hkaiser> nod

14:18 <hkaiser> I created a PR that might improve things, could you try that?

14:18 <srinivasyadav227> i saw your branch, https://github.com/STEllAR-GROUP/hpx/tree/numa_stealing

14:19 <hkaiser> #5825

14:19 <hkaiser> yes

14:19 <hkaiser> it might have no effect, please try it out

14:19 <srinivasyadav227> yea, i was about to link that none , i will try with this

14:19 <hkaiser> thanks

14:20 <srinivasyadav227> i compared hpx::compute::vector with std::vector as well but they dont see to have any improvements,

14:22 <hkaiser> ok, I didn't expect for compute::vector to make a big difference, it's the allocator/executor that should change things

14:24 <srinivasyadav227> yes, i mean, hpx with allocator's and executor's VS plain std::vector without allocator and executors

14:36 <hkaiser> ok

14:44 * srinivasyadav227 uploaded an image: (58KiB) < https://libera.ems.host/_matrix/media/r0/download/matrix.org/cNJeeLdhthbvmZqSpKLzwnrd/Mandelbrot%20set%20-%20strong%20scaling%20graph%20(A64FX).png >

14:46 <srinivasyadav227> hkaiser: this is the scaling for mandelbrot on ookami (48 cores), only parallelization

14:46 <srinivasyadav227> it has 4 numa domains, each containing 12 cores

14:48 <srinivasyadav227> till 1 numa domain (12 cores), 75% is being parallelized; till 2 numa domains (24 cores) its around 60-65, after that it drops to 50% parallelization

15:03 <hkaiser> yah, cross numa-traffic

15:03 <hkaiser> so something is off with the allocator/executor - those should take care of data locality

15:05 <hkaiser> srinivasyadav227: gtg now, sorry

15:06 <srinivasyadav227> np :), i will try to profile the application till then

15:08 hkaiser has quit [Quit: Bye!]

15:10 chaitanya710 has quit [Quit: Client closed]

15:52 diehlpk_work_ has quit [Ping timeout: 240 seconds]

17:34 Ahmedehabb has joined #ste||ar

17:47 Ahmedehabb has quit [Quit: Client closed]

17:47 Ahmedehabb has joined #ste||ar

19:13 diehlpk_work_ has joined #ste||ar

20:29 K-ballo has quit [Quit: K-ballo]

20:30 K-ballo has joined #ste||ar

23:02 Ahmedehabb has quit [Quit: Client closed]