hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu
Yorlik has quit [Ping timeout: 250 seconds]
K-ballo has quit [Ping timeout: 240 seconds]
K-ballo has joined #ste||ar
<gonidelis[m]> hkaiser: did you happen to check the docs HTML that I sent you last week?
<hkaiser> gonidelis[m]: yeah, let's go with it for now, it's still a RC
<hkaiser> I will do it for the final release
<gonidelis[m]> hkaiser: going over the work stealing scheduler
<gonidelis[m]> is this intended for 1.8?
<hkaiser> gonidelis[m]: no
<gonidelis[m]> ok
<hkaiser> I don't think this will be ready soon
<gonidelis[m]> ok, and about the performance test report: where is the 5%-10% perf increase visible?
<gonidelis[m]> in the report i mean
<hkaiser> gonidelis[m]: the perf test does not use the new scheduler
<gonidelis[m]> ahh so what is this about ?
<hkaiser> I wanted to set up a perf test CI on rostam first (there is a draft PR for this)
<hkaiser> not sure why this perf result was generated, it was an unrelated change, I believe
<gonidelis[m]> oh
<gonidelis[m]> so where do you see the perf results? on your local machine?
<hkaiser> local measurements
<gonidelis[m]> alright
<hkaiser> gonidelis[m]: btw, for task-bench, I want to try creating a numa-aware fork-join executor, let's see if that helps with ARM perf
<gonidelis[m]> arm perf?
<gonidelis[m]> you mean the results you got on fugaku?
<hkaiser> and ookami
<hkaiser> yes
<hkaiser> Nan has very bad perf results on ookami
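For context: HPX already ships a (non-NUMA-aware) fork-join executor; the NUMA-aware variant hkaiser mentions does not exist yet. A minimal sketch of the existing executor with a parallel algorithm, assuming the hpx::execution::experimental::fork_join_executor API of recent HPX versions:

```cpp
// Sketch: the existing (non-NUMA-aware) fork_join_executor. It keeps its
// worker threads active between parallel regions, which helps
// fine-grained workloads like task-bench.
#include <hpx/algorithm.hpp>
#include <hpx/execution.hpp>
#include <hpx/hpx_main.hpp>

#include <vector>

int main()
{
    std::vector<double> v(1'000'000, 1.0);

    hpx::execution::experimental::fork_join_executor exec;

    // Run a parallel algorithm on the fork-join executor.
    hpx::for_each(hpx::execution::par.on(exec), v.begin(), v.end(),
        [](double& x) { x *= 2.0; });

    return 0;
}
```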
<gonidelis[m]> ...
<gonidelis[m]> is the whole prefetching fuss I've been hearing about related to that NUMA awareness?
<hkaiser> no, that's different and unrelated
<gonidelis[m]> ok let's please go through it on thursday
<gonidelis[m]> start getting our hands dirty
<hkaiser> ok
<gonidelis[m]> finally, was the latest apex tag integrated into master?
<hkaiser> not yet, it's still a PR
<hkaiser> #5860
<gonidelis[m]> ok i will go through them rn
<hkaiser> we also need #5864 in the release
<gonidelis[m]> see what's missing
<hkaiser> and your release docs PR, that's it
<gonidelis[m]> hm
<gonidelis[m]> ?
<hkaiser> yes
<hkaiser> ATOMIC_FLAG_INIT has been deprecated in C++20, so #5864 makes sure it's not used anymore
<gonidelis[m]> the macro name is different though ;p
<hkaiser> is it?
<gonidelis[m]> INIT_FLAG
<gonidelis[m]> FLAG_INIT
<hkaiser> ahh, it's just a typo, thanks
<hkaiser> will fix in the PR
<gonidelis[m]> what's its purpose though
<gonidelis[m]> why initialize atomic_flag ?
<hkaiser> read the cppref page
<gonidelis[m]> why need to ^^
<gonidelis[m]> yes that's a question on cppref
<hkaiser> let's talk Thu
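For reference on the question above: before C++20 the initial state of a default-constructed std::atomic_flag was unspecified, so the macro was required to guarantee a clear flag; since C++20 the default constructor clears the flag and the macro is deprecated. A minimal illustration:

```cpp
#include <atomic>

// Pre-C++20 idiom: the initial state of a default-constructed
// std::atomic_flag was unspecified, so it had to be initialized with
//   std::atomic_flag f = ATOMIC_FLAG_INIT;   // deprecated in C++20
//
// Since C++20 the default constructor initializes the flag to clear:
std::atomic_flag f;

int main()
{
    // test_and_set returns the previous state: false here, because the
    // flag is guaranteed to start out clear under C++20.
    return f.test_and_set() ? 1 : 0;
}
```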
diehlpk has joined #ste||ar
K-ballo has quit [Quit: K-ballo]
diehlpk has quit [Quit: Leaving.]
diehlpk has joined #ste||ar
diehlpk has quit [Ping timeout: 240 seconds]
diehlpk has joined #ste||ar
diehlpk has left #ste||ar [#ste||ar]
hkaiser has quit [Quit: Bye!]
Yorlik has joined #ste||ar
hkaiser has joined #ste||ar
K-ballo has joined #ste||ar
K-ballo has quit [Read error: Connection reset by peer]
K-ballo has joined #ste||ar
K-ballo has quit [Read error: Connection reset by peer]
K-ballo has joined #ste||ar
<satacker[m]> In case of a class inheriting `tag_fallback_noexcept`, how do I make sure that tag dispatching takes place when the user writes an overload with an incorrect return type?
<hkaiser> satacker[m]: how do you overload on the return type?
<hkaiser> I don't think that's possible
<satacker[m]> hkaiser: I meant `tag_invoke`
<satacker[m]> * Sorry, I meant
<hkaiser> C++ doesn't allow you to overload functions based on the return type
<satacker[m]> hkaiser: yes, I used completely wrong terminology. Say tag dispatching using `tag_fallback_noexcept`: when will the user's `tag_invoke` be called?
<satacker[m]> is the impl
<satacker[m]> are the failing tests
<hkaiser> tag_invoke will be tried first (i.e. used if it is valid), tag_fallback_invoke will be used (if it is valid) only if no tag_invoke's are available
<satacker[m]> hkaiser: Thanks, but how do I make sure the tag_invoke return type is valid, not possible?
<satacker[m]> (The tag_invoke which user implements)
<hkaiser> it will fail to compile if not
<satacker[m]> Okay, thanks, I have probably made some other mistake, because it compiles; only a static_assert fails, due to other reasons.
<K-ballo> how can it compile if static_assert fails?
<satacker[m]> static_assert as in tests
<satacker[m]> and same is the case of wg21 implementation... (full message at https://libera.ems.host/_matrix/media/r0/download/libera.chat/44c120333b1982b184b1d8a9a88c6c4f443d4f5a)
<satacker[m]> This is the BAL's implementation
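To illustrate the dispatch order hkaiser describes: the CPO prefers a user-supplied tag_invoke overload (found by ADL) and uses tag_fallback_invoke only when no tag_invoke is viable; a mismatched return type simply fails to compile at the call site. A self-contained C++20 sketch of that rule, hand-rolled rather than HPX's actual tag_fallback_noexcept machinery:

```cpp
#include <iostream>
#include <utility>

struct my_cpo_t
{
    // Fallback: used only when no user tag_invoke overload is viable.
    template <typename T>
    friend int tag_fallback_invoke(my_cpo_t, T&&)
    {
        std::cout << "tag_fallback_invoke (default)\n";
        return 0;
    }

    template <typename T>
    auto operator()(T&& t) const
    {
        // Prefer a user-provided tag_invoke found via ADL on T.
        if constexpr (requires { tag_invoke(my_cpo_t{}, std::forward<T>(t)); })
            return tag_invoke(*this, std::forward<T>(t));
        else
            return tag_fallback_invoke(*this, std::forward<T>(t));
    }
};
inline constexpr my_cpo_t my_cpo{};

struct customized
{
    // User customization; preferred over the fallback.
    friend int tag_invoke(my_cpo_t, customized const&)
    {
        std::cout << "user tag_invoke\n";
        return 1;
    }
};

int main()
{
    my_cpo(42);            // no tag_invoke for int -> fallback fires
    my_cpo(customized{});  // user overload wins
}
```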
<gonidelis[m]> hkaiser: could you please check your email?
<hkaiser> gonidelis[m]: will do
<gonidelis[m]> Thanks!
<hkaiser> gonidelis[m]: see pm, pls
<Yorlik> o/
<hkaiser> \o
<Yorlik> How's HPX doing these days? I've been quite a bit out of the loop. We're still at 1.7.1 and working on connecting Unreal Engine 5 to the server.
<Yorlik> Seems everyone is busy - gotta run. See you another day :)
* Yorlik waves and fades
Yorlik has quit [Quit: Leaving]
<diehlpk_work> hkaiser, Ok, boost does not support cross compilation using the Fujitsu compiler
<hkaiser> diehlpk_work: nod, do we need the Fujitsu compiler?
<diehlpk_work> hkaiser, Do we have a bug in the hello_world_distributed?
<diehlpk_work> If I run it without mpiexec, it shows 0 loc with 48 cores
<hkaiser> diehlpk_work: if you run it without mpiexec it will create one locality
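A minimal sketch of the behavior hkaiser describes (not the actual example's code, but using the standard HPX API): run standalone there is exactly one locality, id 0, owning all worker threads, e.g. 48 on a 48-core node; under `mpiexec -n N`, N localities are created and each prints its own id.

```cpp
#include <hpx/hpx.hpp>
#include <hpx/hpx_main.hpp>
#include <hpx/iostream.hpp>

int main()
{
    // Standalone: one locality (id 0) with all cores. With mpiexec -n N:
    // N localities, each printing its own id.
    hpx::cout << "hello from locality " << hpx::get_locality_id()
              << " of " << hpx::get_num_localities(hpx::launch::sync)
              << ", running " << hpx::get_os_thread_count()
              << " worker threads" << hpx::endl;
    return 0;
}
```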
<diehlpk_work> If I run with mpiexec it shows 0 loc with 48 cores as well, but the cores are printed multiple times
<hkaiser> really?
<hkaiser> that doesn't sound right
<hkaiser> what are your mpiexec parameters?
<diehlpk_work> mpiexec <path to app>
<hkaiser> no -n?
<diehlpk_work> I set the parameter using the scheduler
<hkaiser> what scheduler?
<hkaiser> srun?
<diehlpk_work> pjsub
<hkaiser> not sure if we support that
<hkaiser> diehlpk_work: is there some documentation on what environment variables this batch system creates?
<diehlpk_work> hkaiser, I need to check that
<hkaiser> we support alps, slurm, and pbs
<hkaiser> everything else needs to be added
<diehlpk_work> Fugaku uses pjsub
<diehlpk_work> To access the documentation, you need to install the root certificate
<hkaiser> I need the environment variables set by it
<hkaiser> this gives me a 404
<diehlpk_work> Ok, it seems they removed this post
<diehlpk_work> hkaiser, check your email
<hkaiser> ok, I'll have a look, thanks
<hkaiser> is this batch system derived from one of the others?
<diehlpk_work> hkaiser, I never heard of this tool before
<hkaiser> me neither
<diehlpk_work> #PJM --mpi "shape=1"
<diehlpk_work> #PJM --mpi "max-proc-per-node=48"
<diehlpk_work> Should give me 1 MPI rank with 48 cores
<hkaiser> nod
<diehlpk_work> They provide 1d, 2d, and 3d node allocation
<diehlpk_work> shape could be 1x2
<diehlpk_work> or 1x2x3
<diehlpk_work> That is interesting
<hkaiser> I don't have access to that repo
<diehlpk_work> If we have > 32 NUMA domains, then HPX gets worse
<diehlpk_work> Nan is inviting you
<diehlpk_work> hkaiser, I assume that HPX does not read the environment from the new scheduler correctly, and therefore a single-node run is slower than on Ookami using Slurm
<hkaiser> yes, I'm surprised anything works at all
K-ballo has quit [Ping timeout: 250 seconds]
K-ballo has joined #ste||ar
diehlpk_work has quit [Remote host closed the connection]