K-ballo changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
bita has joined #ste||ar
<gnikunj[m]> hkaiser here now
<gnikunj[m]> Idk how it slipped my notifications :/
hkaiser has joined #ste||ar
hkaiser has quit [Quit: bye]
bita has quit [Ping timeout: 240 seconds]
<gnikunj[m]> ms: I have HPX_WITH_CUDA and HPX_WITH_CUDA_COMPUTE turned ON but I don't see the header hpx/compute/cuda.hpp in the installed directory. Do I need to turn any other cmake option on?
<ms[m]> gnikunj: no... I think I can reproduce that, and it's very suspicious, I'll have a look
<ms[m]> thanks for pointing that out!
<gnikunj[m]> thanks!
jpinto[m] has quit [Quit: Idle for 30+ days]
<ms[m]> gnikunj: fixed (on master), sorry and thank you, that was a very good catch!
<gnikunj[m]> ms: that was quick. Thanks!
hkaiser has joined #ste||ar
<hkaiser> Thanks ms[m] and rori for preparing the release candidate!
K-ballo has quit [Read error: Connection reset by peer]
K-ballo has joined #ste||ar
<hkaiser> ms[m]: any idea what could have happened here: https://github.com/STEllAR-GROUP/hpx/pull/5153/checks?check_run_id=1859043009? I'm clueless...
<ms[m]> (maybe other similar ones)
<ms[m]> basically the hpx_async_base module gets included in the hpx_parallelism and hpx_full libraries, hence the odr violation
<ms[m]> you can simply remove it and add hpx_parallelism in DEPENDENCIES
<hkaiser> ok, sounds reasonable
<hkaiser> I will try that - thanks
<hkaiser> I would never have thought of this possibility :/
<ms[m]> it's possibly something we could detect earlier, but I don't know how easy it would be
<hkaiser> na, that's what we have the sanitizers for - good we have added them
<ms[m]> indeed, I like that odr violation one!
<hkaiser> so on linux, hpx_core/hpx_parallelism are not shared libraries?
<ms[m]> and I hope it's actually that, it's just the first thing that popped into my mind (there's been similar ones earlier)
<ms[m]> they are shared libraries
<hkaiser> I assumed that things that are exported from a shared library (async is a symbol exported from one) wouldn't be duplicated in any case
<hkaiser> but that's probably my misconception caused by assuming Windows dll visibility rules apply everywhere
<ms[m]> the duplication comes from when we construct the shared libraries from the modules (static libraries) where we have to make sure that each module is only included in one of the shared libraries, but then I don't know how the linker deals with e.g. preloading an allocator library which also exports malloc and friends...
<hkaiser> ms[m]: ahh, so the shared libraries are linked from a list of static libraries, which is causing some of the object files to end up in two of them
<hkaiser> that's something to watch out for
<hkaiser> IOW, the DEPENDENCIES clause in add_hpx_module should never contain any modules from another module category
<hkaiser> that's something we should try to detect indeed
<ms[m]> exactly, that's why there's a separate MODULE_DEPENDENCIES in addition to DEPENDENCIES when creating modules, the former should only include modules from within the shared library that the module itself belongs to
<ms[m]> right, exactly that :D
<hkaiser> ok, I'll take a stab at this
<hkaiser> thanks for the explanations
<ms[m]> yep
<ms[m]> ok, nice, thank you!
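
A minimal sketch of the problem described in the exchange above, assuming illustrative file names and build commands (not HPX's actual CMake logic): a definition compiled into a static "module" library is copied into every shared library that links that archive, so a program loading two such shared libraries ends up with two definitions of the same entity, which is the ODR violation the sanitizer reports.

    // async_base.cpp: hypothetical stand-in for a module source file.
    //
    // Build sketch (illustrative commands only):
    //   g++ -c -fPIC async_base.cpp -o async_base.o
    //   ar rcs libhpx_async_base.a async_base.o
    //   g++ -shared -o libhpx_parallelism.so -Wl,--whole-archive libhpx_async_base.a -Wl,--no-whole-archive
    //   g++ -shared -o libhpx_full.so        -Wl,--whole-archive libhpx_async_base.a -Wl,--no-whole-archive
    //
    // Both shared libraries now carry their own copy of the definitions
    // below; a program that links or preloads both sees two definitions of
    // the same entities, which is what the ODR sanitizer flags.

    int async_base_counter = 0;       // object with external linkage

    int bump_async_base_counter()     // function with external linkage
    {
        return ++async_base_counter;
    }
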
<srinivasyadav227> anyone working on "Adapt parallel algorithms to C++20" (#4822)?
<hkaiser> srinivasyadav227: yes, gonidelis[m] is working on it, but there is a sufficient amount of work for more than one person
<ms[m]> freenode_srinivasyadav227[m]: gonidelis[m] is
<ms[m]> too slow...
<srinivasyadav227> hkaiser: ok, I will try to work on it! :-)
<hkaiser> srinivasyadav227: please coordinate with gonidelis[m]
<srinivasyadav227> hkaiser: ok
<hkaiser> srinivasyadav227: I am planning to add two more tickets (also as gsoc projects) that are related: disentangle the segmented algorithm implementations from the parallel algorithms, and work on support for the par_unseq execution policy (vectorize the parallel algorithms)
<hkaiser> ms[m]: I have another nitpick for 5142, sorry
<ms[m]> hkaiser: :D
<hkaiser> ... or not :/
<ms[m]> no worries, I appreciate it, so tell me!
<srinivasyadav227> hkaiser: yea cool!
<hkaiser> could this take a string const& ?
<hkaiser> just avoiding another allocation, possibly
<ms[m]> it could, yes
<ms[m]> yeah, that makes sense
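
For context, a minimal sketch of the suggestion above; the function name is hypothetical, not the one from #5142. Passing std::string by value copies (and typically heap-allocates) when the caller already holds a string, while a std::string const& parameter binds to it directly:

    #include <string>

    // hypothetical API under review
    void set_pool_name_by_value(std::string name) { (void) name; }   // copies for lvalue arguments
    void set_pool_name(std::string const& name) { (void) name; }     // binds directly, no copy

    int main()
    {
        std::string const name = "a-reasonably-long-pool-name";
        set_pool_name_by_value(name);   // makes a copy of 'name' here
        set_pool_name(name);            // no copy, no extra allocation
        return 0;
    }
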
<hkaiser> srinivasyadav227: see #5156 and #5157, I'll create the gsoc projects later today
<hkaiser> srinivasyadav227: I have also marked #3364 as a possible gsoc project
<zao> Preparing in good time for GSoC, eh? :)
<zao> I guess that it's actually soon.
<srinivasyadav227> hkaiser: yup, saw them just now, so #3364 is like the major release to C++20, and #5156, #5157, #4822 and others are parts of it, right?
<srinivasyadav227> zao: this season the work and timeline have been reduced, right 😀, so early prep helps me :-)
<zao> I have no idea, I'm blissfully far from it :)
<srinivasyadav227> ok :)
<jaafar> Out of curiosity - I see the HPX build system has stuff for asan - has anyone tried running ubsan?
<hkaiser> srinivasyadav227: yes, google has reduced the period of performance to 7 weeks, iirc - however we will be able to extend this for another 4 paid weeks using our funds
<srinivasyadav227> hkaiser: that's really nice!
<hkaiser> srinivasyadav227: I have added some language explaining this here: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-(GSoC)-2021
shahrzad has joined #ste||ar
<gnikunj[m]> hkaiser: I'm back. Apologies for the short notice. Yes, I'll join today's meeting. I'm currently on my way to run the prototype on hpx+cuda to see if it works without failures there as well.
<hkaiser> cool, thanks
<hkaiser> gnikunj[m]: please look over the email I sent
<gnikunj[m]> yes, reading it
<srinivasyadav227> hkaiser: yea so far its clear! thanks :)
<srinivasyadav227> Just curious! Can anyone attend the meeting? ... wanted to be a spectator as it's related to CUDA
<hkaiser> srinivasyadav227: that is an internal project meeting, sorry
<srinivasyadav227> ok, np :-)
<hkaiser> srinivasyadav227: if it were up to me, sure - but I can't just drop somebody on our clients
<srinivasyadav227> no no, its fine! 😀
<gnikunj[m]> hkaiser: the email is well explained. If the gpu code runs fine, we already have more than we anticipated ;). The 2nd link you provided is broken btw.
<hkaiser> urgs
<hkaiser> could you respond to all and fix the link, pls?
<gnikunj[m]> sure
<hkaiser> gnikunj[m]: works for me :/
<gnikunj[m]> are you sure? You have an 80-character line break btw. So the tors.hpp part is on the next line and is not part of the link.
<hkaiser> that's your email client breaking things
<gnikunj[m]> at least, I can't open the link on the email I received
<gnikunj[m]> could be cct to gmail conversion issue then.
<hkaiser> nod
<gnikunj[m]> should I write an email then? (or leave it as is?)
<hkaiser> just leave it, they will ask, if needed
<gnikunj[m]> sounds good.
<gnikunj[m]> hkaiser: ms: how do I provide the gpu architecture to cmake to build the cuda backend for hpx?
<gnikunj[m]> default architecture does not compile for me
<gnikunj[m]> default is sm_20 btw, which needs CUDA version 7 or 8 (the current version is 11.2)
<ms[m]> gnikunj: cmake -DCUDA_NVCC_FLAGS=-arch=sm_XX
<ms[m]> where is sm_20 the default?
<gnikunj[m]> sm_20 is what my cmake took as default so I thought that must be set as default
hkaiser has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
<srinivasyadav227> gnikunj[m]: sm_20 supports CUDA 7 and later
<zao> Do current CUDA toolchains still support deprecated and obsolete models?
<srinivasyadav227> not later*, I think it's supported till 7
<zao> (I haven't had a GPU wired up to my VMs for a long while now)
<gnikunj[m]> zao: looks like it's a libdevice not found issue
<srinivasyadav227> zao: with later versions of CUDA, they don't support compiling for them
<gnikunj[m]> wait, I didn't see that it's an nvcc flag. ms: is there an equivalent flag for clang?
<gnikunj[m]> or do you insist on using nvcc for anything cuda?
<ms[m]> gnikunj: no, I prefer clang
<ms[m]> I think we have -DHPX_CUDA_CLANG_FLAGS=--cuda-gpu-arch=sm_XX
<gnikunj[m]> let me try it
<gnikunj[m]> HPX_WITH_CUDA_CLANG_FLAGS looks like it's working fine
<gnikunj[m]> ms: HPX_WITH_CUDA_CLANG_FLAGS isn't working either :/
<gnikunj[m]> now I'm getting weird C++ errors about things not being in the std namespace (possibly due to the omission of the CXX version)
<diehlpk_work> ms[m], yet?
<diehlpk_work> Have you experience with hpx-kokkos on Summit?
<diehlpk_work> I get the following error because kokkos cannot find some hpx headers. I assume this is related to a wrong HPX version, or?
<gnikunj[m]> hkaiser: ms zao could you please help me with the following make error? Is it something on my end or the build system? https://gist.github.com/NK-Nikunj/add34f1c8c5a944e6e24fcb8bd932fd0
<gnikunj[m]> diehlpk_work: what version of HPX are you using? Looks like an old version.
<diehlpk_work> gnikunj[m], one specific commit from last September
<gnikunj[m]> yeah, that's why. Use a newer version.
<gnikunj[m]> it's most likely due to the CPOs that we added lately (hkaiser will know more)
<diehlpk_work> Octo-Tiger can not use a newer version and it works for Gregor
<gnikunj[m]> not sure then. Btw it's not a CPO issue. parallel_execution_tag was previously in hpx::execution::parallel (which was later deprecated for hpx::execution). It could be due to that as well. Are you sure Gregor uses the same commit version?
<zao> gnikunj[m]: Oh, thought it was a problem with `make`, seems like it's just the CUDA compiler being upset as usual.
<gnikunj[m]> When will we have a cuda compiler that does what it's meant to do :/
<zao> Always chasing the C++ wavefront :)
<gonidelis[m]> hkaiser: gnikunj[m] is the meeting today or tomorrow? i am confused...
<gnikunj[m]> gonidelis[m]: tomorrow
<gonidelis[m]> cool
<hkaiser> gonidelis[m]: what meeting?
<gonidelis[m]> i read gnikunj[m] referencing a meeting today but i only have an email for tomorrow
<hkaiser> gonidelis[m]: ahh, tomorrow is the group meeting, yes
<gonidelis[m]> hkaiser: ok. one more thing. just read your tickets. if we completely disentangle the segmented algos from the parallel ones, does that mean that we completely remove the underscore dispatching calls?
<hkaiser> yes
<hkaiser> replacing those with tag_invoke overloads
<gonidelis[m]> hmm... ok. so in such a case there are absolutely zero dependencies between segmented and parallel algos
<hkaiser> correct
<gonidelis[m]> and no harm is done if i change the result type of the parallel overload for example ;p
<hkaiser> except that the segmented ones rely on the local ones, but not vice versa
<gonidelis[m]> ok ok
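
To make the tag_invoke remark above more concrete, here is a rough, hypothetical sketch of that dispatch scheme; the names and the segmented-iterator trait are made up, this is not HPX's actual code. The algorithm is a customization point object, the local/parallel implementation is one tag_invoke overload, and the segmented implementation is a second, constrained overload living in its own header, so the underscore dispatching helpers are no longer needed and the dependency stays one-directional (segmented calls local, never the reverse):

    #include <type_traits>
    #include <utility>

    namespace sketch {

        // made-up trait: true for "segmented" iterators
        template <typename It>
        struct is_segmented_iterator : std::false_type {};

        // the customization point object
        struct my_for_each_t
        {
            template <typename ExPolicy, typename It, typename F>
            auto operator()(ExPolicy&& policy, It first, It last, F&& f) const
            {
                // picks the best matching tag_invoke overload via ADL
                return tag_invoke(*this, std::forward<ExPolicy>(policy),
                    first, last, std::forward<F>(f));
            }
        };
        inline constexpr my_for_each_t my_for_each{};

        // local/parallel implementation: generic overload
        template <typename ExPolicy, typename It, typename F,
            std::enable_if_t<!is_segmented_iterator<It>::value, int> = 0>
        It tag_invoke(my_for_each_t, ExPolicy&&, It first, It last, F&& f)
        {
            for (; first != last; ++first)
                f(*first);
            return first;
        }

        // segmented implementation: lives in a separate header and is only
        // viable for segmented iterators; it would walk the segments and
        // invoke my_for_each on each local part, so the dependency is
        // strictly segmented -> local
        template <typename ExPolicy, typename It, typename F,
            std::enable_if_t<is_segmented_iterator<It>::value, int> = 0>
        It tag_invoke(my_for_each_t, ExPolicy&&, It first, It /*last*/, F&&)
        {
            // ... per-segment dispatch elided ...
            return first;
        }

    }   // namespace sketch
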
<gonidelis[m]> hkaiser: i saw you merged the two gsoc projects we were talking about
<gonidelis[m]> we mention "This project is different from the project Parallel Algorithms and Ranges" but we don't have a "Parallel Algorithms and Ranges" project any more ;p
<gonidelis[m]> is there any way for me to make suggestions? or do i just edit the thing?
<hkaiser> just edit the thing
<gonidelis[m]> hkaiser: thanks ;)
<hkaiser> gonidelis[m]: thank *you*
<zao> gnikunj[m]: What CUDA version was that with? I'm not having any failures with HPX master, CUDA 11.1.1, GCC 10.2.0, Boost 1.74.0
<gnikunj[m]> I'm using clang
<gnikunj[m]> not gcc
<gnikunj[m]> cuda 11.2
<gnikunj[m]> boost 1.73
<zao> Your cmake output said 1.75 :)
<zao> What Clang too?
<diehlpk_work> Having a thorough and well thought out list of Project Ideas is the most important part of your application.
<diehlpk_work> So please, everyone, help to get our project ideas well organized
<diehlpk_work> shahrzad, hkaiser, ms[m] gnikunj[m] gonidelis[m]
<gonidelis[m]> diehlpk_work: it looks pretty good actually. The algorithm side looks pretty stacked to me. Idk, maybe being more explicit about the prerequisites and how users could work their way up to the minimum level for every project could help. But then the page might get a bit chaotic...
<diehlpk_work> gonidelis[m], Thanks
<zao> gnikunj[m]: /eb/software/Boost/1.75.0-GCC-10.2.0/include/boost/move/detail/type_traits.hpp(884): error: data member initializer is not allowed
<zao> Weeee.
<zao> Had to build CUDA 11.2.1 to get clang 11 support to repro it.
<zao> You're kind of on the bleeding edge of support as 11.1.1 doesn't do that compiler
<zao> CUDA 11.2.1 compiles fine with the same setup but with the underlying GCC 10.2.0
* zao invents sleep