hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2020
bita_ has quit [Ping timeout: 240 seconds]
nan11 has quit [Remote host closed the connection]
parsa has quit [Ping timeout: 246 seconds]
hkaiser has quit [Quit: bye]
bita_ has joined #ste||ar
sayef_ has quit [Ping timeout: 272 seconds]
bita_ has quit [Ping timeout: 260 seconds]
bita_ has joined #ste||ar
bita_ has quit [Ping timeout: 264 seconds]
<heller1> Yorlik: that's something that's physically impossible
aalekhnigam has joined #ste||ar
aalekhnigam has quit [Client Quit]
parsa has joined #ste||ar
parsa has quit [Client Quit]
<ms[m]> weilewei: jbjnr ok, thanks for checking
<ms[m]> if one of you could comment on the pr (https://github.com/STEllAR-GROUP/hpx/pull/4618) once you know if it's actually a problem for weilewei that'd be nice
<ms[m]> I do fear that jbjnr is just getting lucky with his configuration...
<ms[m]> but if it's not needed we don't need to merge that pr
parsa has joined #ste||ar
sayefsakin has joined #ste||ar
LiliumAtratum has joined #ste||ar
<LiliumAtratum> Hello. Stupid question... which header do I need to include to have `hpx::register_thread` in scope? The documentation https://stellar-group.github.io/hpx-docs/latest/html/api.html?highlight=register_thread#_CPPv3N3hpx15register_threadEP7runtimePcR10error_code does not specify it (or it is hard to find). With a manual search through the sources I tend to find more complex versions of that function.
<LiliumAtratum> oh... nvm. The old docs for hpx 1.0 seem to give the exact include. Sorry for troubling you. Still, the current documentation could be more explicit about that ;)
<heller1> LiliumAtratum: indeed. The module API reference has better include information ;)
<heller1> but I guess those runtime specific functions haven't been modularized yet
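A minimal sketch of what the question above is after, assuming the HPX runtime is already running in the process: make an external OS thread known to the runtime via `hpx::register_thread` (the signature from the docs linked above) and undo it with `hpx::unregister_thread`. The include path is an assumption based on pre-modularization naming, since pinning down that header was exactly the problem discussed here.

```cpp
// Hedged sketch, not from the chat: registering an external OS thread with a
// running HPX runtime. The include path below is an assumption.
#include <hpx/include/runtime.hpp>

void work_on_external_thread()
{
    hpx::runtime* rt = hpx::get_runtime_ptr();       // requires a started runtime
    hpx::error_code ec;
    hpx::register_thread(rt, "external-worker", ec); // make this thread known to HPX
    if (!ec)
    {
        // ... the thread may now interact with the HPX runtime ...
        hpx::unregister_thread(rt);                  // detach again before exiting
    }
}
```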
<LiliumAtratum> Since I am here already, I have a more general question. I have several independent top-level tasks for hpx to crunch. I could just launch all those tasks in a loop and wait for all the hpx::futures at the end. However, I expect I will run out of main memory before the hpx thread pool is saturated, and when things start to swap to disk, performance will deteriorate. Is there an idiomatic way to tell the scheduler not to launch too many of those top-level tasks at once?
<LiliumAtratum> Currently I am thinking about just putting a counting semaphore at the beginning of each of those top-level tasks. But maybe there is a better way?
<heller1> LiliumAtratum: there is the limiting_executor which does exactly that
<zao> Yorlik: Nice, isn’t it?
<jbjnr> LiliumAtratum: please note that I have some changes to limiting executor that I have not made into a PR yet, so please feel free to comment on features you need.
<jbjnr> PS. Who is LiliumAtratum ?
<LiliumAtratum> Just another developer from across the Globe who wants to incorporate hpx in their project ;)
<LiliumAtratum> but I am a "noob" user at the moment.
<heller1> we all started as beginners eventually ;)
<LiliumAtratum> alas, `limiting_executor` gives me no hits in hpx 1.4.1 doc
<heller1> nope
<heller1> we are lacking docs...
<jbjnr> I've made improvements to it, but they predate the modularization (files changed location), so it needs to be updated and merged back to master
<jbjnr> basic idea is to use the executor to say: when tasks_in_flight > N, stop launching, and when they fall below M, start launching again. So typically something like N=2000 and M=1000 or so, but if you have very memory-intensive tasks, smaller numbers
<jbjnr> the new version has better waiting/blocking with less intrusive spin overhead, and better blocking on destruction or when waiting until tasks drain
<LiliumAtratum> oh, in my case it will probably be N=4, M=1 ;)
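As a reference point, here is a minimal sketch of the counting-semaphore approach mentioned above. This is not the limiting_executor API (which was unmerged and undocumented at this point); a plain semaphore corresponds to the degenerate case N == M. The `hpx::lcos::local::counting_semaphore` name and the include paths are assumptions based on pre-modularization HPX.

```cpp
// Hedged sketch: throttle top-level task launches with a counting semaphore
// so that at most 'limit' memory-hungry tasks are in flight at once.
#include <hpx/include/async.hpp>
#include <hpx/include/lcos.hpp>                     // hpx::wait_all
#include <hpx/lcos/local/counting_semaphore.hpp>    // assumed pre-modularization path

#include <cstddef>
#include <vector>

void run_all_throttled(std::vector<int> const& jobs, std::size_t limit = 4)
{
    hpx::lcos::local::counting_semaphore sem(limit);    // 'limit' free slots
    std::vector<hpx::future<void>> results;
    results.reserve(jobs.size());

    for (int job : jobs)
    {
        sem.wait();                                  // block launching until a slot frees up
        results.push_back(hpx::async([job, &sem]() {
            // ... one memory-intensive top-level task ...
            (void) job;
            sem.signal();    // free the slot (a real version should signal on errors too)
        }));
    }
    hpx::wait_all(results);
}
```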
<heller1> LiliumAtratum: would be cool if you could share some details about your project! It's always nice to have a little overview of our use cases
<LiliumAtratum> Global point cloud registration. Matching point clouds of the same building obtained from different locations.
<LiliumAtratum> With as little user input as possible. Ideally, fully automatic.
<LiliumAtratum> Running on a single high-end PC. No cluster computing and such. But still, we believe hpx may help.
<heller1> sounds like a cool project!
<heller1> one last question: academia or industry?
<LiliumAtratum> Industry
<heller1> welcome aboard ;)
<LiliumAtratum> the early version is already being used in-house :)
<heller1> nice
<jbjnr> Are you academic or industry? LiliumAtratum - (it is helpful for us to know who our users are)
<LiliumAtratum> I used to work in academia but I work in industry right now. Still, our project requires a good deal of research, so that we know what we are writing in our program :)
mcopik has joined #ste||ar
mcopik has quit [Client Quit]
<heller1> ms[m]: btw, sanitizer PR created
LiliumAtratum has quit [Remote host closed the connection]
<heller1> tag_invoke should be clean as well now...
<ms[m]> heller: thank you!
<heller1> the sanitizer PR needs some more work ...
<heller1> is daint up and running again?
<jbjnr> no. cscs systems locked out due to ongoing cyber attack
karame_ has quit [Remote host closed the connection]
sayefsakin has quit [Read error: Connection reset by peer]
weilewei has quit [Remote host closed the connection]
sayefsakin has joined #ste||ar
LiliumAtratum has joined #ste||ar
sayefsakin has quit [Ping timeout: 260 seconds]
LiliumAtratum has quit [Ping timeout: 245 seconds]
hkaiser has joined #ste||ar
LiliumAtratum has joined #ste||ar
<heller1> ms[m]: #pragma once will take me a while to adapt...
<Yorlik> heller1: You don't believe the build time I posted?
<heller1> Yorlik: very hard to believe! After all, I think with all tests, you have more than 1000 targets. libhpx has around 400
<Yorlik> I just measured again: With an already downloaded hpx: From hitting F7 to "Build all succeeded": 2'09" incl. Tests but not Examples.
<heller1> well, good for you then ;)
<Yorlik> Team Red gave me a nice and warm welcome :D
hkaiser has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
nikunj97 has joined #ste||ar
<ms[m]> heller: you'll get used to it ;)
<ms[m]> Yorlik: I don't believe you either
<heller1> Yorlik: I doubt that you actually build all the HPX tests
<ms[m]> it takes at least 15 minutes for us to build tests and examples on a dual 18 core xeon system (examples don't add that much time)
<ms[m]> I don't think the (pseudo)targets work very well on windows
<Yorlik> What was weird was that there were only around 400 targets when I checked again - I wonder if it was really building the tests
<ms[m]> that's roughly what the core libraries contain
<Yorlik> then it skipped the tests - I'll double check my settings.
<Yorlik> I just do this: -DHPX_WITH_TESTS=ON
<ms[m]> linux or windows? what target did you build?
<Yorlik> Oh crap - found a bug
<Yorlik> there's a second place where it's unsetting the flag ... dammit
<ms[m]> :P
<ms[m]> how nice it would be to build all tests in two minutes...
<Yorlik> measuring again ...
<Yorlik> Still I'm happy with that machine - boost with vcpkg built in ~13 minutes.
<Yorlik> Curious what will come out of this test.
<Yorlik> Do I have to switch on all tests manually? Because it was crazy fast again and only a bit over 400 targets
Nikunj__ has joined #ste||ar
nikunj97 has quit [Ping timeout: 244 seconds]
<hkaiser> ms[m]: pls feel free to use the PMC meeting link for the Kokkos meeting before that
<ms[m]> Yorlik: what target are you building?
<Yorlik> All
<ms[m]> hkaiser: thanks! I think we don't have much to discuss though...
<hkaiser> k
<ms[m]> gdaiss: ^
<Yorlik> But - obviously not - so I made an error somewhere
<Yorlik> Obviously the tests do not get built
<Yorlik> I'm setting -DHPX_WITH_TESTS=ON and all these variables: -DHPX_UTIL_WITH_TESTS=${HPX_ALL_TESTS} with HPX_ALL_TESTS = ON
<ms[m]> Yorlik: they should even be on by default...
<ms[m]> what does ccmake say?
<Yorlik> Let me prepare some output for you ...
<ms[m]> also, the tests aren't part of `all` I think
<ms[m]> make/ninja tests
<Yorlik> ninja doesn't find the tests target
Amy1 has quit [Ping timeout: 260 seconds]
<Yorlik> ms[m]: This is fundamentally my build command: https://gist.github.com/McKillroy/068169c997b4de074b9102843b503419
<Yorlik> All variables are checked
<rori> I think you also have to specify the `HPX_WITH_TESTS_UNIT` `HPX_WITH_TESTS_REGRESSION` for them to be enabled
<Yorlik> rori: I'll add that and try - thanks !
<rori> REGRESSIONS*
<Yorlik> kk
<Yorlik> I added both - still only 406 targets - I'll dig the docs a bit
<ms[m]> Yorlik: all of those should be on by default (also the modules unit tests, HPX_WITH_TESTS guards all of them but you shouldn't need to enable them explicitly)
<heller1> hkaiser: ms[m]: #4305 should be good now!
<ms[m]> try a clean build not setting any of the tests related options
<Yorlik> Allright - I'll do
LiliumAtratum has quit [Remote host closed the connection]
<ms[m]> heller: thanks! let's wait until daint is back up though before merging anything new
<heller1> ok
<heller1> any ETA?
<ms[m]> nothing at the moment :/
<ms[m]> "they're working on it"
<Yorlik> It did not build the tests
<Yorlik> All build directories were deleted - it was a clean build
<ms[m]> windows or linux?
<Yorlik> Windows
nan11 has joined #ste||ar
<Yorlik> ms[m]: I added the cmake output to the gist
<ms[m]> that's why... tests is a pseudotarget and pseudotargets are a bit messed up on windows (meaning they don't work)
<Yorlik> Fix it?
<hkaiser> jbjnr, ms[m]: meeting
<ms[m]> yep
<Yorlik> ms[m] I think it must have worked in between - because I added the test var for a reason - to not build the tests.
LiliumAtratum has joined #ste||ar
<LiliumAtratum> Hello... I am coming with a new question. Any idea why I may be getting this linker error? `hpx.lib(hpx.dll) : error LNK2005: "public: static class hpx::components::detail::wrapper_heap_list<class hpx::components::detail::fixed_wrapper_heap<class hpx::components::managed_component<class hpx::lcos::detail::promise_lco<void,struct hpx::util::unused_type>,struct hpx::components::detail::this_type> > > & __cdecl hpx::components::detail::component_heap_impl<class hpx::components::managed_component<class hpx::lcos::detail::promise_lco<void,struct hpx::util::unused_type>,struct hpx::components::detail::this_type> >::call(void)" (?call@?$component_heap_impl@V?$managed_component@V?$promise_lco@XUunused_type@util@hpx@@@detail@lcos@hpx@@Uthis_type@2components@4@@components@hpx@@@detail@components@hpx@@SAAEAV?$wrapper_heap_list@V?$fixed_wrapper_heap@V?$managed_component@V?$promise_lco@XUunused_type@util@hpx@@@detail@lcos@hpx@@Uthis_type@2components@4@@components@hpx@@@detail@components@hpx@@@234@XZ) already defined in main.cpp.obj`
<LiliumAtratum> I narrowed it down to `hpx::cout << hpx::flush;`. Comment out that line and the linker works. But it feels like a small side effect of me doing something wrong elsewhere.
<hkaiser> LiliumAtratum: I'm in a meeting will get back later
<LiliumAtratum> see you then!
nan1194 has joined #ste||ar
nan1194 has quit [Remote host closed the connection]
nan11 has quit [Ping timeout: 245 seconds]
weilewei has joined #ste||ar
nan111 has joined #ste||ar
karame_ has joined #ste||ar
<Yorlik> ms[m] I have an issue after linking HPX against the vcpkg debug version of Boost. When using Boost from vcpkg the header directory is the same as for release, but the libraries are in the /debug subdirectory. I can pass that as BOOST_ROOT, but then I have to pass the include directory separately as Boost_INCLUDE_DIR. When I do that, everything compiles nicely, but when using this debug HPX in debug mode I get an error that the current BOOST_ROOT is different from the one HPX was compiled with. It looks as if HPX is ignoring my previous setting from its build and just linking against the non-debug version of Boost when I pass the Boost_INCLUDE_DIR variable.
<Yorlik> HPX claims its BOOST_ROOT was the non-debug version
<Yorlik> I wonder if I should always set BOOST_ROOT to the vcpkg ROOT/installed/.../ and set the Boost_Library_DIR instead
nikunj has quit [Read error: Connection reset by peer]
nikunj has joined #ste||ar
rtohid has joined #ste||ar
<hkaiser> Yorlik: just use -DCMAKE_TOOLCHAIN_FILE=.../vcpkg/scripts/buildsystems/vcpkg.cmake
<hkaiser> it will detect everything properly
<Yorlik> OK
<hkaiser> no need for BOOST_ROOT alltogether
<Yorlik> I'm reconfiguring our build system at the moment to optionally use vcpkg or our home-brewed builds
<Yorlik> Some of the vcpkg builds do not work for several reasons, e.g. Lua
<Yorlik> they do not build Lua as C++ by default, for example
nikunj has quit [Ping timeout: 258 seconds]
nikunj has joined #ste||ar
<Yorlik> hkaiser: fixed
<Yorlik> Thanks!
LiliumAtratum has quit [Ping timeout: 245 seconds]
<heller1> LiliumAtratum: you need to link against the iostreams_component library
<Yorlik> hkaiser: YT?
<Yorlik> I am running my app on 12 cores, 12 threads (.hpx.ini) now and am getting these affinity masks. Is this supposed to be like that?
<Yorlik> worker-thread#00 0000000000000001
<Yorlik> worker-thread#01 0000000000000100
<Yorlik> worker-thread#04 0000000100000000
<Yorlik> worker-thread#02 0000000000010000
<Yorlik> worker-thread#03 0000000001000000
<Yorlik> worker-thread#05 0000010000000000
<Yorlik> worker-thread#06 0001000000000000
<Yorlik> worker-thread#07 0100000000000000
<Yorlik> worker-thread#08 0000000000000000
<Yorlik> worker-thread#09 0000000000000000
<Yorlik> worker-thread#10 0000000000000000
<Yorlik> worker-thread#11 0000000000000000
LiliumAtratum has joined #ste||ar
<zao> Are you printing those right?
<zao> Yorlik: HPX will by default go onto 12 of my 24 hyperthreads, I assume it's every other one of them.
<Yorlik> I just took that from the VS thread view when stopping the debugger
<hkaiser> Yorlik: do a --hpx:print-bind
<Yorlik> OK
<zao> (lstopo)
<zao> I wonder why that says 42GB, considering that I've got 64 GiB of mem in this box.
<Yorlik> the printout looks good
<Yorlik> exact 1:1 assignment
<hkaiser> Yorlik: so what's the problem?
<Yorlik> Maybe the msvc output is garbage
<Yorlik> It gave weird affinity masks for the threads
<zao> Yorlik: Seems like a VS bug that it truncates the affinity mask like that.
<Yorlik> Yes, probably.
<zao> "no-one would ever have more than 16 hardware threads, surely"
<zao> I wonder if they've actually mixed up the formatting when printing the DWORD_PTR value, confusing nibbles and bits.
<Yorlik> lol :D
<LiliumAtratum> haha
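For anyone chasing the same thing: printing the mask yourself sidesteps the debugger's rendering entirely. A hedged, self-contained illustration follows; the bit positions below are made up for the example, and HPX's own --hpx:print-bind output, used above, remains the authoritative check.

```cpp
// Hedged sketch: render a 64-bit affinity mask one character per PU, so that
// bits beyond position 15 stay visible, unlike a truncated debugger view.
#include <bitset>
#include <cstdint>
#include <iostream>

void print_mask(char const* name, std::uint64_t mask)
{
    // bit 0 (PU #0) is printed rightmost
    std::cout << name << " " << std::bitset<64>(mask) << '\n';
}

int main()
{
    print_mask("worker-thread#00", std::uint64_t(1) << 0);   // e.g. bound to PU 0
    print_mask("worker-thread#01", std::uint64_t(1) << 2);   // e.g. every other hyperthread
    print_mask("worker-thread#11", std::uint64_t(1) << 22);  // lost in a 16-position view
}
```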
<weilewei> the nvidia A100 released today has an NVLink bandwidth of 600 GB/s...
rtohid has quit [Remote host closed the connection]
<weilewei> The one on Summit was 50 GB/s? looks like a toy now...
<zao> Most exciting for us is that CUDA 11 will support GCC 9.x, Clang 9, and Intel 19.1
<zao> (us as in my site, not us as in ste||ar)
bita_ has joined #ste||ar
<weilewei> even Arm server support in CUDA 11?
<weilewei> @zao where are you located now?
<zao> HPC2N, Sweden.
<zao> EasyBuild's 2020a toolchains have been a bit blocked by CUDA not supporting those compilers.
weilewei has quit [Remote host closed the connection]
rtohid has joined #ste||ar
<LiliumAtratum> will CUDA11 be C++17?
weilewei has joined #ste||ar
<weilewei> seems like a good place
<hkaiser> is nvcc a c++ compiler in the first place?
<weilewei> oops, got kicked out...
<zao> This claims that it is C++17.
<hkaiser> zao: we'll see - so far it was mighty braindead
<LiliumAtratum> @zao This is indeed good news for me :)
karame_ has quit [Remote host closed the connection]
<heller1> nvcc is barely a compiler, more like a preprocessor
<LiliumAtratum> hkaiser Formally nvcc itself is not a compiler, but a toolkit that delegates the various compilation tasks elsewhere. For example, CPU code is handed off to another C++ compiler (clang, msvc, etc...) while device code is processed by something underneath, provided with the CUDA toolkit.
<LiliumAtratum> and that "something underneath for device code" was stuck at C++14 for quite some time...
<LiliumAtratum> I am glad to hear they move it forward
<heller1> Transpiler!
<zao> When in doubt, blame/praise wash :)
<hkaiser> heller1: it will still produce that C gobbledygook
<LiliumAtratum> heller1 I was sceptical about how linking against something could help resolve a symbol that is already defined too many times. However, with your hint, I properly included `HPX::iostreams_component` in every lib where I need it and the problem is gone. Thank you!
<heller1> Yeah...
nan111 has quit [Remote host closed the connection]
<heller1> Oh, I only read hpx::cout and linker error... Completely overlooked the 'already defined' part
<heller1> Do you happen to have a small reproducible example for that?
<LiliumAtratum> No, I have a humongous app for that ;)
<LiliumAtratum> split into a few libs, using cmake and many dependencies
<heller1> That doesn't help :p
<LiliumAtratum> anyway, the problem is gone now, since I properly set up the linking
<heller1> Did you have the iostreams_component as a target_link_library of at least one of your libs?
<LiliumAtratum> I had it in only one. Now I have it everywhere I use it, in the PRIVATE section
karame_ has joined #ste||ar
<heller1> Interesting, this might have messed it up
<heller1> I'm not that familiar with the msvc linker to be of any real help here though
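For readers hitting the same hpx::cout link errors: a minimal, hedged example of the usage discussed above. The include and the HPX::iostreams_component CMake target are real HPX names; the CMake line in the comment mirrors what was reported above to fix the problem.

```cpp
// Minimal hpx::cout usage. Every library/executable whose sources use
// hpx::cout should link the iostreams component, e.g. (as reported above)
//   target_link_libraries(my_lib PRIVATE HPX::iostreams_component)
// Linking it inconsistently across libs was what produced the LNK2005
// duplicate-definition error pasted earlier.
#include <hpx/hpx_main.hpp>
#include <hpx/include/iostreams.hpp>

int main()
{
    hpx::cout << "hello from an HPX locality\n" << hpx::flush;
    return 0;
}
```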
rtohid has quit [Remote host closed the connection]
LiliumAtratum has quit [Remote host closed the connection]
rtohid has joined #ste||ar
<zao> I just realized that if I put HPX in a VR application any scheduler hiccups could cause the user to throw up. This sounds like a noble goal.
<Yorlik> heller1: I'm currently doing measurements of frametime versus object count and cores used - gotta see how the USL charts are going to look when it's done :)
<heller1> Looking forward to seeing the results!
<heller1> zao: sounds like a good goal
weilewei has quit [Remote host closed the connection]
weilewei has joined #ste||ar
nan111 has joined #ste||ar
<diehlpk_work_> hkaiser, The rotating star works in Debug with and without one GPU
<diehlpk_work_> At least for four time steps
zao[m] has joined #ste||ar
<zao[m]> Look ma, I'm using Matrix.
<zao> Almost as cool as you other people now :D
hkaiser has quit [Ping timeout: 240 seconds]
hkaiser has joined #ste||ar
rtohid has left #ste||ar [#ste||ar]
hkaiser has quit [Ping timeout: 260 seconds]