hkaiser changed the topic of #ste||ar to: The topic is 'STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
adi_ahj has joined #ste||ar
adi_ahj has quit [Client Quit]
hkaiser has quit [Quit: bye]
mdiers_ has quit [Remote host closed the connection]
heller has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
heller has joined #ste||ar
hkaiser has joined #ste||ar
<simbergm> jbjnr_: hkaiser: is the talk still on today (joost is wondering)?
<jbjnr_> going ahead as planned
<hkaiser> simbergm: yes, I'm planning to give the talk as planned
<simbergm> very good, thanks
nikunj has joined #ste||ar
<nikunj> diehlpk_work, yt?
<hkaiser> nikunj: so I assume you did arrive safely
<nikunj> hkaiser, yes :)
<nikunj> I registered for the semester today
<hkaiser> good
<nikunj> settled down in my dorm here
<hkaiser> just don't go on strike with the rest of your lot ;-)
<nikunj> haha, point noted ;)
<zao> nikunj: Welcome to wherever you are now! :)
<nikunj> zao, xD
<nikunj> I'm back in India after my internship at LSU
<zao> So what's coming up now, doctoral studies or something else? I don't think I ever knew what level you were at.
<nikunj> zao, I'm still in my undergrad
<nikunj> I'm a junior at IIT Roorkee studying computer science and eng
<heller> staying with HPX then?
<nikunj> heller, absolutely!
<heller> cool
adi_ahj has joined #ste||ar
<hkaiser> heller: I'd liek to talk to you about the execution_agent stuff at some point
<hkaiser> I think it's not quite right, but I might not understand all of the details
<heller> hkaiser: ok
<heller> hkaiser: tonight might be good for me
<hkaiser> ok, I should have some time this afternoon - should we play it by ear?
<heller> sure
<hkaiser> would 2pm/21:00 sound like a viable target time for you?
<heller> yup!
<hkaiser> ok
<hkaiser> thanks
<heller> hkaiser: I am on and off trying to integrate p0443r11 properly into the execution agent stuff ... not using C++17 features is making it incredibly hard :/
<hkaiser> heller: sure, I know
<hkaiser> you will have to suffer
<heller> *a lot*
<hkaiser> poor guy ;-)
jbjnr_ has quit [Read error: Connection reset by peer]
jbjnr_ has joined #ste||ar
<hkaiser> heller: all of spirit was written using c++98, so this should be possible as well using c++14
<heller> hkaiser: I feel like we are missing out on a few opportunities by not going bleeding edge
<heller> it seems like the really start to shine once combining them with coroutines, for example
<hkaiser> absolutely - but I'd suggest to do that only in step two
<hkaiser> anyways, gtg
hkaiser has quit [Quit: bye]
nikunj has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
adi_ahj has left #ste||ar [#ste||ar]
hkaiser has quit [Ping timeout: 260 seconds]
adi_ahj has joined #ste||ar
adi_ahj has quit [Ping timeout: 268 seconds]
adi_ahj_ has joined #ste||ar
hkaiser has joined #ste||ar
<diehlpk_work> simbergm, Should I start a Fedora build before we release HPX 1.4?
<simbergm> diehlpk_work: yes please
<diehlpk_work> Ok, which branch?
<simbergm> release
<diehlpk_work> Ok, I will start it today, so we have the results by tomorrow morning
<simbergm> Thanks
<hkaiser> simbergm: hey yt?
adi_ahj_ has quit [Quit: adi_ahj_]
weilewei has joined #ste||ar
adi_ahj has joined #ste||ar
adi_ahj has quit [Ping timeout: 265 seconds]
adi_ahj has joined #ste||ar
nikunj has joined #ste||ar
adi_ahj has quit [Ping timeout: 265 seconds]
hkaiser has quit [Quit: bye]
adi_ahj has joined #ste||ar
adi_ahj has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
nikunj has quit [Ping timeout: 240 seconds]
<heller> hkaiser: I am ready from now on
<hkaiser> heller: we can do it now
<hkaiser> appear.in?
<heller> sure
nikunj has joined #ste||ar
<hkaiser> how is it called nowadays?
<weilewei> For the hpx module recently available on Summit, in its .../include/hpx/, it does not have collectives.hpp, which is required by hpx 1d stencil 8 here: https://github.com/STEllAR-GROUP/hpx/blob/3c8cada91cd6532c2c07fdc3dd72600c38f8ef4d/examples/1d_stencil/1d_stencil_8.cpp#L14. This example cannot be built.
<weilewei> For this hpx module, it passes -DHPX_WITH_EXAMPLES=OFF, so no examples folder is generated. But I am trying to build stencil 8 on my own to test newly existing hpx module on Summit.
<weilewei> What should I do?
<weilewei> or more specifically, why collectives.hpp is not generated with hpx module on Summit?
<hkaiser> because it's not part of V1.3
<weilewei> ah, I see... then I need to find the stencil 8 code that is V1.3
<hkaiser> right
<simbergm> hkaiser: here now, kind of
<hkaiser> simbergm: sec
nikunj has quit [Ping timeout: 268 seconds]
<hkaiser> simbergm: we ran into the problem of how to test the hpx module on summit
<hkaiser> is there a way to build the tests against an installed version of hpx?
<simbergm> hkaiser: uh, no, don't think so... :/
<simbergm> to do it cleanly would maybe require having the tests separate from the the rest
<simbergm> but better would be if whatever module system had some sort of step to verify the build
<simbergm> sanity check
<simbergm> *whatever module system is used on summit
<hkaiser> right
<hkaiser> weilewei was running into this
<weilewei> spack is used simbergm
<simbergm> ah yes, you already mentioned this
<simbergm> maybe zao would know...? :) do any of the typically used build/module systems have some sort of way to specify a sanity check for a build?
<weilewei> spack cannot stage/stop the installation at the build stage
<simbergm> I don't know, sorry
<simbergm> one can always put something together manually with scripts but then it won't be 100% the same as the spack build
<simbergm> and we could of course hack something together in our cmake, but I'm not sure it's worth the effort or our responsibility
<weilewei> yea, no, it won't be the same
<simbergm> I get that you want to verify the build though, and if there's no better way...
<simbergm> zao is probably enjoying his evening now, but I'd try to ping him when he's around
<simbergm> he should be able to tell you for sure if something like that exists already
<weilewei> terminate called after throwing an instance of 'std::runtime_error' what(): partitioner::partioner: Currently, HPX_HAVE_MAX_CPU_COUNT is set to 64 while your system has 128 processing units. Please reconfigure HPX with -DHPX_WITH_MAX_CPU_COUNT=128 (or higher) to increase the maximal CPU count supported by HPX.Aborted (core dumped)
<weilewei> Aha, I need to tell Summit manager to set HPX_WITH_MAX_CPU_COUNT to 256 or something
<simbergm> yay, a useful error message :) unless your system doesn't actually have 128 cores
<weilewei> simbergm thanks for following it up
<simbergm> yeah, 128 should be enough
<zao> simbergm: EasyBuild has a sanity check step, which checks for a recipe-specified set of directories and files, tries loading any Python modules, and can run `make check` or whatever.
<simbergm> zao: know of anything similar in spack?
<zao> No idea how they build things.
<weilewei> zao how about spack?
<weilewei> @zao ok..
<simbergm> no worries, thanks
<zao> One thing that works more after-installation is ReFrame from CSCS (Victor and friends), which is intended to be more for perf and integration testing against clusters.
<zao> On my site we've looked at running reframe to verify that our installs actually work with proper test cases, but it's been slow.
<zao> (slow in deployment from our side, not the tool itself)
<simbergm> do you then use custom test cases, not the usual unit test suite from a library/application?
<hkaiser> weilewei: what about creating a new CMakeLists.txt that does a find_package(HPX) and a add_sub_directory(tests)
<hkaiser> perhaps also finding Pythong etc
<heller> and enabling tests and so on
<simbergm> also needs the unit tests from modules
<simbergm> it's doable, but ugly as heck
<heller> shouldn't that be handled by the depending tatrgets?
<weilewei> hkaiser yea, that's what I am doing as well. Currently, I found out HPX_WITH_MAX_CPU_COUNT is not set correctly on Summit. maybe more errors are on the way
<hkaiser> you might want to talk to Rod as he is maintaingg the HPX Spack package for FLeCSI
<simbergm> it might be enough to run a carefully selected set of examples
<weilewei> hkaiser Ok, will do
<simbergm> heller: modules unit tests? they link correctly but the modules unit test directories need to be added as well
<heller> ahh, ok
<zao> Reframe would have separate (typically user-sourced) actual datasets with known performance properties to run with.
<zao> If I had to have wishlists from an EasyBuild perspective, I'd like a in-tree test suite that runs in acceptable time when building and detects problems like "not building with MPI right", "missing CUDA env", etc.
<zao> Exhaustive test suites that test for every problem in HPX since the dawn of time is cool and all, but if the build takes an hour with the test suite - it sucks.
<zao> The kind of stuff one cares about in a release is a bit different from what you might want while developing, after all.
<heller> tests.examples?
<simbergm> Yah, then I think running a custom set of examples and checking their output is probably the best option
<simbergm> We could also install some additional unit tests for that purpose (or have an option to install all of them)
<heller> without polluting the bin directory
<heller> please ;)
<simbergm> Yeah sure, it wouldn't be a default option either
Vir has joined #ste||ar
weilewei has quit [Remote host closed the connection]
weilewei has joined #ste||ar
jbjnr_ has quit [Read error: Connection reset by peer]
jbjnr_ has joined #ste||ar