hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2020
diehlpk has joined #ste||ar
diehlpk has quit [Ping timeout: 240 seconds]
weilewei has quit [Remote host closed the connection]
shahrzad has quit [Remote host closed the connection]
hkaiser has quit [Quit: bye]
akheir has quit [Quit: Leaving]
bita_ has quit [Ping timeout: 240 seconds]
jbjnr has joined #ste||ar
mdiers[m] has joined #ste||ar
<mdiers[m]> i have a problem with 1.5.0-rc3 on fedora 32. at runtime i get the error: PMIX ERROR: NO-MEM in file client/pmix_client_get.c. maybe a bug in openmpi. how can i change it again that no pmix is used? it's been so long ago ;-(
<ms[m]> jbjnr: you know anything about pmix? ^
<mdiers[m]> how can I switch from mpi to tcp communication via command line argument?
<ms[m]> mdiers: ah, that's a bit more complicated unfortunately (at least if you have mpi implicitly initialized in your environment)
<ms[m]> in principle --hpx:ini=hpx.parcel.tcp.enable=1 --hpx:ini=hpx.parcel.tcp.enable=0
<ms[m]> but that most likely currently doesn't work
<ms[m]> see https://github.com/STEllAR-GROUP/hpx/pull/4924 for a short explanation
<ms[m]> if we get a fix for that soon we'll have a patch release containing the fix
<mdiers[m]> I think it should be ```--hpx:ini=hpx.parcel.tcp.enable=1 --hpx:ini=hpx.parcel.mpi.enable=0```. But unfortunately it does not work as already described. I just suspected our slurm openmpi combination that something does not fit. With mpirun it works.
<rori> diehlpk_work: sure sorry for the trouble :/
<ms[m]> mdiers: yep, sorry, typo... yeah, sounds indeed like something in the environment, I wouldn't know what to do with that
zatumil has joined #ste||ar
hkaiser has joined #ste||ar
<mdiers[m]> <mdiers[m] "i have a problem with 1.5.0-rc3 "> found it, there was a missing name resolution in the container. the error messages are still there but it works fine again.
<kordejong> I am using HPX 1.5.0 icw APEX+OTF2. When visualizing the trace with Vampir, actions show up, which is cool! But I cannot seem to get any custom tasks passed to `dataflow` or as a continuation to show up in the trace. I annotate tasks with `util::annotated_function(my_function, my_name)`. I must be doing something wrong. But what? Any hints maybe?
<kordejong> Never mind. Wrong ordering `unwrapping` and `annotated_function`. Funny how asking questions itself can lead to answers ;-)
<jbjnr> kordejong, you might need to use the annotate_function wrapper as the annotated_function RAII thingy doesn't work any more (I thought we got rid of it)
<jbjnr> note difference between annotate_function and annotated_function - you need the one that wraps the task which is more painful to use, but should work.
<kordejong> jbjnr: Aha, yes, `annotate_function` works also, thanks. Actually, I find it easier to use than `annotated_function`. I just instantiate it at the start of the tasks. Is that how it is supposed to be used? Are there any other considerations about using `annotate_function`, like a noticeable runtime cost?
<ms[m]> Kor de Jong: the way it's implemented right now means that doing an `annotate_function` has to yield the task to set the name
<ms[m]> depending on your use case it may have noticeable overhead
<ms[m]> very relevant for hpx/kokkos, but I haven't looked if there's a program yet
<kordejong> jbjnr: OK thanks, I will then conditionally enable it. It works great BTW.
diehlpk_work has quit [Ping timeout: 244 seconds]
<ms[m]> gdaiss, hkaiser kokkos meeting as usual? joining in a few minutes
<ms[m]> rori, jbjnr as well
diehlpk_work has joined #ste||ar
<diehlpk_work> rori, No worries, we were able to fix things and the paper is released
<diehlpk_work> ms[m], The DOI is available. Can you might add the paper's DOI to the readme and the documentation>
<hkaiser> diehlpk_work: as said, I'll take care of the readme
<rori> diehlpk_work: thanks!
diehlpk_work has quit [Ping timeout: 240 seconds]
diehlpk_work has joined #ste||ar
diehlpk_work has quit [Ping timeout: 240 seconds]
akheir has joined #ste||ar
diehlpk has joined #ste||ar
diehlpk has quit [Changing host]
diehlpk has joined #ste||ar
diehlpk has quit [Remote host closed the connection]
diehlpk has joined #ste||ar
diehlpk has quit [Changing host]
diehlpk has joined #ste||ar
shahrzad has joined #ste||ar
bita_ has joined #ste||ar
diehlpk_work has joined #ste||ar
diehlpk__ has joined #ste||ar
hkaiser has quit [Read error: Connection reset by peer]
diehlpk has quit [Ping timeout: 240 seconds]
hkaiser has joined #ste||ar
diehlpk__ has quit [Read error: Connection reset by peer]
diehlpk_work has quit [Ping timeout: 240 seconds]
diehlpk_work has joined #ste||ar
diehlpk_work has quit [Ping timeout: 240 seconds]
weilewei has joined #ste||ar
diehlpk_work has joined #ste||ar
diehlpk_work has quit [Ping timeout: 244 seconds]
diehlpk_work has joined #ste||ar
diehlpk_work has quit [Ping timeout: 256 seconds]
weilewei has quit [Remote host closed the connection]
diehlpk_work has joined #ste||ar
weilewei has joined #ste||ar
<K-ballo> do I still have a rostam account? what's my username?
<parsa> akheir: see ^
<akheir> K-ballo: I don't think so, send me an email I will create yours in a bit. aliz@cct.lsu.edu
<K-ballo> akheir: sent
<akheir> got it
<K-ballo> "Password change failed. Server message: Old password not accepted." lol
<K-ballo> akheir: actually, I get "old password not accepted" regardless of what I write
<akheir> K-ballo, the password change menu is a bit stupid, it ask for current password twice, let me reset your password
<akheir> K-ballo: see pm pls.
<K-ballo> confusing menu, makes sense now
diehlpk_work has quit [Ping timeout: 240 seconds]
diehlpk_work has joined #ste||ar
diehlpk_work has quit [Ping timeout: 240 seconds]
diehlpk has joined #ste||ar
<K-ballo> "TBB has some of the algorithms implemented, AFAIK not all of them. HPX implements all (except for three of them)"
<zatumil> hey, my installation directory /usr/local/include/hpx/include looks weird, can the second include directory stripped from path? I would like to #include <hpx/async.h>
<K-ballo> zatumil: looks right to me..
<K-ballo> hpx/async.hpp is not the same as hpx/include/async.hpp
<zatumil> oh I didnt know hpx/async.hpp exists too
<K-ballo> which one did you want?
<K-ballo> note there is no async.h anywhere
diehlpk_work has joined #ste||ar
<zatumil> I think async_local/apply.hpp is fine for the hello world example, the #include <hpx/include/async.hpp> was confusing me, I would expect either #include <hpx/distributed/async.hpp> or #include <hpx/local/async.hpp>
<K-ballo> #include <hpx/include/async.hpp> sounds fine for an example
<zatumil> include looks redundant to me
<K-ballo> these are user facing headers who's job is to include a bunch of internal pieces
<zao> The subtree is for friendly user-facing includes, not getting you lost in the sea of implementation things in the full tree.
<K-ballo> you should be picking stuff from hpx/include, unless you really really know what you are doing
<zao> It's a convention of some header-heavy libraries, Spirit has a "home" IIRC.
<zatumil> maybe I can -I /usr/local/hpx/include and do #include <async.hpp> then
<zao> Please don't.
<K-ballo> ouch
<zao> Everything expects that the top-level dir(s) are included so that top-relative paths works.
<ms[m]> zatumil: for everything that you can find on this page, prefer the headers in the headings: https://hpx-docs.stellar-group.org/tags/1.5.0/html/api/public_api.html
<ms[m]> everything else requires a bit of guesswork, but the hpx/include/ headers give you access to all the rest
<ms[m]> that's if you're on 1.5.0...
<zatumil> the document is good, how to make docs/html from git?
<zao> zatumil: There's a build target for it (docs?) assuming you have the tools installed (doxygen, sphinx?)
diehlpk has quit [Ping timeout: 240 seconds]
<zatumil> yes with cmake -DHPX_WITH_DOCUMENTATION=ON
<zatumil> thank you
weilewei has quit [Remote host closed the connection]
jbjnr has quit [Ping timeout: 240 seconds]