#ste||ar on 2019-10-21 — irc logs at irclog.cct.lsu.edu

2019-06-17 20:46 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoD: https://developers.google.com/season-of-docs/

00:54 Coldblackice_ has joined #ste||ar

00:55 Coldblackice has quit [Ping timeout: 240 seconds]

00:57 jaafar has quit [Quit: Konversation terminated!]

00:59 jaafar has joined #ste||ar

01:08 Coldblackice_ is now known as Coldblackice

01:29 K-ballo has quit [Quit: K-ballo]

02:07 hkaiser has quit [Ping timeout: 250 seconds]

02:10 Coldblackice has quit [Ping timeout: 250 seconds]

02:14 Coldblackice has joined #ste||ar

02:23 hkaiser has joined #ste||ar

02:40 hkaiser has quit [Ping timeout: 250 seconds]

03:32 Amy has joined #ste||ar

03:32 Amy is now known as Guest70891

03:36 Guest70891 has quit [Ping timeout: 265 seconds]

03:37 Guest70891 has joined #ste||ar

05:22 Guest70891 has quit [Ping timeout: 250 seconds]

05:22 Guest70891 has joined #ste||ar

05:33 jbjnr_ has joined #ste||ar

06:29 mdiers_ has joined #ste||ar

06:36 Guest70891 has quit [Ping timeout: 268 seconds]

06:37 Guest70891 has joined #ste||ar

06:55 Guest70891 has quit [Ping timeout: 268 seconds]

06:55 mdiers_1 has joined #ste||ar

06:55 mdiers_1 has quit [Read error: Connection reset by peer]

06:56 Guest70891 has joined #ste||ar

06:56 mdiers_ has quit [Ping timeout: 276 seconds]

07:10 Guest70891 has quit [Ping timeout: 265 seconds]

07:10 Guest70891 has joined #ste||ar

07:22 mdiers_ has joined #ste||ar

07:26 Guest70891 has quit [Ping timeout: 268 seconds]

07:26 Guest70891 has joined #ste||ar

08:30 <jbjnr> heller: if you have a spare moment, could yuo put your task_data future backing for mpi into a gist please. I'd like to possible replace my use of promise with that if it has any advantages. (Either way, I'm curious to see what you did because I can't remember now).

08:35 <jbjnr_> \quit

08:35 jbjnr_ has quit [Quit: WeeChat 2.6]

08:44 Coldblackice_ has joined #ste||ar

08:47 Coldblackice has quit [Ping timeout: 250 seconds]

08:48 <heller> jbjnr: https://github.com/STEllAR-GROUP/hpx/blob/execution_context_post/libs/basic_execution/examples/ping_pong.cpp#L31-L72

08:49 <jbjnr> Thanks

08:51 <jbjnr> aha. that line "return future_access<hpx::future<void>>::create(std::move(data));" that's what I needed. thank you very much.

08:51 Guest70891 has quit [Ping timeout: 276 seconds]

08:51 Guest70891 has joined #ste||ar

08:53 rori has joined #ste||ar

08:54 <heller> simbergm: is merging PRs blocked now?

08:55 <simbergm> heller: no, why?

08:56 <heller> since the daint builders are messed up

08:57 <simbergm> ah, well, wait until the evening

08:57 <simbergm> we can also launch a few builds on rostam if you want something merged quickly

08:59 <heller> I am in no hurry ;)

09:02 <heller> simbergm: what was the general reaction on my talk in zuerich last friday?

09:04 <heller> and also, what do you think?

09:07 <simbergm> heller: good, I'd rather wait than break master

09:07 <simbergm> some were concerned about your virtual function calls :P but I said that wasn't really the main point of your talk, the implementation can still change

09:07 <simbergm> in general they really like the direction

09:08 <simbergm> I like it too

09:08 <simbergm> I'm a bit concerned about how it'll all come together with our "standard compliance" considering things are so much in flux but that's a risk we can't really avoid

09:21 Guest70891 has quit [Ping timeout: 268 seconds]

09:21 Guest70891 has joined #ste||ar

09:24 <heller> simbergm: the standard compliance storry won't change

09:24 <heller> simbergm: we are in the realm of standard extensions there. And everything will be in line with P0443

09:25 <heller> simbergm: sure, virtual functions, you won't have those in your hot path, hopefully ;)

09:25 <heller> and if you do, hope that the branch predictor works

09:26 <heller> simbergm: the goal now is to have the first standards compliance implementation of P0443 with the goal of allowing those user defined extensions

09:26 <heller> and eventually get rid of the fragmented landscape that you already seem to have at cscs ;0

09:26 <heller> ;)

09:27 weilewei has quit [Remote host closed the connection]

09:33 <simbergm> heller: sounds good :) note that my concern mainly comes from not knowing well enough how much concensus there is in the committee on these things, but if it's all in line with what's going in that's great

09:35 <simbergm> heller, jbjnr what do you think about #4142... should I go ahead and rename the old thread_pool_executor to embedded_thread_pool_executor? they are in principle in our public api, but they'd usually be used through the <scheduler_name>_executor typedef

09:36 <jbjnr> simbergm: I do not believe anyone is using the mebedded schedulers and if they are, they must be an advanced user and will know what to change once they discover their code is broken

09:36 <heller> simbergm: I'd say we need to clean up the executors significantly

09:36 <jbjnr> +++++11111

09:36 <heller> and that's just dealing with the symptoms...

09:36 <simbergm> I'd agree

09:36 <heller> so i really don't care that much right now ;)

09:37 <simbergm> most of the executors are using the very old interface with add/add_after etc.

09:37 <simbergm> deprecating them in the next release might be a good idea

09:37 <simbergm> so that we can finally get rid of them

09:37 <jbjnr> feel free to remove it all

09:37 <jbjnr> :)

09:39 <jbjnr> do we have a simple lockfree list in hpx?

09:39 <jbjnr> not a dequeue or a queue or a stack, but just a list?

10:02 <heller> jbjnr: no, we don't have anything like that

10:02 <heller> http://concurrencykit.org/

10:54 <simbergm> jbjnr: what does a stack or queue do that you don't want from a list?

10:54 <jbjnr> a list should allow removal of items from the middle

10:55 <jbjnr> it's not a big deal, just to make the code a bit nicer

10:57 K-ballo has joined #ste||ar

10:57 <simbergm> ok, i see

11:02 <simbergm> heller: gcc-newest is okay again #4125, must've been a temporary filesystem problem or something

11:02 <simbergm> I think we can merge it now

11:12 <heller> go ahead ;)

11:37 hkaiser has joined #ste||ar

11:40 tianyi93 has quit [Ping timeout: 245 seconds]

12:07 hkaiser has quit [Ping timeout: 250 seconds]

12:29 <simbergm> hkaiser, heller: I'm having problems with the launch_process_test on #4091

12:29 <simbergm> it never returns from here: https://github.com/STEllAR-GROUP/hpx/blob/85ddb85c800c1c32bebdc257298538f21514fb27/tests/unit/component/launch_process.cpp#L106-L114

12:29 <simbergm> do you have any educated guesses what I might've broken to make that happen?

12:29 <simbergm> I think all the other tests pass now

12:30 <simbergm> does that test use anything that's not used elsewhere?

12:30 <heller> simbergm: this one should only work with a distributed runtime

12:30 <heller> simbergm: it also requires that the spawning runtime accepts incoming connections

12:31 <simbergm> heller: yeah, this is all with the distributed one

12:32 hkaiser has joined #ste||ar

12:34 <heller> simbergm: so i'd check if connecting localities is still working

12:35 <heller> simbergm: https://github.com/STEllAR-GROUP/hpx/tree/master/examples/pipeline <-- check this for example

12:38 <simbergm> heller: thanks for the hints, that might be it

12:42 <hkaiser> guys, what is the default -std=c++ mode for clang gcc if nothing is specified?

12:43 <zao> Depends on version.

12:43 <hkaiser> we're seeing a significant performance difference between no mode and c++17 explicitly specified

12:43 <hkaiser> need to look at what version, actually

12:44 <hkaiser> c++17 is slower

12:47 <zao> GCC 7 and 8 defaults to gnu++14, according to documentation.

12:48 <zao> As do 6.

12:48 <zao> (also 9)

12:48 <hkaiser> ok, thanks

12:50 <zao> Harder to tell with Clang, but I see news that Clang 6 changed their default to C++14.

12:51 <heller> which gcc version?

13:16 <hkaiser> heller: will check, need to ask

13:16 <hkaiser> I _think_ it's clang 7

13:21 Guest70891 has quit [Ping timeout: 268 seconds]

13:22 Guest70891 has joined #ste||ar

13:27 heller has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

13:29 heller has joined #ste||ar

13:36 hkaiser has quit [Quit: bye]

13:46 weilewei has joined #ste||ar

14:01 diehlpk_work has joined #ste||ar

14:07 hkaiser has joined #ste||ar

14:41 <hkaiser> heller: clang 8.0

14:41 <heller> hmmm

14:46 <jaafar> dataflow() question:

14:47 <jaafar> The launch policy argument tells how to run the continuation relative to the last supplied input

14:48 <jaafar> what if I want to be launched differently depending on which input it is? Is there some way to accomplish this?

14:48 <jaafar> e.g. I have two dependencies A and B

14:49 <jaafar> If A appears last I want to be run synchronously, so the continuation executes on the same core and the cache is warm

14:49 <jaafar> If B, async is fine

14:50 aserio has joined #ste||ar

14:51 <jaafar> dataflow(launch::sync, f, A, B) runs synchronously regardless of whether A or B arrives last, and similar if it were launch::async

14:51 <jaafar> Would be nice to be able to distinguish

14:56 <hkaiser> jaafar: so you want to use sunc if A arrives first, but async if B arrives first?

14:56 <hkaiser> sync*

14:57 <hkaiser> there is no way to distinguish that, atm

15:05 <jaafar> hkaiser: I would say "last" instead of "first" but yes

15:05 <jaafar> the application here is that I know that the supplier of A will have other data I need warm in the cache

15:06 <jaafar> so very helpful for it to simply start executing my continuation

15:06 <jbjnr> I could almost do it with my guided executor, but it'd need some adjustments I think

15:07 <jbjnr> jaafar: if you had a simple test case that demonstrated a gain from cache re-use - I'd take a look at it.

15:10 <jaafar> OK thanks jbjnr I will see what I can do

15:10 <jbjnr> I was pondering the semantics of how to do this just recently (not quite the same, cos it wasn't a dataflow example)

15:11 <jbjnr> the guided executor delays the scheduling of continuations until the input tasks have completed, so it is ideal for this, but I wanted a policy like launch::cache or something

15:12 <jbjnr> but ideally it would be an 'if you can' rather than 'you must' because if we interfere with scheduling too much then we suffer from losing stealing etc

15:12 <jaafar> yep

15:12 <jbjnr> there should be a cost function ideally

15:13 <jaafar> exclusive scan shows strong signs of being dominated by cache misses

15:13 <jaafar> and it makes two passes

15:13 <jaafar> a given chunk is visited twice

15:14 <jaafar> My hypothesis is we'd see a speedup if the two passes were close in time and on the same core

15:14 <jaafar> for a given chunk

15:14 <jbjnr> ok, that ought to be quite testable

15:15 <jbjnr> (naturally I have my own implementation of scan)

15:15 <jbjnr> :)

15:18 <weilewei> jbjnr I am rebasing your branch of DCA and get this build error. Do you have any suggestion? https://gist.github.com/weilewei/a9ab13454eaa96db1d4e42d3fe25514d

15:19 <jbjnr> weilewei: looks like some cruft introduced by the new modularization - simbergm might know the right fix, but it looks as though a dependency is missing. is there a module with a name close to agas/logging that can be added

15:20 <jbjnr> bbiab

15:22 <weilewei> jbjnr not sure about what module should be added, but ideally, the mpi_concurrency_test is independant from hpx, not sure why hpx is involved

15:23 <weilewei> HPX_WITH_LOGGING:BOOL?

15:25 <jbjnr> weilewei: it wants to link in hpx::util::logging - if it doesn't use HPX (I've forgotten that test, so I need to look more closely) - then why is HPX being included at all - check the includes and the cmake too

15:26 <jbjnr> hkaiser or aserio : my calendar just popped up a reminder about an OB meeting, is it happening or a cancelled one

15:27 <jbjnr> weilewei: if I recall right - the DCA++hpx stuff sets a #define to override the threading so that the tests run on hpx threads without the user doing anything - this is going to trigger the pulling in of everything

15:28 mbremer has joined #ste||ar

15:28 <jbjnr> but probably we now need to add a dependency to the logging stuff. HPX_WITH_LOGGING OFF might be worth a try

15:28 <weilewei> maybe here: https://github.com/weilewei/DCA/blob/9ea6849f2110a011f8e58d11ec77c6c7c04d2205/include/dca/config/hpx_defines.hpp.in#L31

15:29 <weilewei> you use a macro DCA_LOG to log things in mpi concurrency tests

15:29 <jbjnr> weilewei: no. that's just a custom debugging dump that I added, it's safe

15:29 <jbjnr> it just used std::cout to dump messages

15:29 <jbjnr> ^uses

15:29 <jbjnr> doesn't depend on agas::logging

15:30 <weilewei> jbjnr ok, I see, I will try build hpx with HPX_WITH_LOGGING OFF

15:30 <jbjnr> jaafar: the latest work on my scheduler has two thread scheduling options one is round_robin the other is thread_parent

15:30 <jbjnr> when thread_parent is used a new thread is scheduled on the same core the parent thread ran on

15:30 <aserio> jbjnr: We switched these meetings to be bi-weekly

15:31 <aserio> jbjnr: the next meeting is scheduled for October 28th

15:31 <jbjnr> but it is done at the executor level rather than the task level, so a schedulehint needs to be generated in the dataflow continuation

15:31 <jbjnr> aserio: thanks

15:31 <jbjnr> will add to calendar

15:32 <jbjnr> weilewei: when is next dca video meeting

15:32 <jbjnr> ?

15:32 <weilewei> jbjnr next Monday 10/28, 11:00 AM Eastern Time

15:32 <jbjnr> hmmm. I'm away on 28th actually

15:33 <jbjnr> ^^arrghh. both meetings!

15:34 <jbjnr> weilewei: when you finish today, push what you've done to a new weilewei-hpx branch and tomorrow I'll try building dca with hpx from it and see if I can fix anything you haven't fnished

15:34 <weilewei> jbjnr Thanks! I will let you know once I finish and push all the changes

15:34 <jbjnr> ok. but send me an email, cos once I get home, I can't log into this machine and see mesages any more (must fix that)

15:35 <weilewei> jbjnr sure, I found your cscs email on my email, so I will send you an email about everything

15:36 <jbjnr> ta

15:36 <weilewei> welcome :-)

15:37 <zao> You do have the logs in the channel topic, when really desperate?

15:38 <jbjnr> I know, but I never remember any more to check them. we use slack for everything here and I forget IRC exists most of the time now

15:52 Guest70891 has quit [Ping timeout: 265 seconds]

15:53 Guest70891 has joined #ste||ar

15:55 aserio has quit [Quit: aserio]

16:03 <weilewei> jbjnr after applying HPX_WITH_LOGGING OFF, I did not see the logging error but got a linking error

16:04 <weilewei> [ 55%] Linking CXX executable mpi_concurrency_testCMakeFiles/mpi_concurrency_test.dir/mpi_concurrency_test.cpp.o:(.data+0x0): undefined reference to hpx::hpx_check_version_1u_4u' CMakeFiles/mpi_concurrency_test.dir/mpi_concurrency_test.cpp.o:(.data+0x8): undefined reference to hpx::hpx_check_boost_version_106800'

16:04 <weilewei> https://gist.github.com/weilewei/a9ab13454eaa96db1d4e42d3fe25514d#gistcomment-3061499

16:05 hkaiser has quit [Read error: Connection reset by peer]

16:05 hkaiser has joined #ste||ar

16:06 <hkaiser> weilewei: as said before, while this application is compiled against hpx headers, it is not linked against hpx

16:06 <hkaiser> not sure why it's including hpx in the first place, however

16:07 <weilewei> hkaiser hmm, but you said you can compile successfully on your windows machine?

16:07 <hkaiser> well, apparently I didn't have hpx enabled in my build, sorry

16:08 <weilewei> oh, no worry

16:08 <weilewei> The way I build dca w/ hpx for jbjnr branch is

16:08 <weilewei> cmake -C ../build-aux/summit.cmake -DDCA_WITH_TESTS_FAST=ON -DCMAKE_BUILD_TYPE=Debug -DHPX_DIR=/gpfs/alpine/proj-shared/cph102/weile/dev/install/hpx_hwloc_Debug/lib64/cmake/HPX/ -DDCA_WITH_TESTS_EXTENSIVE=ON -DDCA_WITH_HPX=ON -DDCA_HAVE_HPX=TRUE ..

16:09 <hkaiser> k

16:10 <weilewei> I think I will send a wrap-up email to John as he said he will look into it tmr, will cc you

16:20 bibek has quit [Quit: Konversation terminated!]

16:21 bibek has joined #ste||ar

16:34 bibek has quit [Quit: Konversation terminated!]

16:34 bibek has joined #ste||ar

17:30 nikunj has joined #ste||ar

17:34 rori has quit [Quit: WeeChat 1.9.1]

17:38 nikunj has quit [Quit: Bye]

17:39 nikunj has joined #ste||ar

17:51 bibek has quit [Quit: Konversation terminated!]

17:51 bibek has joined #ste||ar

18:03 bibek has quit [Quit: Konversation terminated!]

18:03 _bibek_ has joined #ste||ar

18:23 nikunj has quit [Remote host closed the connection]

18:25 <jaafar> jbjnr: just noted your comment about having your own scan algorithms :) Do you have a branch I can try?

18:25 * jaafar has a Google Benchmark setup

18:37 _bibek_ has quit [Quit: Konversation terminated!]

18:37 _bibek_ has joined #ste||ar

18:38 _bibek_ has quit [Client Quit]

18:38 _bibek_ has joined #ste||ar

18:43 _bibek_ has quit [Quit: Konversation terminated!]

18:44 _bibek_ has joined #ste||ar

18:48 nikunj has joined #ste||ar

18:59 _bibek_ has quit [Quit: Konversation terminated!]

18:59 _bibek_ has joined #ste||ar

19:01 _bibek_ has quit [Client Quit]

19:01 _bibek_ has joined #ste||ar

19:01 _bibek_ has quit [Client Quit]

19:02 bibek has joined #ste||ar

19:09 bibek has quit [Read error: Connection reset by peer]

19:09 bibek has joined #ste||ar

19:54 bibek has quit [Quit: Konversation terminated!]

19:54 bibek has joined #ste||ar

20:04 bibek has quit [Quit: Konversation terminated!]

20:04 bibek has joined #ste||ar

20:06 bibek has quit [Client Quit]

20:06 bibek has joined #ste||ar

20:19 bibek has quit [Read error: Connection reset by peer]

20:19 bibek has joined #ste||ar

20:19 bibek has quit [Client Quit]

20:19 bibek has joined #ste||ar

20:29 bibek has quit [Quit: Konversation terminated!]

20:30 bibek has joined #ste||ar

20:56 hkaiser has quit [Quit: bye]

21:00 Guest70891 has quit [Read error: Connection reset by peer]

21:01 Guest70891 has joined #ste||ar

21:29 bibek has quit [Quit: Konversation terminated!]

21:29 bibek has joined #ste||ar

21:38 bibek has quit [Quit: Konversation terminated!]

21:39 bibek has joined #ste||ar

21:40 bibek has quit [Client Quit]

21:40 bibek has joined #ste||ar

21:46 Coldblackice has joined #ste||ar

21:48 hkaiser has joined #ste||ar

21:48 Coldblackice_ has quit [Ping timeout: 268 seconds]

22:10 bibek has quit [Read error: Connection reset by peer]

22:12 _bibek_ has joined #ste||ar

22:14 _bibek_ has quit [Read error: Connection reset by peer]

22:14 bibek has joined #ste||ar

22:18 K-ballo has quit [Quit: K-ballo]

22:21 K-ballo has joined #ste||ar

22:31 bibek has quit [Read error: Connection reset by peer]

22:32 bibek has joined #ste||ar

22:44 bibek has quit [Quit: Konversation terminated!]

22:45 bibek has joined #ste||ar

22:55 bibek has quit [Quit: Konversation terminated!]

22:56 bibek has joined #ste||ar

23:11 bibek has quit [Read error: Connection reset by peer]

23:11 _bibek_ has joined #ste||ar

23:12 _bibek_ has quit [Client Quit]

23:12 _bibek_ has joined #ste||ar

23:16 _bibek_ has quit [Client Quit]

23:16 _bibek_ has joined #ste||ar

23:56 mbremer has left #ste||ar [#ste||ar]