<hkaiser>
heller: but why hpx::mpi::invoke and not hpx::mpi::async?
<heller>
hkaiser: async has the connotation of an RPC, at least for me
<hkaiser>
or even hpx::async(hpx::mpi::executor, ...
<heller>
which that one isn't
<heller>
this is just wrapping the async MPI functions to return a future
<hkaiser>
async has nothing in common with RPC
<heller>
it has in our context
<hkaiser>
it could be RDMA as well, or anything else
<hkaiser>
it simply asynchronously does something
<heller>
asynchronous operations: yes
<hkaiser>
no
<hkaiser>
it asynchronously invokes an action
<heller>
std::async and hpx::async launch tasks. hpx::async can launch tasks remotely
<hkaiser>
no, it launches an action which happens to do remote things
<heller>
I wouldn't overload terms here
<heller>
in my book, an action is all about doing RPC
<hkaiser>
it could do RDMA or trigger a local operation
<heller>
well, a procedure does those things, true
<heller>
however, making MPI action-aware would require far more machinery
<heller>
and that's not the point here ... the point is more about how to be able to use futures with thin layers over existing software ecosystems
<hkaiser>
well, sure
<heller>
I think the first step is to show that HPX, as its own software ecosystem, is capable enough to adapt to other existing, or even emerging, ones
<heller>
to not only open up a migration path, but to use its different modules as building blocks without imposing too much
<heller>
does this make sense?
<hkaiser>
sure
<hkaiser>
btw stackfull vs. stackless
<hkaiser>
500000 hpx threads: 2.6s vs. 2.4s
<hkaiser>
as expected, 10% improvement
<heller>
8 :P
<heller>
can you compare the numbers to master as well please?
<hkaiser>
yah, next on my list
<hkaiser>
heller: master is at 2.55s
<heller>
hkaiser: interesting
<hkaiser>
heller: this is obviously no thorough analysis yet
<hkaiser>
just a quick manual run
<heller>
hkaiser: I think the stackless tasks will really shine after streamlining the scheduling loop and task states
<hkaiser>
absolutely
<hkaiser>
needs more work, definitely
<heller>
I think they might even be a perfect fit for a custom execution agent/execution context
<heller>
which should help with that
<hkaiser>
heller: those _are_ a different agent
<heller>
hkaiser: exactly. they almost share nothing with thread_data ;)
<hkaiser>
thread_data ?
<hkaiser>
what do you mean?
<heller>
that your implementation of stackless tasks derives from thread_data, or did I misunderstand something?
<hkaiser>
they do, yes
<heller>
so what I wanted to say, we have thread_data representing one agent, and the stackless ones representing another
<hkaiser>
heller: hmmm, not really
<hkaiser>
thread_data_stackfull is one, thread_data_stackless the other; both are derived from thread_data
<heller>
sure, that's how you implemented it
<heller>
I'd argue however, that they shouldn't share the same base class
<hkaiser>
only implementation sharing, a) didn't want to copy, b) wanted to keep the state management non-virtual
<hkaiser>
I think you're recreating the executor interface here
<heller>
partially, yes
<hkaiser>
post() is an executor API function
<hkaiser>
why, then?
<heller>
still two different things
<hkaiser>
k
<heller>
a context should expose a default executor eventually
<heller>
that is, a context needs to be able to spawn agents
<heller>
the post function is a strawman, didn't come up with a better name
<hkaiser>
sure
<hkaiser>
executors are lightweight wrappers for contexts, so this is probably ok
<heller>
but you should be able to instantiate different executors using the same context
<heller>
right
<heller>
executors are the API to spawn agents on contexts
<hkaiser>
nod
<heller>
closing the circle ... stackless tasks could be represented by their own context
<heller>
spawning their specific agents
<hkaiser>
nobody has said that contexts couldn't spawn various agents
<hkaiser>
and I think there is a use for that
<heller>
hmm
<heller>
having a one-to-one relationship would simplify the API
<heller>
since the context carries the information on what agents to spawn, there's no need for additional parameters to distinguish between the agents to spawn, and we don't impose the requirement for different contexts to support spawning of various agents
<hkaiser>
so even different stack sizes would imply using different contexts?
<heller>
good question
<heller>
so would that rather be a property of the executor then?
<heller>
how do we handle different stack sizes for locally launched tasks right now?
<hkaiser>
it's a parameter to register_thread
<hkaiser>
heller: well, it might be a property of the executor, but even then you have to have a way to pass it on to the context
<heller>
right, but is a different stack size really a different agent?
<heller>
isn't stacksize a different thing than stackless vs. stackfull?
<hkaiser>
same for priorities, scheduling-hints, etc.
<hkaiser>
shrug
<hkaiser>
I could imagine that a user might want to mix stackfull and stackless in the same parallel context/region
<heller>
the problem I have with the stack size is that they essentially leak implementation details
<heller>
stackless/stackfull in parallel regions: yes, probably
<hkaiser>
hmm, it's similar to priorities, no?
<heller>
I don't think so, different priorities for different tasks can stem from algorithmic needs
<heller>
stack sizes are a limitation of the underlying coroutine machinery
<heller>
what do stack sizes mean for stackless tasks?
<hkaiser>
pthreads have an API for stacksizes as well
<heller>
windows fibers don't
<hkaiser>
they do
<heller>
ok
<heller>
std::thread doesn't, at least
<hkaiser>
yah, because it's meant to be platform independent
<hkaiser>
anyways
<heller>
let's sleep over it a few nights...
<hkaiser>
right
<hkaiser>
in any case, your stuff is a game changer
<heller>
let's see how the acceptance is tomorrow
<heller>
potentially a game changer, we just have to carry it through
<heller>
plus, I still have to check how it relates to the whole new future API that Eric, Bryce, and David are cooking up
<hkaiser>
their stuff is lower level
<heller>
even orthogonal to this
<hkaiser>
right
<hkaiser>
it's an infrastructure to implement things like futures considering heterogeneous contexts/executors