<hkaiser>
diehlpk_work: we probably should, this repo is automatically populated, however, so we might need to do some scripting
akheir has quit [Read error: Connection reset by peer]
akheir has joined #ste||ar
<hkaiser>
diehlpk_work: done, let's see if it gets overwritten
hkaiser has quit [Quit: bye]
weilewei has quit [Remote host closed the connection]
bita has joined #ste||ar
bita has quit [Quit: Leaving]
nan1 has quit [Remote host closed the connection]
akheir has quit [Quit: Leaving]
mdiers_ has quit [Remote host closed the connection]
mdiers_ has joined #ste||ar
<wash[m]>
Sorry folks, what did you need my consent for?
<wash[m]>
Ah for the JOSS submission? That's fine :)
<simbergm>
hkaiser: thanks for adding the license
<simbergm>
I think it should stay there, if I remember correctly what the scripts do
<heller1>
so I just noticed one thing ...
<heller1>
... and I think hpx::init needs to go. Here is the reason: each test that requires HPX threads to run needs to go through hpx_init ... creating nice cycles ...
<simbergm>
heller: if anything the tests need to be rewritten not to use hpx::init
<simbergm>
but it's only tests, the modules otherwise don't have that dependency
<simbergm>
and kicking out hpx::init just for that reason feels user-hostile
<simbergm>
but there might be good solutions
<simbergm>
do we have a way of making a future<void> from a future<tuple<future<void>, future<void>>> (returned from e.g. when_all) without spawning a task?
<simbergm>
unwrap will actually wait for the future, but I'd like to just collapse it into a future<void>
<heller1>
future<future<T>> -> future<void> works
<heller1>
but we don't have a way to further inspect that
<heller1>
ms[m]: there's split_future, if that helps
<simbergm>
yeah, I guess split future would actually do the right thing, even if it's semantically a bit iffy
<simbergm>
thanks!
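
A minimal sketch of the split_future route discussed above, assuming the HPX API of that era (hpx::when_all returning a future over an hpx::util::tuple accessed via hpx::util::get, hpx::split_future, and the unwrapping future constructor); f1 and f2 are placeholder tasks:

    #include <hpx/hpx_main.hpp>
    #include <hpx/include/async.hpp>
    #include <hpx/include/lcos.hpp>

    #include <utility>

    int main()
    {
        hpx::future<void> f1 = hpx::async([] {});
        hpx::future<void> f2 = hpx::async([] {});

        // when_all yields a future<tuple<future<void>, future<void>>>
        auto all = hpx::when_all(std::move(f1), std::move(f2));

        // split_future splits the tuple without spawning a task
        auto parts = hpx::split_future(std::move(all));

        // the unwrapping constructor collapses future<future<void>> into
        // future<void>; when_all only becomes ready once *both* inputs are
        // ready, so waiting on either element suffices (the "semantically
        // a bit iffy" part mentioned above)
        hpx::future<void> done(std::move(hpx::util::get<0>(parts)));
        done.get();

        return 0;
    }
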
nikunj has quit [Remote host closed the connection]
nikunj has joined #ste||ar
hkaiser has joined #ste||ar
<hkaiser>
simbergm: master has been broken since the latest merges
<simbergm>
hkaiser: right you are
<simbergm>
sorry, entirely my fault
<simbergm>
I'll fix it
<hkaiser>
thanks a lot
<hkaiser>
simbergm: thanks for your thorough review of #4540
<diehlpk_work>
hkaiser_, I went through all ste||ar group repos and added a ticket where a license is missing
<hkaiser_>
diehlpk_work: thanks
nan11 has joined #ste||ar
gonidelis has joined #ste||ar
akheir has joined #ste||ar
<hkaiser_>
diehlpk_work: do we have a meeting now?
<diehlpk_work>
Yes, we are already in
<diehlpk_work>
hkaiser_, I sent the Zoom link to the operation bell list
karame_ has joined #ste||ar
rtohid has left #ste||ar [#ste||ar]
rtohid has joined #ste||ar
akheir has quit [Read error: Connection reset by peer]
akheir1 has joined #ste||ar
mcopik has joined #ste||ar
mcopik has quit [Client Quit]
bita has joined #ste||ar
bita_ has joined #ste||ar
bita_ has quit [Quit: Leaving]
gonidelis has quit [Ping timeout: 240 seconds]
rtohid has left #ste||ar [#ste||ar]
nan11 has quit [Remote host closed the connection]
nan11 has joined #ste||ar
weilewei has quit [Remote host closed the connection]
weilewei has joined #ste||ar
<weilewei>
hkaiser_ how should I correctly insert a timer for the communication phase of the ringG algorithm? For the computation part, I can insert start and end timers around line 73, but the communication phase is an async operation and, more importantly, a loop (depending on how many ranks there are). Also, I do not want to count the memory copy phase.
<hkaiser_>
you can only measure the overall time reliably, I think
<weilewei>
I see
<hkaiser_>
or each timestep in the loop
<weilewei>
I see. Is it wise to time each function inside the loop and also each step, and then compute communication_time = total_time_per_step - compute_time - copy_time?
Amy1 has quit [Ping timeout: 256 seconds]
Amy1 has joined #ste||ar
karame_ has quit [Quit: Ping timeout (120 seconds)]
<hkaiser_>
weilewei: try it - you can't really measure communication time as it's overlapped
<weilewei>
hkaiser_ ah, I see now, even when receiving data, the program is doing copy and update...
<hkaiser_>
weilewei: the most you can do is to measure how long it sits in mpi_wait
<hkaiser_>
to allow assessing how much cpu time is wasted ;-)
Rory89 has joined #ste||ar
<weilewei>
hkaiser_ hmmm true... but still, if I measure mpi_wait, that does not reflect the complete picture of the communication time. Maybe just measure the whole for-loop.
<hkaiser_>
right
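
A rough sketch of the approach settled on here (time each phase per step, plus the whole loop), using std::chrono; compute_step and communicate_step are hypothetical stand-ins for the ringG phases:

    #include <chrono>
    #include <iostream>

    // Hypothetical stand-ins for the per-rank phases discussed above.
    void compute_step() { /* local update */ }
    void communicate_step() { /* async exchange + copies, overlapped */ }

    int main()
    {
        using clock = std::chrono::steady_clock;
        int const num_steps = 10;    // e.g. one step per rank in the ring

        auto loop_start = clock::now();
        for (int step = 0; step != num_steps; ++step)
        {
            auto t0 = clock::now();
            compute_step();
            auto t1 = clock::now();
            communicate_step();
            auto t2 = clock::now();

            // Per-step compute time is reliable; the remainder mixes
            // communication with whatever overlaps it, so it is only a
            // rough upper bound on communication cost.
            std::chrono::duration<double> compute = t1 - t0;
            std::chrono::duration<double> rest = t2 - t1;
            std::cout << "step " << step << ": compute " << compute.count()
                      << " s, comm+copy " << rest.count() << " s\n";
        }
        std::chrono::duration<double> total = clock::now() - loop_start;
        std::cout << "whole loop: " << total.count() << " s\n";
    }
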
<bita>
hkaiser_, Rory89 and I were talking about parallel inverse. What kind of algorithm should we work on? I was telling Rory that having a for loop parallelized with constraints is not what we do in Phylanx
<hkaiser_>
Rory89 and Avah have discussed what algorithm to use, no?
<bita>
I think Avah has an OpenMP approach in mind
<hkaiser_>
sure, that's what we could start with, no?
<Rory89>
Yeah, it's just Gauss Inverse with different localities owning different columns
<hkaiser_>
in the first step our implementation will suck perf-wise anyways ;-)
<bita>
I am not sure how it can be implemented. Rory, can you explain its details a bit more?
<hkaiser_>
Rory89: several columns per locality?
<Rory89>
Perhaps more than one column per locality. If you have an nxn matrix, it just splits those n columns up evenly, or approximately so, across the localities
<hkaiser_>
nod
<hkaiser_>
makes sense
<hkaiser_>
so you need to do different operations: a) find pivot, b) find coefficients, and c) apply coefficients
<hkaiser_>
is there more?
<bita>
I was telling Rory that he needs to make a distributed matrix (for the one that starts as the identity)
<Rory89>
The problem I was having was how to handle the result matrix.
<hkaiser_>
yes
<bita>
and he had questions about where we have annotations
<hkaiser_>
ok, what's the problem?
<hkaiser_>
the inverse returns a new (tiled) matrix with a corresponding annotation attached
<bita>
Rory, can you implement the 3 functions that hkaiser_ mentioned?
<hkaiser_>
it's very similar to what we have done in other places
<bita>
In other places we didn't have iterations
<Rory89>
The user sends off a matrix A to be inverted, and all of the localities need access to another matrix, call it B. So that's the only problem: creating a new matrix in the code that isn't sent in the test
<Rory89>
that all of the localities can read and write to their respective locations.
<bita>
in Rory's Gauss inverse everything happens in a for loop
<hkaiser_>
Rory89: same is done in dot product
<hkaiser_>
it creates a new matrix and fills it with the result of the operation
<hkaiser_>
the only difference is that you attach the distributed matrix to the result and not to the input
<hkaiser_>
or possibly to both, not sure
<hkaiser_>
you start off with an identity matrix, right?
<Rory89>
Ah, so B in this case is essentially like "result_matrix" in dist_dot?
<Rory89>
Yep, exactly
<hkaiser_>
yah
<hkaiser_>
we can even call inverse with both matrices, the one to invert and the identity matrix generated by Nan's identity_d()
<hkaiser_>
inverse_d(A, __arg(B, identity_d(shape(A)))) or somesuch
<hkaiser_>
if that helps, that is
<hkaiser_>
so you don't have to duplicate Nan's code
<Rory89>
Yep that makes sense, thanks!
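
For reference, a serial sketch of the three operations hkaiser_ listed, a) find pivot, b) find coefficients (scale the pivot row), and c) apply coefficients (eliminate the column elsewhere), applied to A and an identity matrix B; in the distributed version each locality would own a block of columns of both matrices. This is plain Gauss-Jordan, not the actual Phylanx primitive:

    #include <cmath>
    #include <cstddef>
    #include <iostream>
    #include <vector>

    using matrix = std::vector<std::vector<double>>;

    // Invert A into B; B starts as the identity and ends as A's inverse.
    bool gauss_jordan_inverse(matrix a, matrix& b)
    {
        std::size_t const n = a.size();
        b.assign(n, std::vector<double>(n, 0.0));
        for (std::size_t i = 0; i != n; ++i)
            b[i][i] = 1.0;    // B starts as the identity

        for (std::size_t col = 0; col != n; ++col)
        {
            // a) find pivot: row with the largest entry in this column
            std::size_t pivot = col;
            for (std::size_t r = col + 1; r != n; ++r)
                if (std::abs(a[r][col]) > std::abs(a[pivot][col]))
                    pivot = r;
            if (a[pivot][col] == 0.0)
                return false;    // singular matrix
            std::swap(a[col], a[pivot]);
            std::swap(b[col], b[pivot]);

            // b) find coefficients: scale the pivot row
            double const inv = 1.0 / a[col][col];
            for (std::size_t c = 0; c != n; ++c)
            {
                a[col][c] *= inv;
                b[col][c] *= inv;
            }

            // c) apply coefficients: eliminate this column in all other rows
            for (std::size_t r = 0; r != n; ++r)
            {
                if (r == col)
                    continue;
                double const f = a[r][col];
                for (std::size_t c = 0; c != n; ++c)
                {
                    a[r][c] -= f * a[col][c];
                    b[r][c] -= f * b[col][c];
                }
            }
        }
        return true;
    }

    int main()
    {
        matrix a = {{4, 7}, {2, 6}};
        matrix b;
        if (gauss_jordan_inverse(a, b))
            for (auto const& row : b)
            {
                for (double v : row)
                    std::cout << v << ' ';
                std::cout << '\n';
            }
    }
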
nan11 has quit [Remote host closed the connection]
bita has quit [Quit: Leaving]
Rory89 has quit [Remote host closed the connection]