hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2020
<hkaiser> parsa: yes
<hkaiser> also, you have almost no work at all - how many timesteps do you run?
<parsa> hkaiser: 10K timesteps
<parsa> had a typo, it was running for the default 45 timesteps instead of 10K… i have to redo
<parsa> update: 10k was a bad choice. it's too long and completely hides migration
<hkaiser> parsa: isn't that what we want?
<parsa> i mean, it's ideal, but it would make all cases, even the blocking ones look the same
hkaiser has quit [Quit: bye]
<peltonp1> is there a way to launch multiple localities on a desktop or how do you test distributed programs without a cluster?
<zao> peltonp1: Yup. A locality is just a process, batch systems just help orchestrate the launching of multiple processes and letting them know which other processes there are to communicate with.
<zao> You can start them yourself and give arguments to let them know which one is the first rank, and then tell the others to connect to it. There might be some helpful information in the bundled Python scripts that are used by the test suite.
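(A minimal sketch of what zao describes, assuming an HPX application built with the TCP parcelport; `./my_hpx_app` is a placeholder binary name, and the flags are taken from the HPX manual's section on manually launching localities — verify against `--hpx:help` for your HPX version:)

```shell
# Locality 0 (the "console"): hosts AGAS and waits until all
# localities have connected before running hpx_main.
./my_hpx_app --hpx:localities=2 \
             --hpx:agas=localhost:7910 \
             --hpx:hpx=localhost:7910 &

# Locality 1 (a "worker"): points at locality 0's AGAS address,
# listens on its own port so both processes can share the machine.
./my_hpx_app --hpx:localities=2 \
             --hpx:agas=localhost:7910 \
             --hpx:hpx=localhost:7911 \
             --hpx:worker
```

The bundled `hpxrun.py` script that zao mentions automates exactly this kind of multi-process launch for the test suite (e.g. passing it the number of localities to spawn), so it is a good reference for the full set of arguments.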
gonidelis[m] has quit [*.net *.split]
gnikunj[m] has quit [*.net *.split]
norbert[m] has quit [*.net *.split]
rori has quit [*.net *.split]
gonidelis[m] has joined #ste||ar
gnikunj[m] has joined #ste||ar
norbert[m] has joined #ste||ar
rori has joined #ste||ar
hkaiser has joined #ste||ar
<hkaiser> parsa: yt?
<parsa> hkaiser: yeah
<hkaiser> parsa: hey
<hkaiser> parsa: I'd be available to talk whenever you have time
<parsa> i'm all set
<hkaiser> sec
bita has joined #ste||ar
<hkaiser> parsa: yt?
<parsa> yes
<hkaiser> parsa: for the shifted case, could you change the measurements such that it starts off with the shifted placement and migrate it into the optimal solution instead of doing it the other way around?
<hkaiser> I think that would be more consistent with the impaired case
<hkaiser> because if you start with the optimal case, introducing non-optimal data placement using migration will not end up with a use case that corresponds to what people would like to have
<parsa> okay. so just to check: we want 4 new sets of data: the impaired case with no migration, plus the shifted case (data shifted when the components are created) in blocked, overlapped, and no-migration variants. right?
<parsa> hkaiser: ^
<hkaiser> parsa: overall: start impaired, migrate in place blocked and overlapped; start shifted, migrate in place blocked and overlapped; baseline (no migration, optimal data placement); impaired (no migration); and shifted (no migration)
<hkaiser> so three baseline measurements, and 4 migration scenarios
<hkaiser> one baseline and the overlapped migrations are there - no need to redo those
<hkaiser> what I think we would need is two baseline measurements of the impaired (start load-imbalanced) and shifted (start with bad data locality) cases
<hkaiser> and the shifted migration scenarios, just not start with the good placement, but start with the bad placement
<hkaiser> parsa: does this make sense?
<parsa> not yet, still going through it
<hkaiser> parsa: sorry, I misspoke
<parsa> i get the baseline measurements part… no migration... 1: optimal; 2: impaired; 3: all data shifted to the neighbor at start
<hkaiser> I said 'one baseline and the overlapped migrations are there - no need to redo those' but I meant 'one baseline and the impaired migrations are there - no need to redo those'
<hkaiser> yes
<hkaiser> now the migration scenarios are simple
<hkaiser> the impaired are ok, no need to redo
<hkaiser> what would be nice is to have shifted migration measurements (overlapped and blocked) but start with the shifted placement and migrate things to the optimal layout
<parsa> well we don't have the impaired without migration… i'll run on one node a couple of times and get you the average
<hkaiser> parsa: yes, thanks
<parsa> hkaiser: impaired case is 641.877 seconds… i'll email the rest once i get them
<hkaiser> parsa: on one locality? or two?
<hkaiser> parsa: btw, the overall scaling is almost a factor of 9 when going from two to ten nodes - that seems to be over the top and needs explanation
<hkaiser> if this 641.877s is on one node, what's the optimal base line for one node?
parsa[m] has joined #ste||ar
<hkaiser> parsa[m]: can you read back?
<parsa[m]> I’ll be back in an hour
<hkaiser> ok
<parsa> hkaiser: that is on one node… i did run it on two nodes for good measure and, as expected, got the same exec time
<parsa> right… going from 2 to 8 localities and getting a speedup of 14 :O
<parsa> 1* to 8 localities
<hkaiser> parsa: can you run the others on one node as well
<hkaiser> at least the optimal baseline?
<parsa> yes
<hkaiser> cool, thanks
<hkaiser> I think the data we'll have is more than sufficient, then
<parsa> hkaiser: ping
<hkaiser> here
<parsa> i've figured putting the raw data in a gsheet is expedient... see link in pm