#ste||ar on 2019-06-30 — irc logs at irclog.cct.lsu.edu

2019-06-17 20:46 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoD: https://developers.google.com/season-of-docs/

00:25 hkaiser has joined #ste||ar

00:31 nikunj has quit [Remote host closed the connection]

00:31 nikunj has joined #ste||ar

02:19 hkaiser has quit [Quit: bye]

02:26 K-ballo has quit [Quit: K-ballo]

07:47 nikunj has quit [Remote host closed the connection]

12:34 K-ballo has joined #ste||ar

14:33 hkaiser has joined #ste||ar

14:57 <simbergm> tarzeau: yt? the -dev package also needs the libraries as dependencies :)

14:57 <simbergm> and you can remove the gfortran dependency

15:59 <tarzeau> yt?

16:00 <tarzeau> i have these: Depends: libatomic1, libhpx1 (= 1.3.0-1)

16:00 <tarzeau> which libs are missing ?

16:17 nikunj has joined #ste||ar

17:22 quaz0r has quit [Ping timeout: 258 seconds]

17:36 quaz0r has joined #ste||ar

21:08 quaz0r has quit [Ping timeout: 246 seconds]

21:22 <nikunj> hkaiser: want to listen to some good news?

21:25 <nikunj> I don't see any difference in overheads on my laptop

21:25 <nikunj> I'll run them on marvin now and see if it's the same case there as well

21:32 <hkaiser> nikunj: what do you mean?

21:33 <nikunj> I mean that the running replay over the normal one has no overhead

21:33 <hkaiser> heh

21:33 <nikunj> they run about the same time

21:33 <hkaiser> on stencil1d_4?

21:33 quaz0r has joined #ste||ar

21:33 <nikunj> that's without errors though

21:33 <nikunj> I mean the implementation overheads only

21:33 <hkaiser> ok

21:33 <hkaiser> cool

21:34 <nikunj> yes, on stencil1d_4

21:34 <hkaiser> nice

21:34 <nikunj> I'll add the checksum function in the evening

21:34 <hkaiser> without errors there shouldn't be too much overhead to begin with

21:34 <nikunj> till then I'll write a script to compare the standard with replay

21:34 <hkaiser> ok

21:35 <nikunj> GaTech have 3s of overhead without failures xD

21:35 <nikunj> but they have more workers and more iterations

21:35 <nikunj> we don't have multiple time steps in a single iteration so we can't compare directly

21:36 <nikunj> but overall, I really like where we're going

21:36 <hkaiser> ok, good

21:39 <nikunj> just ran some on marvin

21:39 <nikunj> 1600 points per tiles is not good enough work to hide the overheads

21:39 <nikunj> I see some 1.6s difference

21:40 <nikunj> but 32000 points per tile reduces this to 0.5-0.7s

21:40 <hkaiser> ok

21:40 <hkaiser> how long does one thread/tile take?

21:41 <nikunj> didn't get what you mean

21:41 <hkaiser> how much work (time-wise) is '1600 points/tile'?

21:42 <nikunj> 16000 points within one tile with 128 tiles in total over 8192 iterations take some 3.8s

21:42 <nikunj> with replay added it increases to 5.5s

21:42 <nikunj> it's over 4 os threads

21:43 <hkaiser> nikunj: I meant per tile/timestep? how much work is that?

21:43 <nikunj> wait no, over 16 os threads

21:43 <nikunj> subdomain width right?

21:43 <hkaiser> let's talk tomorrow ;-)

21:44 <nikunj> I think I'm mis understanding. alright, let's do it tomorrow :)

21:44 <nikunj> I'll run some tests in the meantime

21:44 <nikunj> so we can show them the results on tuesday

21:44 <nikunj> btw Jackson's code benchmarks are here, I'll generate graphs for them as well

21:45 <hkaiser> good