#ste||ar on 2019-07-01 — irc logs at irclog.cct.lsu.edu

2019-06-17 20:46 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoD: https://developers.google.com/season-of-docs/

02:01 K-ballo has quit [Quit: K-ballo]

02:29 hkaiser has quit [Quit: bye]

02:48 lsl88 has quit [Ping timeout: 245 seconds]

03:03 lsl88 has joined #ste||ar

03:29 jbjnr_ has joined #ste||ar

03:32 jbjnr has quit [Ping timeout: 252 seconds]

06:00 nikunj has quit [Remote host closed the connection]

07:07 daissgr has joined #ste||ar

07:21 <simbergm> hkaiser: FYI if you read this later: http://rostam.cct.lsu.edu/builders/hpx_gcc_6_boost_1_64_centos_x86_64_release/builds/314, i.e. the all_to_all test segfaults sometimes :/ maybe only with older boost versions but not sure yet

07:29 <simbergm> heller: I'm guessing we'll need your alps PR for the course?

07:29 <heller> Yes

07:38 daissgr has quit [Ping timeout: 252 seconds]

07:50 <simbergm> I see you started updating some dependencies etc, how far did you get? I can try to set up the rest today

07:50 <simbergm> heller: ^

08:10 <heller> simbergm: yes, everything is updated, more or less

08:10 <heller> Under ~/hpx

08:10 <simbergm> so just the hpx installs missing?

08:10 <heller> ~/hpx/build/debug is uptodate

08:10 <simbergm> boost, hwloc, jemalloc all look like they're up to date

08:11 <heller> So I guess we need another release and Apex build

08:11 <heller> Yup

08:11 <simbergm> all right, thanks

08:12 <simbergm> no gpus it seems like

08:12 <simbergm> so the cuda examples we'll just talk about I guess

08:12 <simbergm> ?

08:42 Yorlik has joined #ste||ar

10:39 K-ballo has joined #ste||ar

10:51 <heller> simbergm: seems so, unless we want to show them on daint or so

11:00 <simbergm> heller: probably not... let's see how much time there is

11:00 <simbergm> btw, did you get: fatal: unable to access 'https://github.com/khuck/xpress-apex.git/': Failed to connect to localhost port 7777: Connection refused?

11:05 rori has joined #ste||ar

11:29 <heller> Erm

11:30 <heller> You need to set up a https tunnel

11:30 <heller> simbergm: there's a FAQ for that

11:31 <simbergm> heller: ugh, ok, thanks

11:31 <simbergm> I just scp:d a copy over for now

11:33 <heller> That'll work as well, I guess

12:19 hkaiser has joined #ste||ar

13:44 hkaiser has quit [Quit: bye]

14:19 <diehlpk_work> jbjnr_, simbergm DAINT Usage: 0 NODE HOURS (NH) Quota: 18,000 NH 0.0%

14:19 <diehlpk_work> Does this mean we have 18k NH?

14:23 hkaiser has joined #ste||ar

14:31 <diehlpk_work> jbjnr_, Do you still intend to join today's meeting?

14:35 akheir has joined #ste||ar

14:59 hkaiser has quit [Quit: bye]

15:06 diehlpk has joined #ste||ar

15:06 <diehlpk> jbjnr_, yet?

15:41 diehlpk has quit [Ping timeout: 264 seconds]

15:46 hkaiser has joined #ste||ar

16:25 <heller> simbergm: what's the state of the installation?

16:35 <rori> quit

16:35 rori has quit [Quit: WeeChat 1.9.1]

16:55 <simbergm> heller: release and profiling-apex are there now

16:56 <simbergm> but I'm not sure if I got the apex build right, vampir refuses to read the otf files

16:56 <heller> Hmmm

16:56 <simbergm> diehlpk_work: I would assume so (john is away now)

16:56 <heller> Maybe wrong otf2 library?

16:57 <diehlpk_work> simbergm, Cool, even better

16:58 <simbergm> heller: maybe, I just used the one that was there

16:59 <heller> That might have been too old

16:59 <heller> That's 2 years old, IIRC

16:59 <simbergm> ok, I'll try that tomorrow

17:35 <simbergm> heller: did you use the vtune/itt build last time? do we need it?

17:36 <heller> I don't think we need it

17:36 <heller> We showed some itt results last time, but I think it's good enough to focus on vampir

18:51 hkaiser has quit [Quit: bye]

19:58 hkaiser has joined #ste||ar

20:17 <diehlpk_work> Name: Modern CUDA and C++ by Bryce Adelstein Lelbach

20:18 <diehlpk_work> hkaiser, Do you want to have a different title for Bryce's talk?

20:20 <diehlpk_work> Playlist: Talks @ Ste||ar group

20:23 nikunj97 has joined #ste||ar

20:35 <simbergm> tarzeau: the various dev packages are needed for libhpx-dev

20:35 <simbergm> although I don't fully understand how the boost packages work

20:37 <simbergm> and if you want to have google-perftools as a dependency you can use tcmalloc instead of jemalloc, although I'll admit I don't really know what one gets by linking hpx with google-perftools (not talking about tcmalloc)

20:39 <nikunj97> hkaiser: yt?

20:53 <hkaiser> nikunj97: here

20:53 <hkaiser> diehlpk_work: nah, I think it's fine as it is

20:54 <nikunj97> hkaiser: I calculated the amortized time for 1 tile/timestep. It is pretty low at about 5-7 us only

20:54 <hkaiser> nikunj97: ok, that's not what I meant :/

20:54 <nikunj97> :/

20:55 <hkaiser> let's talk again tomorrow

20:55 <nikunj97> I have all the graphs btw

20:55 <hkaiser> ok, nice - let's look over them tomorrow as well

20:55 <nikunj97> 32000 points and 64 domains is the sweet spot

20:55 <hkaiser> nod

20:56 <heller> simbergm: did you get Apex working? Could you send me steps to reproduce?

20:57 <simbergm> heller: haven't tried again

20:57 <heller> ok

20:57 <simbergm> you want to try the build or running with apex?

20:57 <heller> both

20:57 <diehlpk_work> hkaiser, Ok, so I will let the IT guys know and they can publish the talk

20:58 <diehlpk_work> So we can release the talk and get rid of my stalkers :)

20:58 <simbergm> I'll look at it again tomorrow, but you can try opening ~/OTF_archive/APEX.otf (or something like that), I think I left it there

20:58 <diehlpk_work> people still ask me for his talk on Twitter

21:00 <hkaiser> diehlpk_work: sure, and thanks!

21:50 <nikunj97> hkaiser: to implement validate as they have done, I will need to know about the sum of all points from the previous and next tile. Also, I will need the first element from the next tile.

22:12 K-ballo1 has joined #ste||ar

22:14 K-ballo has quit [Read error: Connection reset by peer]

22:17 K-ballo1 has quit [Ping timeout: 258 seconds]

22:18 K-ballo has joined #ste||ar

22:21 <nikunj97> I think I know how to implement checksums

23:12 <nikunj97> hkaiser: I think I finally understood everything Jackob is trying in his code. I'll convert the 1d stencil to do exactly what his code does. As for tomorrow, let's just say we found scope for optimization so we're working on it.

23:13 <nikunj97> I can port 1d stencil to work same as Jackob, but much faster. It will take some time though.

23:46 <nikunj97> hkaiser: just implemented the checksums and ported Jackob's code

23:46 <nikunj97> am going home now. Will write benchmarking scripts tomorrow morning

23:49 <nikunj97> just in case you want to know how it performs, then it takes about 8s to run

23:50 <nikunj97> I think I can optimize it further, but I'm too tired now