hkaiser changed the topic of #ste||ar to: The topic is 'STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/'
diehlpk has joined #ste||ar
<diehlpk> hkaiser, I could only get an appointment at the notary at 11:30, so I might need to leave the meeting early
<hkaiser> diehlpk: sure, np
<diehlpk> For the second level, the scaling looks good
<diehlpk> At least with the MPI port we see good scaling
<diehlpk> hkaiser, can you take care of the performance counter and HPX section?
<hkaiser> diehlpk: I can try
<diehlpk> Ok, I will take care of the computer science results and can join later
<diehlpk> hkaiser, are you fine with the technical report for the hpxMP paper?
<diehlpk> Then I will submit it to arxiv.org and we can add it to the web page
<hkaiser> yes, sure
<hkaiser> there's nobody interested in continuing that work, so let's get it uploaded
<diehlpk> Ok, will do it tonight.
<diehlpk> Dominic checked the simulation results against previous results and they look good
<hkaiser> nice
<diehlpk> We are off at some points, but they believe this could be due to the low resolution
<hkaiser> is this the 0.5 or the 0.7 run?
<diehlpk> double white dwarf
<hkaiser> well, the 400 million K temperature didn't look convincing...
<diehlpk> At least the other quantities look better
<hkaiser> ok
<diehlpk> Dominic will check the resolution of the previous run
<diehlpk> yet the small star is only 20 cells
diehlpk has quit [Ping timeout: 245 seconds]
hkaiser has quit [Quit: bye]
nikunj97 has joined #ste||ar
nikunj97 has quit [Ping timeout: 260 seconds]
nikunj97 has joined #ste||ar
nikunj has joined #ste||ar
nikunj97 has quit [Ping timeout: 248 seconds]
nikunj has quit [Ping timeout: 248 seconds]
diehlpk_work has quit [Remote host closed the connection]
heller1 has quit [*.net *.split]
hkaiser has joined #ste||ar
nikunj has joined #ste||ar
nikunj97 has joined #ste||ar
<simbergm1> it's been tagged and it looks like I managed to make it point to the correct commit :)
<hkaiser> \o/
<simbergm1> announcements etc will follow
<simbergm1> but docs and download links on stellar-group.org should already be up
<rori> Thanks :D
<hkaiser> simbergm1: thanks a lot!
nikunj has quit [Ping timeout: 272 seconds]
nikunj97 has quit [Ping timeout: 272 seconds]
nikunj has joined #ste||ar
nikunj97 has joined #ste||ar
Nikunj__ has joined #ste||ar
Nikunj__ has quit [Client Quit]
Nikunj__ has joined #ste||ar
nikunj97 has quit [Ping timeout: 240 seconds]
nikunj has quit [Ping timeout: 240 seconds]
hkaiser has quit [Quit: bye]
hkaiser has joined #ste||ar
nikunj97 has joined #ste||ar
Nikunj__ has quit [Ping timeout: 255 seconds]
hkaiser has quit [Ping timeout: 240 seconds]
hkaiser has joined #ste||ar
<hkaiser> simbergm1: hey
<hkaiser> we now have two threading modules: threading_base and thread_support
<hkaiser> do you think we could unify the naming scheme (i.e. either 'thread_base' or 'threading_support')?
<hkaiser> from looking at the other modules, we might want to name it threading_support
<hkaiser> well, not quite, we have both naming styles (iterator_support/allocator_support and naming_base)
<hkaiser> should we think about such details at all?
<hkaiser> :/
<K-ballo> who is the official namer of modules?
<hkaiser> do we have one?
<hkaiser> sounds like something to discuss in the next coordination/PMC call
<simbergm1> hkaiser: K-ballo we definitely don't have an official namer :P
<simbergm1> yeah, I'm not married to the name...
<simbergm1> I guess base and support have slightly different connotations
<simbergm1> threading_base has the base classes (plus helper stuff which could of course be called support...)
<simbergm1> thread_support is for os threads
<simbergm1> then there's basic_execution as well...
<simbergm1> I'm not sure how to distinguish between modules that deal with os threads and ones that deal with hpx threads
<simbergm1> concurrency is also for os threads, synchronization is for hpx threads...
<simbergm1> added a bullet point for tomorrow's call
nikunj97 has quit [Read error: Connection reset by peer]
K-ballo has quit [Read error: Connection reset by peer]
K-ballo has joined #ste||ar
K-ballo has quit [Read error: Connection reset by peer]
<hkaiser> simbergm1: I'm more concerned about the thread vs. threading naming convention; _base and _support are ok
<simbergm1> hkaiser: ah, sure
<simbergm1> threading_support sounds a tiny bit better than thread_base, but I don't care too much
<simbergm1> I can definitely change it if you'd like
<simbergm1> note that #4399 adds one named just hpx_thread because all it contains is hpx::thread
K-ballo has joined #ste||ar
K-ballo has quit [Read error: Connection reset by peer]
<hkaiser> sure
<hkaiser> simbergm1: I know it's a superfluous problem ;-)
K-ballo has joined #ste||ar
K-ballo has quit [Read error: Connection reset by peer]
<simbergm1> hkaiser: should we wait a bit with renaming? We might want to change others as well when we have (even) more modules
<hkaiser> we might indeed, otoh, for the new module now might be a good time to avoid having to generate the forwarding headers
<hkaiser> but let's discuss this tomorrow
<simbergm1> Yeah, let's discuss tomorrow
<Yorlik> Would this be UB or just dangerous? I'm experimenting with data containing its own type. https://godbolt.org/z/XqeXkJ
<Yorlik> The idea would be to store this kind of data in a monotonic allocator.
<Yorlik> And then traverse the list of void* in a thin wrapper.
<hkaiser> casting through void* is always at least dangerous
<Yorlik> Ya - that's clear. But do you spot any UB stuff in what I'm doing here?
<Yorlik> And what would be a better idea, if you want polymorphic data in a monotonic storage area and you need to iterate over it?
<Yorlik> I am pondering using this for my internal message system and having the mailboxes be backed by monotonic memory buffers
<Yorlik> After all, the system will process a lot of messages, so I'm thinking about throughput optimization for those too.
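What Yorlik describes roughly corresponds to the sketch below. It is not his code: the message types, the header/tag layout, and the use of std::pmr::monotonic_buffer_resource are assumptions made for illustration. The point is that every record carries its own type tag and is only ever cast back to the exact type it was constructed as.

    #include <cstddef>
    #include <cstdio>
    #include <memory_resource>
    #include <new>
    #include <vector>

    // Hypothetical message types: every record begins with a header that
    // stores a type tag, so the reader knows the exact type a record was
    // constructed as before casting back to it.
    enum class msg_type { ping, move };

    struct header { msg_type type; };

    struct ping_msg { header h{msg_type::ping}; int seq = 0; };
    struct move_msg { header h{msg_type::move}; float x = 0.f, y = 0.f; };

    int main()
    {
        std::byte storage[4096];
        std::pmr::monotonic_buffer_resource arena(storage, sizeof(storage));

        // The "thin wrapper": just a list of pointers into the arena.
        std::pmr::vector<void*> mailbox(&arena);

        auto* m1 = new (arena.allocate(sizeof(ping_msg), alignof(ping_msg)))
            ping_msg{};
        m1->seq = 42;
        mailbox.push_back(m1);

        auto* m2 = new (arena.allocate(sizeof(move_msg), alignof(move_msg)))
            move_msg{};
        m2->x = 1.f;
        m2->y = 2.f;
        mailbox.push_back(m2);

        for (void* p : mailbox)
        {
            // Reading the header is fine because header is the first member
            // of a standard-layout type (pointer-interconvertible with the
            // enclosing record).
            switch (static_cast<header*>(p)->type)
            {
            case msg_type::ping:
                std::printf("ping %d\n", static_cast<ping_msg*>(p)->seq);
                break;
            case msg_type::move:
                std::printf("move %f %f\n", static_cast<move_msg*>(p)->x,
                    static_cast<move_msg*>(p)->y);
                break;
            }
        }
        // No per-message delete: the types are trivially destructible and
        // the arena releases everything at once when it goes out of scope.
    }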
<hkaiser> Yorlik: I usually stop looking when I see (void*) casts
<hkaiser> so I don't know
<hkaiser> you're 100% on your own here
<Yorlik> OK. Got it.
<Yorlik> Is it considered UB?
<Yorlik> With danger I can live, with UB not.
<hkaiser> well, casting a void* to some type T* should be done using reinterpret_cast, and that is UB unless the void* was itself the result of a cast from a T*
<hkaiser> IOW: T1*->void*->T2* is ok only if T1 == T2
<hkaiser> you should never use C-style casts, btw
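For reference, here is a minimal, self-contained illustration of the rule hkaiser states; widget and gadget are made-up types for the example, and static_cast is used for the void* round trip (reinterpret_cast behaves the same way for void* conversions).

    #include <cstdio>

    struct widget { int id = 7; };
    struct gadget { double mass = 1.0; };

    int main()
    {
        widget w;

        // widget* -> void* -> widget*: the round trip gives back the original
        // pointer, so using the result is fine.
        void* p = &w;
        widget* same = static_cast<widget*>(p);   // reinterpret_cast is equivalent here
        std::printf("%d\n", same->id);            // OK

        // widget* -> void* -> gadget*: the cast compiles, but there is no
        // gadget object at that address, so using the result is undefined
        // behaviour.
        gadget* wrong = static_cast<gadget*>(p);
        (void) wrong;
        // std::printf("%f\n", wrong->mass);      // UB: never do this
    }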
K-ballo has joined #ste||ar
nikunj has joined #ste||ar
nikunj has quit [Read error: Connection reset by peer]
hkaiser has quit [Quit: bye]
nikunj has joined #ste||ar
hkaiser has joined #ste||ar
<Yorlik> hkaiser: I will guarantee that the void* pointers arriving at this system come from the correct classes. I think I can make this thing safe to the outside.
<hkaiser> *sure* - famous last words
<Yorlik> BTW, I measured my machine's memory throughput; from what I see, the server is practically at the limit of my machine. So it's good :)
<hkaiser> nice
<hkaiser> the auto_chunk_size was merged, btw
<Yorlik> The best measurement I got was 22 GB/s, and that's practically what my machine achieves loading large datasets with 4 threads.
<Yorlik> I used pmbw (the Parallel Memory Bandwidth Benchmark from panthema.net) to measure the memory bandwidth.
<Yorlik> The relevant image for my machine is this one: https://i.imgur.com/MnXq2mg.png
<hkaiser> ok
<Yorlik> So - with large datasets it practically flattens out at ~20-25 GB/s
<Yorlik> With 4 threads
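For context, the kind of number Yorlik quotes can be approximated with a small multi-threaded read benchmark like the sketch below. This is not pmbw and is only an assumption about what such a measurement might look like: pmbw sweeps many array sizes and also measures write and copy bandwidth, which this toy example does not.

    #include <atomic>
    #include <chrono>
    #include <cstdint>
    #include <cstdio>
    #include <thread>
    #include <vector>

    int main()
    {
        constexpr std::size_t n = std::size_t(1) << 27;   // 1 GiB of uint64_t
        constexpr int threads = 4;

        std::vector<std::uint64_t> data(n, 1);
        std::atomic<std::uint64_t> sink{0};

        auto start = std::chrono::steady_clock::now();

        std::vector<std::thread> pool;
        for (int t = 0; t < threads; ++t)
        {
            pool.emplace_back([&, t] {
                std::size_t const begin = t * (n / threads);
                std::size_t const end = (t + 1) * (n / threads);
                std::uint64_t sum = 0;
                for (std::size_t i = begin; i < end; ++i)
                    sum += data[i];
                sink += sum;   // keep the reads from being optimized away
            });
        }
        for (auto& th : pool)
            th.join();

        std::chrono::duration<double> dt =
            std::chrono::steady_clock::now() - start;
        double const gib = double(n) * sizeof(std::uint64_t) /
            (1024.0 * 1024.0 * 1024.0);
        std::printf("read %.2f GiB in %.3f s -> %.2f GiB/s (checksum %llu)\n",
            gib, dt.count(), gib / dt.count(),
            (unsigned long long) sink.load());
    }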
<Yorlik> Since we will process all messages for an object in one go, we will profit from cache effects by maximising temporal locality.
<Yorlik> Can't wait to have it done and measure it :)