<jbjnr>
hkaiser: simbergm I am working on DCA++ today and not going to get my PR for the scheduler ready for a bit - therefore I must sadly advise you to go ahead with your thread based changes and I'll just have to keep playing catch up later
<hkaiser>
jbjnr: thanks for looking into dca, and I think we can wait for another couple of days
<heller>
hkaiser: and the corresponding objects using the cache aligned data
<jbjnr>
(I think the main cause of my trouble is that the thread indexing was not consistent somewhere - this caused all my stuff to break in a way that still ran fine, but with degraded performance)
<hkaiser>
heller: sure
<heller>
hkaiser: that's the only place I can spot which might incur performance differences
quaz0r has joined #ste||ar
<hkaiser>
jbjnr: did you find the cause for this?
<hkaiser>
heller: nod
<jbjnr>
(in one of my major merges over the last couple of months, I must have messed up something - or perhaps someone changed the indexing somewhere and I didn't notice)
<jbjnr>
I get good performance on daint and laptop now, but ault and tave were down or in maintenance so I've not tested them. cholesky gives terrible performance and I want to fix it
<hkaiser>
ok
<jbjnr>
and apex doesn't work at all now. Can't even get OTF output working
<jbjnr>
not sure what I've done wrong
<hkaiser>
understood - we can wait, I think - np
<heller>
we really need someone to take care of apex
<hkaiser>
heller: Kevin is back on board, financing has been re-established ;-)
<jbjnr>
hkaiser: the problem with dca++ is that all the include paths have changed and for whatever reason the new modules are not being included
<hkaiser>
k
<hkaiser>
jbjnr: all the old paths should still work
<hkaiser>
you will get the warnings, but otherwise you should be fine
<simbergm>
jbjnr: :/
<simbergm>
build or install dir?
<simbergm>
what hkaiser said ^
<jbjnr>
things like <hpx/config.hpp>
<hkaiser>
should still be fine
<hkaiser>
if not, then the build system is broken
<jbjnr>
looks like config.hpp is no longer generated
<hkaiser>
jaafar: it never was
<hkaiser>
jbjnr: ^^
<simbergm>
should be in libs/config/include/hpx/config.hpp
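As a quick diagnostic for the include problem discussed above, a translation unit can probe whether <hpx/config.hpp> is visible at all before anything else is compiled; this is only a sketch of such a check, not part of DCA++ or HPX itself.

    // Diagnostic sketch: fail early and loudly if the HPX headers are not on
    // the include path (the symptom described above).
    #if defined(__has_include)
    #  if !__has_include(<hpx/config.hpp>)
    #    error "hpx/config.hpp not found - check the HPX include directories"
    #  endif
    #endif

    #include <hpx/config.hpp>

    int main() { return 0; }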
Coldblackice_ has joined #ste||ar
Coldblackice has quit [Ping timeout: 250 seconds]
<jbjnr>
heller: looking again at what I was doing with DCA++ makes me think I could use your context stuff here. I created an "abstraction layer" between the std::threads and the hpx::threads - but it would have been much easier with your stuff. (Except that I'd still need to rewrite everything anyway :( )
<heller>
jbjnr: yeah, probably. that's the spirit :D
<jbjnr>
did you say the std::threads version is feature complete?
<jbjnr>
how soon before we can actually use it?
<hkaiser>
I wouldn't hold my breath
<jbjnr>
lol
<heller>
I didn't say it is feature complete
<heller>
it's ready when the libfabric PR is in ;)
<jbjnr>
ok. I probably just assumed that "everything works" meant that!
<jbjnr>
libfabric + scheduler you mean
<heller>
right
<heller>
well. everything works that has been implemented
<heller>
that is, the agent stuff works, as in yield, suspend, resume
<heller>
all the context stuff and spawning functions on agents etc still requires work
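A rough sketch of what an agent with those three operations could look like, built only on std::thread primitives; all names here are illustrative and are not taken from the actual branch being discussed.

    #include <condition_variable>
    #include <mutex>
    #include <thread>

    // Illustrative execution agent supporting the operations mentioned above:
    // yield, suspend, resume. Not the real API, just the shape of it.
    class execution_agent
    {
    public:
        // cooperatively give up the CPU without blocking
        void yield()
        {
            std::this_thread::yield();
        }

        // block the calling thread until another thread calls resume()
        void suspend()
        {
            std::unique_lock<std::mutex> l(mtx_);
            suspended_ = true;
            cv_.wait(l, [this] { return !suspended_; });
        }

        // wake up a previously suspended agent
        void resume()
        {
            {
                std::lock_guard<std::mutex> l(mtx_);
                suspended_ = false;
            }
            cv_.notify_one();
        }

    private:
        std::mutex mtx_;
        std::condition_variable cv_;
        bool suspended_ = false;
    };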
<simbergm>
jbjnr (and others): I'd like your opinion on the resource partitioner
<jbjnr>
delete it all!
<hkaiser>
simbergm: which one? we have two ;-)
<jbjnr>
(kidding obviously)
<jbjnr>
simbergm: clean it up and reduce all the duplication
<simbergm>
currently it depends on the runtime for sanity checks (check that the runtime ptr is null or not)
<simbergm>
I could either add global functions that forward to the partitioner instance, or make init/start take an optional partitioner instance
<simbergm>
resource partitioner
<simbergm>
or just remove the checks...
<simbergm>
bah, it's not about duplication
<jbjnr>
remove the checks if possible
<jbjnr>
only in pre-main init does it ever not have them
<simbergm>
removing is easy :P
<jbjnr>
I like the idea of init/start taking a partitioner
<jbjnr>
you mean if the user instantiated one already?
<jbjnr>
as in some of the examples that create custom pools
<simbergm>
yeah, exactly
<jbjnr>
that would be clean I think. Not sure why the runtime pointer is needed. can't remember
<simbergm>
I also like that, but it adds more overloads to the already too many overloads...
<simbergm>
would let us get rid of the partitioner singleton though...
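A sketch of the overload being discussed here, i.e. init/start optionally taking a user-constructed partitioner so the singleton can go away; the signatures below are invented for illustration and are not the actual HPX API.

    namespace hpx {
        namespace resource { class partitioner; }

        // existing style: the partitioner is configured behind the scenes
        int init(int argc, char** argv);

        // proposed style: the user builds and customizes the partitioner
        // up front and hands it to init - no global singleton required
        int init(resource::partitioner&& rp, int argc, char** argv);
    }

    // usage would then roughly follow the custom-pool examples:
    //   hpx::resource::partitioner rp(argc, argv);
    //   rp.create_thread_pool("custom-pool");
    //   return hpx::init(std::move(rp), argc, argv);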
hkaiser has quit [Ping timeout: 250 seconds]
<simbergm>
so my question is mainly how badly do we want those sanity checks there
<simbergm>
essentially they check that you don't create thread pools after the runtime has been started
<simbergm>
or shrink/expand pools when it's not started (I wonder if that even works...)
<simbergm>
runtime pointer is needed for ^ checks
<jbjnr>
remove the checks and we can add new (improved?) ones if stuff fails?
hkaiser has joined #ste||ar
<jbjnr>
adding new thread pools after startup should be supported eventually
<jbjnr>
but not in this incarnation
<hkaiser>
simbergm: not sure if we should always require passing a partitioner to init by the user
<hkaiser>
that sounds awful
<jbjnr>
(not always. just when the user created one)
<simbergm>
hkaiser: no, optional of course
<jbjnr>
but he's right about start/init having way too many overloads already
<jbjnr>
it's confusing even for us
<simbergm>
but making it optional adds a billion new overloads
<jbjnr>
why is the singleton a problem? because it uses the runtime ptr stuff?
<simbergm>
jbjnr: the singleton is not a problem, it's just ugly
<simbergm>
I'll remove the checks for now, the examples are the best documentation anyway and not following them is naughty...
<hkaiser>
simbergm: couldn't we have some 'global' sanity checkers?
<simbergm>
anyway, creating pools at runtime will need cooperation with the runtime to actually start the pools with the threadmanager
<simbergm>
hkaiser: yeah, that was my other suggestion
<simbergm>
wrap get_partitioner.create_pool() in hpx::create_pool which does the check
<simbergm>
or something like that...
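A possible shape for that wrapper, assuming placeholder names throughout (runtime_is_running, get_partitioner and create_pool are stand-ins for illustration, not the real API):

    #include <stdexcept>
    #include <string>

    namespace hpx {
        bool runtime_is_running();                          // placeholder query
        namespace resource {
            struct partitioner {
                void create_pool(std::string const& name);  // placeholder
            };
            partitioner& get_partitioner();                 // placeholder accessor
        }

        // the sanity check lives in the free function, so the partitioner
        // itself no longer needs to see the runtime pointer at all
        inline void create_pool(std::string const& name)
        {
            if (runtime_is_running())
            {
                throw std::runtime_error(
                    "thread pools cannot be created after the runtime has started");
            }
            resource::get_partitioner().create_pool(name);
        }
    }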
<hkaiser>
right
weilewei has quit [Remote host closed the connection]
<hkaiser>
simbergm: when you were converting the qbk files to the new documentation format, did you use some tool?
K-ballo has joined #ste||ar
<simbergm>
hkaiser: no, just an ad-hoc set of sed replacements for most of it, the rest manually
<simbergm>
I may still have it around
<simbergm>
probably not it looks like.. :/
<hkaiser>
simbergm: no worries
<hkaiser>
and thanks for checking
<hkaiser>
heller: yt?
<hkaiser>
heller: where is the extra data item for pointer tracking enabled for output_archives?
<hkaiser>
do you lazily construct those nowadays?
<heller>
hkaiser: should be, yes
<hkaiser>
heller: what about if I want to be able to check whether certain extra data is supported by the archive?
<hkaiser>
i.e. 'does this archive support credit splitting'?
<hkaiser>
heller: ^^
<heller>
hkaiser: the extra data is not a property of the archive, I think
<heller>
hkaiser: they are set by the objects you serialized to the archive
<hkaiser>
not necessarily
<hkaiser>
I might not want to do credit splitting in certain cases
<hkaiser>
heller: ?
<simbergm>
hkaiser: stackless threads is ready to go in right? ci looks very happy :)
<simbergm>
hkaiser: stackless threads is ready to go in right? ci looks very happy :)
<simbergm>
hkaiser: stackless threads is ready to go in right? ci looks very happy :)
<hkaiser>
simbergm: yah, let's go if jbjnr doesn't object, he was mumbling something about this
<K-ballo>
someone's excited about stackless threads...
<hkaiser>
_very_ excited
<hkaiser>
heller: I think the extra data items should be explicitly enabled depending on the context the archive is used in
<heller>
hkaiser: the main motivation behind doing it lazily is to avoid paying the cost if the archive doesn't require it
<hkaiser>
understood
<heller>
hkaiser: I guess this discussion is in the context of checkpointing?
<hkaiser>
yes
<hkaiser>
I want to get back to this
<hkaiser>
it's sitting there for too long
<heller>
so, what should happen if an id_type is supposed to get checkpointed?
<hkaiser>
no credit splitting, at best the id should be saved verbatim
<simbergm>
I think my excitement was amplified somewhere along the way...
<simbergm>
I got the impression jbjnr was fine with it
<hkaiser>
simbergm: so let's do it
<simbergm>
I think we'll go ahead with the cmake branch now as well, hope things just keep working normally
<heller>
hkaiser: 1) Why no credit splitting? Why isn't the component behind the GID kept alive when it is split? 2) Wouldn't a deep copy make more sense here?
<jbjnr>
if the stackless PR doesn't completely change all the threading and scheduler API, then go ahead
<hkaiser>
heller: what's the purpose of checkpointing an id_type - I think it's to save the value of the id; if you want to store the thing it refers to, use a client
<heller>
hkaiser: ok, isn't the point of a checkpoint to be able to restore it later on?
<hkaiser>
sure, but you might want to restore it to the same id
<hkaiser>
even if we do some special handling for id_types during checkpointing, i.e. deep save - how would I know inside the id_type::save() function what to do?
<hkaiser>
heller: ^^
<hkaiser>
this is some information that has to be associated with the archive
<heller>
How about you use the split_gid map after you serialized everything?
<hkaiser>
what should I do there?
<hkaiser>
the credit was split at that point, should I undo the splitting?
<heller>
hkaiser: I don't think so. If you want to restore it verbatim because you want to restore it with the old GID, you need to keep it alive as long as the checkpoint is alive, otherwise you will get into lifetime troubles.
<heller>
with a deep save, you cannot easily reuse the same GID, so you perform a recursive deep save. You need to keep the split_gid map around to avoid duplicates; once you are done, you can undo the splitting
<heller>
the other option would be to attach an extra data for checkpointing, and try_get it when doing the id_type::save/id_type::load operation
<hkaiser>
heller: I agree with the split_map (or similar), but I don't agree with having to undo splitting just because we don't want to have a means of carrying context information in the archive
<heller>
or just disallow checkpointing of id_types...
<hkaiser>
even that needs detection
<heller>
if (split_gids != nullptr) throw ...;
<heller>
;P
<heller>
also, the split_gids should do that automatically and abort if the map hasn't been moved out
aserio has quit [Quit: aserio]
nikunj has joined #ste||ar
nikunj has quit [Remote host closed the connection]
<hkaiser>
heller: I'll use an extract tag type in the archive as extra data, no overhead
<hkaiser>
extra tag type*
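A sketch of what that could look like: an empty tag type stored as archive extra data marks a checkpointing archive, and save() only has to ask whether the tag is present. checkpointing_tag and try_get_extra_data are illustrative names here, not the actual HPX serialization API.

    // empty marker type: its mere presence in the archive's extra data says
    // "this archive is used for checkpointing", so there is no runtime overhead
    struct checkpointing_tag {};

    template <typename Archive, typename Id>
    void save_id(Archive& ar, Id const& id)
    {
        if (ar.template try_get_extra_data<checkpointing_tag>() != nullptr)
        {
            // checkpointing archive: no credit splitting, follow whatever
            // checkpoint policy is decided on (verbatim save, deep save, ...)
        }
        else
        {
            // normal wire serialization: split credits as usual
        }
    }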
<heller>
hkaiser: which policy are you going for now?
<hkaiser>
for now I'll store the gid_type verbatim, we can discuss this further and change it to deep-save later
<heller>
without keeping the component alive?
<hkaiser>
yes
<heller>
ugh
<hkaiser>
checkpoints live longer than components
<hkaiser>
could be stored in a file after all
<heller>
sure, but what's the point of restoring them the?
<heller>
then*
<hkaiser>
restoring the id_type?
<heller>
if the id_type is checkpointed, it will be restored, no?
<hkaiser>
to be able to use the same value down the road - Yorlik was requesting this
<heller>
yes sure
<heller>
but if the checkpoint outlives the component, where is the point
<hkaiser>
ok, ok - I'll do the deep save ;-)
<hkaiser>
same as for clients
<heller>
yes
<heller>
here is a suggestion: checkpoints do indeed keep the components alive. If you don't want that, we already have a policy for that: unmanaged id_types
<heller>
then you have to make sure to clean up your checkpoints after a while
<hkaiser>
no, I don't think this is a good idea
<hkaiser>
the things in the checkpoint are not components anymore
<heller>
right, a deep save is the only real option there
hkaiser has quit [Ping timeout: 250 seconds]
simbergm has quit [Write error: Connection reset by peer]
hkaiser has joined #ste||ar
<hkaiser>
heller: we can't do a deep save :/
<hkaiser>
I'll simply prevent managed id_types from being checkpointed
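Building on the illustrative tag sketched earlier, that check could be as simple as refusing to serialize a managed id when the tag is present; is_managed() is another placeholder name used only for this sketch.

    #include <stdexcept>

    struct checkpointing_tag;   // the marker type sketched earlier

    template <typename Archive, typename Id>
    void save_id_checked(Archive& ar, Id const& id)
    {
        bool const checkpointing =
            ar.template try_get_extra_data<checkpointing_tag>() != nullptr;

        if (checkpointing && id.is_managed())
        {
            throw std::runtime_error(
                "managed id_types cannot be stored in a checkpoint");
        }

        // otherwise serialize the id as usual
    }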
aserio has joined #ste||ar
<heller>
hkaiser: why can't we do a deep save?
<hkaiser>
we don't have the type of the component
<heller>
virtual dispatch through component_base?
<hkaiser>
components don't usually have a virtual base - should we really add one just for this?
weilewei has joined #ste||ar
<hkaiser>
anyways, gotta run...
hkaiser has quit [Client Quit]
rtohid has joined #ste||ar
rori has quit [Ping timeout: 245 seconds]
aserio has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
aserio has joined #ste||ar
Coldblackice_ has quit [Ping timeout: 268 seconds]
aserio has quit [Quit: aserio]
<hkaiser>
heller: btw, it's not a clang issue, gcc shows the same behavior - it has to be something on our end
<heller>
Did you check the cache alignment?
<heller>
Should be easy enough to check with a small testcase
<hkaiser>
not yet, but this is the prime suspect
<hkaiser>
another suspect would be the overaligned allocator
<hkaiser>
c++14 does not support that
<heller>
So you're saying that the extra alignment actually hurts performance?
<hkaiser>
heller: not the alignment itself, I think the allocator that has to ensure alignment might be slower
<heller>
That would suck
<heller>
Can you reproduce this on msvc as well?
<heller>
Would also be interesting what happens if libc++ was used
<hkaiser>
heller: this is libc++
<heller>
Oh, ok
<heller>
Strange that it also happens with gcc then
<hkaiser>
tcmalloc?
diehlpk has joined #ste||ar
<heller>
tcmalloc shouldn't be affected by the c++ std
<heller>
What does the cache line test give you?
diehlpk has quit [Ping timeout: 264 seconds]
<heller>
hkaiser: turns out that neither gcc nor clang hits this :/
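For the cache-line question above, a small standalone testcase along these lines could show whether heap allocations of an over-aligned type really come back cache-line aligned (the 64-byte line size is an assumption); it only probes the compiler and allocator in use, not HPX's own cache-aligned wrappers.

    #include <cstdint>
    #include <cstdio>
    #include <memory>

    constexpr std::size_t cache_line = 64;   // assumed cache-line size

    struct alignas(cache_line) padded_counter
    {
        std::uint64_t value = 0;
    };

    int main()
    {
        bool all_aligned = true;
        for (int i = 0; i != 8; ++i)
        {
            // C++17 aligned new honours the over-alignment; C++14 (or a
            // replacement allocator such as tcmalloc underneath operator new)
            // may silently fall back to the default alignment, which is the
            // suspicion voiced above.
            auto p = std::make_unique<padded_counter>();
            bool const aligned =
                reinterpret_cast<std::uintptr_t>(p.get()) % cache_line == 0;
            std::printf("allocation %d: %s\n", i, aligned ? "aligned" : "NOT aligned");
            all_aligned = all_aligned && aligned;
        }
        return all_aligned ? 0 : 1;
    }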