#ste||ar on 2019-02-25 — irc logs at irclog.cct.lsu.edu

2018-08-26 23:03 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/

01:08 quaz0r has quit [Ping timeout: 255 seconds]

02:06 quaz0r has joined #ste||ar

03:11 hkaiser has quit [Quit: bye]

03:28 K-ballo has quit [Remote host closed the connection]

04:14 eschnett_ has quit [Quit: eschnett_]

05:15 <Yorlik> Is there a hash functor for id_types somewhere i could use for an unoprdered map key?

05:15 <Yorlik> I would like to use id_types as kleys in an unordered map.

05:17 <heller_> they should support it out of the box

05:19 <heller_> or do they?

05:19 <Yorlik> I can't find a hash functor anywhere and have issues using them

05:19 <heller_> https://github.com/STEllAR-GROUP/hpx/blob/master/hpx/runtime/naming/name.hpp#L999

05:19 <heller_> only gid_type, as it seems

05:20 <heller_> since id_type is a smart pointer to gid_type, this should be straight forward

05:20 <Yorlik> And how would I hash t he gid type?

05:20 <Yorlik> And how would I hash the gid type?

05:20 <Yorlik> woops

05:20 <Yorlik> I mean which members shouzld I feed into the hash function?

05:20 <Yorlik> Such, that I have the full ID

05:21 <Yorlik> Also I probably can't use a default hash function since the ID is longer than 64 bit I think

05:22 <Yorlik> Just wondering - first time I have to do this

05:22 <Yorlik> I'm kinda stuck at the question marks: size_t operator()( const hpx::id_type &x const {

05:22 <Yorlik> return hash<long long> ( long long )(x.get_gid().???? );

05:22 <Yorlik> }

05:22 <Yorlik> But prpbably the <long long> already is a problem

05:23 <heller_> return hash<hpx::naming::gid_type>()(x.get_gid());

05:24 <Yorlik> It doesn't like the type parameter

05:24 <Yorlik> <hpx::naming::gid_type> gets wiggled

05:25 <heller_> what happens?

05:25 <Yorlik> "type name is not allowed"

05:26 <heller_> try to fully qualify the name

05:26 <heller_> std::hash<...

05:26 <Yorlik> Oh man .. lol

05:26 <Yorlik> Thanks !

05:30 <Yorlik> It compiled ! :)

05:30 <Yorlik> Functional checks later ... :)

07:02 <jbjnr__> simbergm: any idea which tests might be causing time outs on cdash for the moody camel branch

07:02 <jbjnr__> what timeout do you have for ctest, 30s 1m?

07:50 <simbergm> jbjnr__: 200 seconds it seems like

07:50 <simbergm> you'll probably need to run the tests manually to find out

07:51 nikunj has quit [Remote host closed the connection]

07:51 <simbergm> jbjnr__: not sure if you saw this yet: http://cdash.cscs.ch/viewBuildError.php?buildid=42734 ?

07:51 <simbergm> short story: one of the recent PRs broke the guided pool executor on gcc (only gcc, not clang)

07:51 <jbjnr__> hmm. Everything passes fine on my local setup using moody camel

07:52 <simbergm> would you have time to check what it's supposed to do there?

07:52 <simbergm> it doesn't find an overload

07:52 <simbergm> hmm

07:52 <simbergm> I'll try running the tests on that branch too, see if I find something

07:53 <simbergm> K-ballo might try to fix it if he knows what it's supposed to call

07:53 <jbjnr__> I'll look at the guided pool exec.

07:53 <jbjnr__> might not be able to do it till later though. got some broken stuff to fix here first.

07:53 <simbergm> sure, that's ok

07:54 <jbjnr__> been working on my scheduler stuff and redoing all the queues etc.

07:55 nikunj has joined #ste||ar

08:08 nikunj has quit [Remote host closed the connection]

08:08 nikunj has joined #ste||ar

08:08 nikunj has quit [Remote host closed the connection]

08:11 nikunj has joined #ste||ar

08:39 <heller_> we should probably work with a timeout of 60 seconds

08:40 <jbjnr__> yup. Tests should be less than that.

08:40 <jbjnr__> (There are a couple that take time in debug mode, but maybe we should make them a bit shorter.)

08:45 <heller_> yeah, or not run those long running tests in debug mode to begin with

08:46 <heller_> looks like the unit tests now run cleanly with ubsan on my branch!

08:46 <heller_> wee

08:46 <heller_> ok, defense practice session now

08:50 <heller_> except for tests.unit.resource.suspend_pool :(

08:50 <heller_> oh no, it passed!

08:53 <heller_> 100% tests passed, 0 tests failed out of 471

08:53 <heller_> Total Test time (real) = 804.70 sec

08:53 <heller_> heller@g2:~/programming/hpx/build/debug_ubsan$

08:53 <heller_> yes!

09:48 Abhishek09 has joined #ste||ar

09:52 nikunj has quit [Quit: Leaving]

10:32 Abhishek09 has quit [Quit: Page closed]

11:00 <jbjnr__> heller_: \o/

11:00 <jbjnr__> (For your tests)

11:21 pmikolajczyk41 has joined #ste||ar

11:28 <jbjnr__> does anyone know what was changed that broke the guided pool executor stuff? I just had a quick look, but knowing what was changed might be useful

11:29 <jbjnr__> seems to be dataflow related.

11:30 pmikolajczyk41 has quit [Ping timeout: 256 seconds]

11:42 <heller_> jbjnr__: could be some missing includes

11:43 K-ballo has joined #ste||ar

12:16 <jbjnr__> seems to be a tuple unwrapping issue. I expct KBallo fixed something and my workarounds are now causing a problem

12:16 <jbjnr__> I know where to look

12:17 <K-ballo> nope, I didn't fix anything

12:17 <K-ballo> let me point you to the line that causes it

12:18 <K-ballo> https://github.com/STEllAR-GROUP/hpx/blob/master/hpx/lcos/dataflow.hpp#L283

12:37 <jbjnr__> It seems that my overloads were called with a,b,c, but now I get tuple(a,b,c) when dataflow passes things on to the executor internals

12:37 <jbjnr__> I need to recompile with template backtrace limit unset

12:39 <jbjnr__> probably one of my specializations just foesn't get picked any more and so the tuple stuff is not being handled as it used to

12:40 <K-ballo> my suspicion is some substitution going wrong on an overload that is not intended to be called, but could not figure out which ones are intended

12:41 <K-ballo> although to be fair, that is usually my first suspicion for everything that involves overloads behaving differently for different compilers

12:48 <jbjnr__> K-ballo: do you know which merge triggered the new errors?

12:48 <K-ballo> yeah

12:49 <K-ballo> https://github.com/STEllAR-GROUP/hpx/commit/7c2a010cb38bfa9ee6b05e354b78d305b513757c#diff-040de9e61eef02088de58b48a421a1e5L276

12:50 <jbjnr__> thanks

12:51 <K-ballo> is just that change in that one function, the other changes don't affect it

12:54 <jbjnr__> I see. It breaks my specialization in the Guided pool exec. Ok thnks. I'll take another look later and fix it then.

12:57 <jbjnr__> FYI: I think it's this bit here https://github.com/STEllAR-GROUP/hpx/blob/31fc341031ca0d4c2ccd3e6f5f2c90520e1fefe0/hpx/runtime/threads/executors/guided_pool_executor.hpp#L396-L408

12:58 hkaiser has joined #ste||ar

12:58 <K-ballo> I see

12:58 <jbjnr__> so it drops through to the other async and doesn't do the unwrapping as before.

12:58 <jbjnr__> I'll experiment later and hopfully work around it

13:00 <K-ballo> how does dataflow supporting scalars affect this

13:01 <jbjnr__> not sure, probably isn't handled by my stuff.

13:01 <jbjnr__> I would need to add a test of that

13:02 <jbjnr__> in here I'd put it https://github.com/STEllAR-GROUP/hpx/blob/d26d6d7513d842730342e41af8d291ca4ec6e965/examples/resource_partitioner/guided_pool_test.cpp#L215

13:02 <jbjnr__> or just below rather

13:03 <K-ballo> oh but this is not about arbitrary dataflow calls anyways, is it? just the internal ones related to executors?

13:03 <jbjnr__> (the other test is called async_customization and has more stuff that might be appropriate)

13:03 <hkaiser> jbjnr__: there have been changes to the default parallel executor, not sure how that affects yours

13:04 <hkaiser> shouldn't have any effect, though

13:04 <jbjnr__> K-ballo: yeah. It would only be a problem if dataflow was used with my executor and had some scalrs mixed with futures. I will try it once I fix this

13:20 Abhishek09 has joined #ste||ar

13:49 <Abhishek09> hello

13:53 Abhishek09 has quit [Ping timeout: 256 seconds]

14:28 <jbjnr__> hkaiser: Just out of curiosity ... "there have been changes to the default parallel executor, not sure how that affects yours" - aren't we supposed to not merge PRs that break things? (Or has policy changed)

14:29 <jbjnr__> (and please note the heavy use of sarcasm - I'm just surpised that it wasn't considered)

14:33 Abhishek09 has joined #ste||ar

14:33 daissgr has joined #ste||ar

14:33 <hkaiser> jbjnr__: I don't think this PR has broken anything - you were asking what has changed and I tried to be helpful in telling you

14:35 <jbjnr__> (the guided pool stuff broke). I don't really care much, but am just surprised by the nonchalence

14:38 <hkaiser> jbjnr__: I doubt the guided pool stuff broke because of this particular PR

14:38 <jbjnr__> No it was the other one.

14:38 <hkaiser> but if it did, I apologize, the tests I looked at were all green - so it couldn't have broken everywhere

14:39 <hkaiser> ok, if it was the other one - why do you accuse me of nonchalance?

14:41 <jbjnr__> Sorry. I didn't mean to accuse you of and wrongdoing.

14:41 aserio has joined #ste||ar

14:41 <jbjnr__> I just thought things seem very casual round here at the moment

14:41 <jbjnr__> ^of any^

14:41 <jbjnr__> not ^of and^

14:45 hkaiser has quit [Quit: bye]

14:50 <Abhishek09> parsa:how does circleci works in phylanx?

15:01 eschnett_ has joined #ste||ar

15:11 Abhishek09 has quit [Ping timeout: 256 seconds]

15:12 Abhishek09 has joined #ste||ar

15:12 <Abhishek09> hey parsa: are you here?

15:17 <zao> Abhishek09: A lot of CI systems are controlled by files in the repositories for the software to test.

15:17 <zao> How so?

15:18 <zao> If you're curious, you might want to look up some quick guides describing how to get started with a CI environment and check that against what's configured in the repos.

15:21 <Abhishek09> i think it is a server which run cibuild.

15:22 <zao> Could you clarify what your question is actually about?

15:23 <zao> I interpreted it as you asking how Phylanx is tested with CircleCI and other CI environments.

15:23 <zao> Is the actual question about something else, like what CircleCI is at all?

15:29 <Abhishek09> zao:you mean to say it runs phylanx for testing test.But I thought it can be used for building:

15:30 <zao> Abhishek09: The purposes of CI systems are many.

15:30 <zao> They can be used to test if software compiles. They can be used to run test suites. They can be used to prepare release packages.

15:30 <zao> Anything that falls under "continuous integration".

15:31 <zao> You can read the configuration files in the phylanx repository to see what that particular project uses CircleCI and Appveyor for.

15:32 <Abhishek09> what about travis CI?

15:32 <zao> What about it?

15:32 <Abhishek09> phylanx doesn't uses.

15:33 <zao> There are many different CI services. HPX used to use Travis before moving to CircleCI.

15:33 <Abhishek09> is circleci is better than travis ci?

15:33 <zao> Don't know.

15:33 <zao> Travis recently fired a lot of senior staff, so they might be getting worse now :D

15:37 <K-ballo> I think we switched out of travis because our builds take just too long and there's a time limit.. but that was years ago

15:41 bita_ has joined #ste||ar

15:42 Abhishek09 has quit [Ping timeout: 256 seconds]

15:48 bita_ has quit [Quit: Leaving]

15:51 <heller_> would have been interesting what the actual question was

16:12 <zao> So say we all ;)

16:29 <zao> heller_: I tried at least :)

16:45 Abhishek09 has joined #ste||ar

16:56 aserio1 has joined #ste||ar

16:59 aserio1 has quit [Client Quit]

17:00 aserio has quit [Ping timeout: 268 seconds]

17:00 <diehlpk_work> Abhishek09, diehlpk_works:I have found blaze on two places -> Use the one from bitbucket

17:01 <diehlpk_work> As far as I know this is the main repo and all work goes there

17:02 <diehlpk_work> Abhishek09, diehlpk_work:is it necessary to install hpx or i can use lib files -> I do not understand this question

17:04 daissgr has quit [Ping timeout: 258 seconds]

17:05 hkaiser has joined #ste||ar

17:06 daissgr has joined #ste||ar

17:09 <parsa_> Abhishek09: what do you mean by how does phylanx circle work? it builds and tests phylanx. the commands it runs are in .circleci/config.yml and you can see the builds in https://circleci.com/gh/STEllAR-GROUP/phylanx

17:10 parsa_ is now known as parsa

17:19 aserio has joined #ste||ar

17:19 david_pfander has quit [Ping timeout: 258 seconds]

17:20 <Abhishek09> deihlpk_works: i consider the package for fedora platform.

17:21 <Abhishek09> Most of us are on this community using fedora.

17:26 <Abhishek09> parsa:can i use cibuildwheel on circleci?

17:26 diehlpk_work has quit [Remote host closed the connection]

17:28 <parsa> Abhishek09: what's the point? circleci doesn't support windows, we don't pay circleci to get the mac option, and we only use ubuntu to test on circleci...

17:28 <parsa> don't worry about circleci... build one wheel for phylanx on any platform you can and demonstrate that it works

17:29 <parsa> locally, on your own machine

17:32 <Abhishek09> parsa:it currently works on Travis CI and CircleCI to build Linux and Mac wheels(PAID), and Appveyor to build Windows wheels.

17:32 <parsa> Abhishek09: do you have a wheel file you have made for phylanx?

17:33 <Abhishek09> i reuse that file config.yml to build the wheel

17:33 <parsa> look, travis, circle ci, azure pipelines, gitlab, etc only run commands on their machines. you should be able to build on your own machine

17:34 <parsa> circleci and the like are used for continuous integrations (i.e. making sure everything still works for every change). you don't need to worry about that now

17:35 <Abhishek09> But it is easy to create .whl on fedora.

17:35 <Abhishek09> i use cibuildwheel on circleci

17:35 <parsa> then give us that wheel file and let us see what you've done

17:38 <Abhishek09> No, i have created it.It is part of Gsoc.Now only have think how would it implemented and how it works

17:38 <zao> *haven't ?

17:40 <Abhishek09> we would create when coding begins.

17:45 <zao> Feels like something you might want to test out beforehand maybe.

17:45 <zao> I would, just to be more certain of my proposal if I did one.

17:45 <zao> When are they due, by the way?

17:48 diehlpk_work has joined #ste||ar

17:48 <hkaiser> heller_: yt?

17:48 <parsa> Abhishek09: do you even have a proposal?

17:49 <Abhishek09> zao:you also applying for Gsoc:you working on which task?

17:50 <parsa> :)))))

17:50 <zao> Abhishek09: No, I'm not a student and not part of the HPX/Phylanx GSoC.

17:50 <zao> I hang around here giving good/bad advice and like trying to build HPX.

17:51 <zao> (I'm a systems engineer at a HPC site)

17:51 <zao> My job is literally installing silly scientific Python software for a living :D

17:51 <diehlpk_work> Abhishek09, I think parsa is right, it would help to write down what you want to do

17:52 <Abhishek09> parsa: Yes i roughly created proposal.I will upload when GSoC it start to accept So we people can review my proposal .

17:53 <diehlpk_work> Abhishek09, Normally, you share the proposal with your mentors before uploading

17:53 <diehlpk_work> We can see your proposal only after submission

17:53 <diehlpk_work> At least in the last years, students shared a google doc with their mentors and they made remarks

17:55 <Abhishek09> GSoc offer review option for mentor for proposals.So you people can suggest changes.

18:01 aserio has quit [Ping timeout: 264 seconds]

18:05 <Abhishek09> deehlpk_works:Is sharing of proposal is safe,confidential.

18:19 <heller_> As safe and confidential as you want it to be

18:20 <heller_> The organizations haven't even been announced yet

18:20 daissgr1 has joined #ste||ar

18:27 <hkaiser> just received an email that we're not part of GSoC this year

18:34 aserio has joined #ste||ar

18:37 <Abhishek09> this will be announced on 27th

18:39 <Abhishek09> How would get that email now?

18:39 <zao> Oh dear.

18:40 <K-ballo> our cool down year?

18:41 <Abhishek09> diehlpk_work,parsa:is this msg is true?

18:43 <hkaiser> Abhishek09: it is official

18:45 <zao> More time to work on HPX then, yay!

18:46 <zao> parsa: Heh, tried making my own little Python package with a C extension. Turns out that there doesn't seem to be any naming scheme for distro-specific wheels, just some nondescript "linux_x86_64" platform tag or the manylinuxes.

18:48 <zao> I guess that nothing would stop you from having your own wheelhouse with some sort of tagging, I guess.

18:48 bibek has joined #ste||ar

18:48 <zao> I think that Compute Canada builds all their software as wheels and put them on CVMFS for their sites.

18:50 <Abhishek09> deihlpk_works,parsa:are you here?

18:51 <parsa> zao: this is where the doc is: https://www.python.org/dev/peps/pep-0425/ i've been assuming you can be more specific

18:51 <parsa> at least say for sure at least in case of pandas, it downloads a prebuilt binary for debian and builds from source for alpine

18:51 <parsa> ***i can say at least in case of pandas

18:53 <zao> Those seem to be manylinux1, sadly.

18:53 <zao> I have not managed to find anything more distro-specific than manylinux1 and manylinux2010,.

18:54 <parsa> i don't know, it could be that people have something elaborate going on their setup.py

18:54 <zao> And it doesn't seem like pypi allows uploads of the naked linux_x86_64/i686 tags at all, only manylinux:en.

18:54 <zao> I would reckon that alpine doesn't conform to manylinux:es at all, considering the lack of glibc.

18:55 <zao> (Alpine runs musl libc, as you probably know)_

18:55 <parsa> yeah, you can get glibc to work on it but its hacky

18:56 <zao> Ah, it's even mentioned in PEP 513, probably explicitly declares itself to be incompatible.

18:59 <Abhishek09> i not able to digest this sad news that ste||ar has not selected for Gsoc.

18:59 <parsa> zao: i don't know if we can provide an hpx build independent of the linux distro. we're pickier than most packages on our dependencies. we probably have to always build everything we need then

18:59 eschnett_ has quit [Quit: eschnett_]

19:00 <zao> Abhishek09: A bit unfortunate. I hope that you can find some other project that suits you.

19:01 <Abhishek09> But i have almost done this project

19:02 <zao> parsa: At least our deps are fairly simple to build privately.

19:02 eschnett_ has joined #ste||ar

19:03 <Abhishek09> parsa,deihlpk_works:they have never confirmed me

19:03 <Abhishek09> i want to listen from them.

19:04 <zao> Abhishek09: You'll find out in public on February 26 12:00 UTC then as the list is published.

19:10 <K-ballo> Abhishek09: if it's any consolation, if you truly were almost done with the project, we wouldn't have accepted you without at least increasing the scope of the project first

19:11 <zao> In any way, the various attempts to make a manylinux2010 image seem to mostly aim for devtoolset-7, which is GCC 7.3.1.

19:11 <zao> They're a bit stuck on that RH doesn't have any newer toolsets for i686, so manylinux2010 would have either no i686 flavour or 4.8.2 as advertised in the PEP.

19:12 <zao> For future work, manylinux2010 or a newer equivalent would be excellent if the aim is to get packages onto pypi.

19:12 nikunj has joined #ste||ar

19:13 <zao> I'm not sure if you could have custom platform tags if you ran your own wheelhouse and if they would be honored by installers. If you could, that'd be an alternative I guess if you had the resources to build on Popular Distros.

19:14 <zao> All in all, situation hazy, hopefully I haven't confused it further with this research.

19:16 <parsa> zao: no no no, it is helpful. thanks for investigating!

19:18 <diehlpk_work> Abhishek09, Yes, this message is true

19:18 <diehlpk_work> We got not accepted this year

19:22 Abhishek09 has quit [Quit: Page closed]

19:23 <zao> Ah, ComputeCanada uses separate wheelhouses and generic names like -cp27-cp27mu-linux_x86_64.whl

19:39 nikunj has quit [Quit: Leaving]

19:45 hkaiser has quit [Quit: bye]

19:56 <heller_> lol

20:24 <heller_> what happened, why is migrate_component failing again :(?

20:26 <K-ballo> isn't that the one that was failing intermitently?

20:30 <heller_> yeah, and now it is failing consistently for the MPI parcelport

20:30 <heller_> we fixed it for good one week ago

20:31 <heller_> at least that was the idea ;)

20:31 <heller_> and all of a sudden, after adding asserts, it seems to work again...

20:32 <heller_> because I am running the TCP parcelport...

20:32 <K-ballo> consistent is good

20:33 <K-ballo> time sensitive is so bad

20:34 <heller_> sure

20:34 <heller_> just wondering that noone noticed

20:41 <heller_> simbergm: we need a more versatile docker setup... cramming everything into one image isn't good

20:50 daissgr1 has quit [Read error: Connection reset by peer]

21:05 <Yorlik> Should this compile? I thought ele would be invalid at the second push_back, but it compiled: buffers[0]->push_back(std_move(ele));

21:05 <Yorlik> buffers[1]->push_back(std_move(ele));

21:07 <K-ballo> the type of `ele` won't change after a push_back

21:07 <heller_> just because it is invalid, doesn't mean it doesn't compile

21:08 <K-ballo> there is no way whatsoever in which those two separate expressions are distinguishable at compile time

21:08 <K-ballo> they either both compile or neither does

21:08 <heller_> the code above looks perfectly fine (without further knowledge of the code)

21:08 <heller_> there's nothing inherently wrong with it

21:09 <K-ballo> that said, fluid typing is sweet!

21:09 <Yorlik> I thought ele would be sortof invalid after being moved away.

21:09 <heller_> BUT, use after move is seldomly something you want to do do (unless you assign a new value to it or somesuch)

21:10 <heller_> sort of, probably, but in a usable state

21:10 <K-ballo> ah, the destructive moves paradigm.. we don't have that in C++

21:10 <Yorlik> Std_move to a vector should create a copy anyways, right? After all internally a vector is based on an aligned array.

21:10 <heller_> depends on the object, really

21:11 <Yorlik> Its a simple struct

21:11 <heller_> a move, in C++, primarly is not about optimization, but about semantics

21:11 <Yorlik> I thought it was about object ownership

21:11 <heller_> sure, which is semantics

21:12 <Yorlik> Since I gave ownership to the first vector, the second shouldn't have a right to receive the object as i see it. that's why I'm so puzzled

21:12 <K-ballo> that assumes a destructive move model, which we don't have

21:12 <heller_> well, the second line moves a different object

21:12 <K-ballo> you transfered ownership of the resources within the object, not the object itself

21:12 <K-ballo> the object itself remains in a valid (but often unspecified) state

21:13 <Yorlik> So - in the case of a pd (simple struct) it just copies?

21:13 <heller_> probably, yes

21:13 <Yorlik> s/pd/pod/g

21:14 <Yorlik> Could I create the object directly in the slot of the vector, to avoid copying the data?

21:14 <heller_> yes, there is emplace_back

21:14 <heller_> which constructs the object in the place it is supposed to end up in

21:14 <Yorlik> Nice

21:14 <Yorlik> I shall check that out

21:15 <heller_> (could also call the move constructor)

21:16 <Yorlik> Basically I have a vecore of elements, which are indexed in an addition unorderd_map by their associated id_type

21:16 <Yorlik> The id type belongs to the mopther object, the entity, not the component which is stored in the vector

21:17 <heller_> Yorlik: FWIW, most cases of undefined behavior in C++ do not require diagnostics from the compiler

21:17 <Yorlik> I see: I'm in a minefield and nead to tread carefully ;)

21:17 <heller_> mostly you'll be fine

21:17 <Yorlik> BTW

21:18 <heller_> remember though: premature optimization is the root of all evil :P

21:18 <Yorlik> How would I migrate an object, that has custom dependencies which also need migration

21:18 <Yorlik> heller_: you wanna see this: https://www.youtube.com/watch?v=rX0ItVEVjHc

21:19 <Yorlik> After that we talk about premature yadda yaddas ;)

21:20 <Yorlik> The structure I am building here is the tightest loop in the entire system I'm building - the heart of the entity component system design

21:20 <heller_> Yorlik: I know this talk, and my point is still valid :P

21:20 <Yorlik> This loop is where I really should optimize where possible

21:21 <Yorlik> I agree for other situations

21:21 <K-ballo> lol, that talk

21:21 <K-ballo> that was fun

21:21 <Yorlik> But these component vectors will have to deal with tens of thousands of updates per frame

21:21 <Yorlik> I love that talk

21:22 <Yorlik> My fav quote was " If you don't understand the data you don't understand the problem"

21:22 <Yorlik> And: "Everyone who thinks premature optimization can leave the room now" (When talking about L2 cache misses)

21:24 <K-ballo> yeah, the content was fine, but the speaker was completely ineffective

21:25 <Yorlik> Let's say he was pretty excited about the matter.

21:25 <Yorlik> The content though was really good, I think.

21:30 <Yorlik> Back to the question before I kinda lost:

21:30 <Yorlik> My entities are hpx components

21:30 <Yorlik> they hold an int65 which i use as a bitfield to indicate which components the entity has

21:30 <Yorlik> int 64 ofc :D

21:31 <Yorlik> The entities live in vectors specialized for their type

21:31 <Yorlik> And they are additionally indexed in an unordered mnap by id_type of the mother entity

21:32 <Yorlik> If I want to migrate that entity later, the system has no way to know, that the components also need to migrate

21:32 <Yorlik> How wouzld you approach that ?

21:32 <Yorlik> A custom serializer?

21:32 <Yorlik> I was thinking about an entity parcel package strucxture

21:33 <Yorlik> it would hold the actual data and not the references. But thats ineffective ofc

21:33 <Yorlik> Because ti implies copying

21:33 <Yorlik> Essentially it's about migration of a group of related objects

21:33 <Yorlik> And not all of them are components, some are just POD

21:52 <heller_> you'll always need a custom serializer

21:52 <heller_> not all copies are bad

21:55 <heller_> K-ballo: FWIW, the migrate component test failure seems to be a problem with the MPI implementation, I can reproduce it with the same version, and it goes away with a newer one. The error itself is strange as well, without a clear origin (that is, the data received never gets sent anywhere)

21:55 <heller_> not sure why it only triggers with that particular test though

21:56 hkaiser has joined #ste||ar

21:57 * heller_ hates MPI

21:57 <zao> Yorlik: If you want a world where objects that have been moved are invalid, you want to program in Rust :D

21:58 <Yorlik> We tried Rust and deliberately decided against it ;)

21:58 <zao> Chickens! :)

21:58 <heller_> zao: how long did it take you to get your first non trivial program compiled?

21:59 <zao> It was way easier this time around compared to the first time I tried to pick it up a few years ago.

21:59 <heller_> had quite long conversations with rust heads last week...

21:59 <zao> I still trip myself up, of course.

21:59 <heller_> talking about tokyo and rayon

21:59 <hkaiser> heller_: short question

21:59 <heller_> hkaiser: 42

22:00 <zao> The state of tokio and futures-preview for async/await is a right mess.

22:00 <hkaiser> that's an answer to a long question

22:00 <zao> I'm going to let that solve itself :)

22:00 <heller_> hkaiser: worth a shot...

22:00 <hkaiser> heller_: can't find it :/

22:00 eschnett_ has quit [Quit: eschnett_]

22:00 <heller_> hkaiser: the question?

22:00 <hkaiser> the link

22:00 <heller_> he

22:01 <heller_> btw: I have a undefined behavior sanitizer clean build now!

22:01 <hkaiser> heller_: saw that, yay!

22:01 <hkaiser> merge it!

22:01 <heller_> someone just broke migrate component

22:01 <heller_> I need to fix that first ;)

22:02 <heller_> and of course the guided pool mess that bubbled up

22:02 <hkaiser> on the gitlab builder - there was a strange MPI error while executing the migration test

22:02 <heller_> *nod*

22:02 <hkaiser> did you see that?

22:02 <heller_> yes

22:02 <hkaiser> ok

22:02 <heller_> goes away when updating the MPI version

22:02 <hkaiser> ok, good

22:02 <heller_> tried to hunt it down today

22:03 <heller_> totally unclear where it comes from. The gist of the story: Someone receives a message header, but the actual size of the MPI message is zero

22:04 <heller_> thus, the entire header is filled with a -1 pattern, leading to that failure

22:05 <heller_> if I just ignore that case, I get a hang

22:05 <heller_> the release build seems to be not affected by that glitch, the debugger however, clearly shows that no data was actually received

22:05 <hkaiser> I saw it in both, release and debug

22:06 <heller_> oh really?

22:06 <hkaiser> release just fails

22:06 <hkaiser> debug raises an assert

22:06 <heller_> ah right ...

22:06 <heller_> release times out

22:06 <hkaiser> heller_: here: https://gitlab.com/stellar-group/hpx/pipelines/48885789

22:06 <hkaiser> yah

22:12 <heller_> hkaiser: now, I have a question, what was my biggest to HPX in the last 6 years

22:12 <heller_> *nod*

22:12 <heller_> I'll see what I can do

22:12 <heller_> what makes me uneasy is that it only fails for the migrate component test

22:13 <K-ballo> your biggest what?

22:13 <heller_> shit

22:13 <heller_> biggest contribution

22:13 <zao> :D

22:13 <hkaiser> heller_: yah

22:13 <heller_> hkaiser: now, I have a question, what was my biggest contribution to HPX in the last 6 years

22:13 <heller_> I guess, typos qualify

22:13 <hkaiser> heller_: what would you think your biggest contribution was?

22:13 <heller_> I need some punchline for my talk on htursday ;)

22:14 <heller_> I have no idea

22:14 <hkaiser> heller_: then you're ready to defend ;-)

22:14 <heller_> doesn't help :P

22:37 <heller_> hkaiser: see pm please

22:49 <zao> `{what}: assertion 'tag() != -1' failed: HPX(assertion_failure)`

22:50 <zao> Rather nice, I pulled your docker container and it also blows up on the `tests.unit.components.distributed.mpi.migrate_component` test for the commit that hkaiser linked there.

22:50 <hkaiser> that's the one

22:50 <zao> https://gist.github.com/zao/b48767904fde5d8a5fdc9cc9ffe49a29

22:51 <zao> Figured I'd try on my Ryzen build machine, see if it happens here too.

22:53 <zao> A bummer I don't have any build automation yet.

23:09 aserio has quit [Quit: aserio]

23:28 <heller_> zao: yes, the docker image has the problematic MPI version

23:32 <zao> Very curious, should try with my EB tree later outside the container, got a ton of MPI versions there in different toolchains.

23:32 <zao> But now, sleep :)

23:55 <zao> Worth noting, we have had some OpenMPI/impi versions at work that have just not worked right with some codes. Can’t recall which now but we build with other toolchains every now and then.