hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC2018: https://wp.me/p4pxJf-k1
anushi has quit [Ping timeout: 260 seconds]
anushi has joined #ste||ar
eschnett_ has joined #ste||ar
eschnett has quit [Ping timeout: 240 seconds]
anushi has quit [Ping timeout: 240 seconds]
diehlpk has joined #ste||ar
diehlpk has quit [Ping timeout: 240 seconds]
hkaiser has quit [Quit: bye]
parsa[w] has quit [Read error: Connection reset by peer]
parsa[w] has joined #ste||ar
anushi has joined #ste||ar
K-ballo has quit [Quit: K-ballo]
nanashi55 has quit [Ping timeout: 240 seconds]
nanashi55 has joined #ste||ar
nikunj has joined #ste||ar
<jbjnr> on power I've got problems with coroutines. I tried with GENERIC_COROUTINES ON and OFF but both give errors. I seem to recall that we had to use an older boost at one time (but I thought hk fixed it). Does anyone know if it's fixable with another option, or boost version etc. heller___ ?
anushi has quit [Ping timeout: 240 seconds]
anushi has joined #ste||ar
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
anushi has quit [Ping timeout: 240 seconds]
anushi has joined #ste||ar
david_pfander has joined #ste||ar
anushi has quit [Ping timeout: 240 seconds]
anushi has joined #ste||ar
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
<github> [hpx] msimberg pushed 2 new commits to master: https://git.io/fNkt9
<github> hpx/master ddbd108 Mikael Simberg: Fix some more c++11 build problems
<github> hpx/master ef04f40 Mikael Simberg: Merge pull request #3372 from msimberg/fix-c++11...
<jakub_golinowski> M-ms - for me all the perf tests passed both for the opencv from master (pthreads) and opencv from hpx_backend branch
<jakub_golinowski> among them the test that was requested by the opencv ppl - but the results are far from satisfying - in all cases opencv with the hpx backend is considerably slower; sometimes the overall test run time for opencv with the hpx backend is several times longer than the opencv with pthreads runtime
<M-ms> jakub_golinowski: do you mean (opencv master branch with pthreads backend) and (opencv hpx_backend branch with pthreads backend)?
<M-ms> jakub_golinowski: mmh, that's a problem
<jakub_golinowski> ah, no sorry I mean: (1) opencv master with pthreads backend vs. (2) opencv hpx_backend branch with hpx backend
<jbjnr> jakub_golinowski: the time taken for the hpx runtime to start up is quite long and can contribute to longer tests. I hope the test time reported for the graphs is only the computation part and does not include the startup.
<jakub_golinowski> jbjnr, in the mandelbrot benchmark it is only the computation time. However, note that in the case of the start-stop backend the runtime needs to be started after the call to parallel_for_() and therefore is included in the computation time (but this is a fair measurement, since in the start-stop version we always have to start and stop the runtime whenever we call parallel_for)
<jbjnr> ok
<jakub_golinowski> M-ms, you said something that employing HPX might somehow disrupt OpenCL - maybe this is the reason?
<M-ms> jakub_golinowski: yeah, may be but I don't know what, it just seemed to be a commonality for some of the failing tests
<M-ms> so we know now that HPX is a bit slower but not by much in your mandelbrot benchmark
<M-ms> but something else is happening in the perf tests that's making it really slow
<M-ms> so you could try profiling one of the perf tests (also filtered to just one test) and see if there are any hints
<M-ms> although before that, all perf tests pass now? so what about unit tests? any change there or still some failing?
<jakub_golinowski> M-ms, do you have something exact in mind when saying profiling? Use some special tool?
eschnett_ has quit [Quit: eschnett_]
<M-ms> jakub_golinowski: yeah, my favourite is perf because all you need to do is "perf my_program" (and build with debug symbols)
<M-ms> it's not fancy but it gives something useful quickly
<heller___> jbjnr: what are the problems?
<heller___> jbjnr: compiler? linker? runtime?
<jbjnr> , funp_(&trampoline<Functor>)
<jbjnr> wtf?
<jbjnr> users/biddisco/src/hpx/hpx/runtime/threads/coroutines/detail/context_generic_context.hpp:209:35: error: use of undeclared identifier 'Functor'
<jbjnr> and
<jbjnr> users/biddisco/src/hpx/hpx/runtime/threads/coroutines/detail/coroutine_impl.hpp:62:17: error: use of class template 'context_base' requires template arguments
<jbjnr> typedef context_base super_type;
<jbjnr> context type problems
<jbjnr> boost-1.65.1
<jbjnr> too new?
<jakub_golinowski> M-ms, as for the unit tests I think there are still some failing; I am rerunning them now to make sure no new segfaults pop up.
<jakub_golinowski> M-ms, do you want me to push the test results to the repo?
<jbjnr> I didn't spend any time looking at the code yet cos I have other things to work on too, but I do recall us having issues with boost context on power and maybe I need to do something. But I can't remember, so I thought I'd ask instead before spending time on it
<jbjnr> heller___: ^^^^
<jakub_golinowski> M-ms, thank you for the reference to profiler - I will familiarize myself with it
<heller___> jbjnr: strange, I was compiling a more or less recent HPX with generic context coroutines for ARM the other day
<heller___> using boost 1.6 something
<jbjnr> PowerPC
<heller___> boost 1.67
<heller___> sure
<heller___> let me check
jakub_golinowski has quit [Ping timeout: 256 seconds]
quaz0r has quit [Ping timeout: 240 seconds]
quaz0r has joined #ste||ar
jakub_golinowski has joined #ste||ar
anushi has quit [Ping timeout: 265 seconds]
anushi has joined #ste||ar
<M-ms> jakub_golinowski: nah, no need to push the test results
<jakub_golinowski> M-ms, so after filtering out a few tests the accuracy tests for pthreads from master vs. tests for hpx from hpx_backend have the same results in terms of fail/pass
<jakub_golinowski> M-ms, and the result is that all the tests pass - I realized I was not correctly passing the path to the opencv_extra/testdata and this was the reason for tests that were failing yesterday
<M-ms> jakub_golinowski: ok, I assume the ones you're filtering out are still only failing with the hpx backend? how many are we roughly talking about?
<jakub_golinowski> M-ms, sth like 10 tests
<M-ms> hmm, that actually sounds pretty good
<jakub_golinowski> M-ms, I remember mostly they were producing the segfault with the error of sth like longjump sth sth
<M-ms> so some of these are the ocl tests?
<jakub_golinowski> yes
<M-ms> ah, the ones we saw in the beginning
<M-ms> so non-core tests are mostly good then
<jakub_golinowski> M-ms, yes and some other tests that at first sight do not seem to have a lot in common (at least by looking at their names)
<jakub_golinowski> I modified the run_tests script a bit and will push it in a second
<M-ms> ok, thanks
<jakub_golinowski> there is the full list of failing tests - or to be more specific, tests that I have seen fail at least once
<M-ms> could you also make it run perf tests (if you haven't already)?
<jakub_golinowski> M-ms, yes yes it runs perf tests as well
<M-ms> ok, good
<jakub_golinowski> M-ms, and about the changes I have as a PR - can I merge then now?
<M-ms> jakub_golinowski: sorry, I didn't have time to look at it properly
<M-ms> the only thing I wanted to comment was that you don't in principle need to have project(xxx) and find_package(HPX) in all the sub-CMakeLists.txt, it would be enough to do that at the top level
<M-ms> on the other hand they now still work as standalone examples which is still nice
<M-ms> but please go ahead and merge so you're not stuck juggling branches
<heller___> jbjnr: can you please post the full error message?
<jakub_golinowski> M-ms, thanks - with the projects at the single-example level the idea was exactly to still be able to build a single example. For instance the mandelbrot benchmark is based on rebuilding the single opencv_mandelbrot example
<jbjnr> error messages for both type of context coroutines
<jbjnr> I can't remember which I'm supposed to use
<jbjnr> M-ms: maybe you know which options matthieu used on power
<heller___> jbjnr: you should use GENERIC_CONTEXT
eschnett has joined #ste||ar
<jbjnr> heller___: ok. remind me, -DHPX_WITH_GENERIC_CONTEXT_COROUTINES=ON means use boost, or use our own version?
<heller___> jbjnr: I am not sure where your error comes from
<heller___> this means to use boost
<jbjnr> that's odd. the version in git seems different to mine
<jbjnr> I'd better check I'm not holding some merges that messed it up.
<jbjnr> heller___: I have merged the "Changing the coroutine implementations to do a lazy init" into my branch and that's why it is different.
<heller___> ahh, i see
<heller___> could you try with regular master please?
<jbjnr> that's annoying, because I can't unmerge that as I made a load of changes to fit myself to that branch
<jbjnr> I am building master now, but if there is a chance you could fix that lazy init branch ....
<jbjnr> heller___: master https://gist.github.com/biddisco/b4666f77c426be10f300f9ff0c7bae37 I think I need to tell clang to use its own c++ stdlib instead of the gcc one.
<heller___> yup
jakub_golinowski has quit [Ping timeout: 265 seconds]
<M-ms> jbjnr: I don't know but I assume he didn't have any special options
<M-ms> But maybe that's why he was having problems later...
<jbjnr> M-ms: it's ok. turns out my branch is screwed up. redoing everything now with stdlib=libstdc++ and clean master branch
<M-ms> ok
<zao> Sounds like you're having about as much fun as I do on FreeBSD :)
eschnett has quit [Quit: eschnett]
eschnett has joined #ste||ar
hkaiser has joined #ste||ar
<nikunj> hkaiser: So I looked into the cmake source code in detail today. Turns out hpx_wrap got added due to its addition in HPX_PKG_LIBRARIES and not due to hpx_targets. I made the change and tested the code with Debug/Release with dynamic main turned on and off
<nikunj> Phylanx worked fine on my laptop with the HPX build
anushi has quit [Remote host closed the connection]
<hkaiser> nikunj: does the hpx build pass now?
hkaiser has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
<hkaiser> nikunj: good, thanks
<hkaiser> let's merge this, then
<hkaiser> nikunj: for Phylanx, do we still need #519?
<nikunj> hkaiser: yes
<hkaiser> ok, let's retrigger that once HPX has cycled
<nikunj> in my Phylanx pull I've added 2 lines of code
<nikunj> One for the linker flag, second for the linking library
<hkaiser> k, is that in #519?
<nikunj> yes
<nikunj> that's why we will need #519 as well
<hkaiser> k
<jbjnr> boost help anyone?
<jbjnr> the boost libs on power are generated with names like libboost_system-clang70-mt-d-p64-1_67.so
<hkaiser> wazzup?
<hkaiser> yes
<jbjnr> but cmake does not add the p64 to the lib name and so cannot find it
<jbjnr> can I tell b2 to not add p64 to the names
<hkaiser> try a newer cmake
<zao> Ooh, the new CPU tags.
<hkaiser> iirc that was added recently only
<jbjnr> can I just turn it off?
<hkaiser> you can generate boost without the tags, yes
<jbjnr> rebuilding cmake will take longer than relinking boost
<jbjnr> and I have no guarantee that it will work with a newer cmake
<zao> Amusingly enough, they don't seem to have implemented the Power one.
<zao> jbjnr: ^
<jbjnr> thanks. That tells me what I need to know. newer cmake won't help
<zao> Anyway, building a Boost in non-"versioned" flavour should get rid of it, together with the rest of the useful tags.
<jbjnr> ok. I will remove the versioned tag
<jbjnr> thanks
<jbjnr> that was what I wanted to know
K-ballo has joined #ste||ar
<zao> Not sure if the layout "tagged" will work, or if you need to go down to "system".
<zao> Or building a newer CMake, with a FindBoost handpatched :P
<jbjnr> trying now with 'tagged'
<jbjnr> I will build newer boost if this fails
<jbjnr> I begin to see why our users dislike boost so much.
<jbjnr> zao: thanks. libs look good in tagged mode
<jbjnr> now I try cmake on them
<jbjnr> yat \o/
<jbjnr> yay \o/
<jbjnr> I meant.
<jbjnr> works!!!!
<zao> \o/
<jbjnr> zao: cpp tests compiling, linking and giving the right output. All is well with the world. Cheers.
<jbjnr> now for hpx again ....
<nikunj> jbjnr: HPX cmake for some reason doesn't find boost if the layout is set to system or versioned but finds it with layout set to tagged. That's one thing I faced while installing HPX
<jbjnr> nikunj: ok. I usually use versioned without problems, but it looks like some new stuff is added and cmake has not caught up.
<jbjnr> heller___: M-ms hkaiser pycicle is being upgraded so that you can now trigger N builds per PR, with random combinations of CMake options and settings for each build. Options can be dependent on other options (so if HPX_WITH_CUDA is ON, then blah blah blah can be added with random combinations of sub options). Also, boost versions, compiler type (gcc/clang) can be handled in the same pass, so we now have a way of replicating buildbot
<jbjnr> and going further because we can explore option spaces more thoroughly.
<jbjnr> I will need help setting up good combinations of options for testing.
mcopik has joined #ste||ar
<jbjnr> anyone : openpower01:~/build/hpx$ bin/hello_world --hpx:threads=4
<jbjnr> terminating with uncaught exception of type std::invalid_argument: hpx::resource::get_partitioner() can be called only after the resource partitioner has been allowed to parse the command line options.
<jbjnr> Aborted (core dumped)
<jbjnr> Can someone remind me what causes the error above^^^ ?
<nikunj> jbjnr: which HPX build are you using?
quaz0r has quit [Ping timeout: 260 seconds]
<jbjnr> the one I have just done on a power machine
<nikunj> jbjnr: it might be due to a previous incomplete implementation that got merged
<nikunj> could you try building the current master?
<jbjnr> this is master
<jbjnr> from today
<jbjnr> M-ms: can you remember what causes the RP error above?
jakub_golinowski has joined #ste||ar
<nikunj> jbjnr: my commit got merged like an hour ago. Could you retry with that?
<jbjnr> ok
<M-ms> jbjnr: HPX_WITH_MAX_CPU_COUNT?
<jbjnr> nikunj: your merge has nothing to do with this error
<jbjnr> M-ms: aha!
<jbjnr> thanks
<jbjnr> I check ...
<jbjnr> we need to fix that. its a huge PITA
<jbjnr> I forget every time
<M-ms> not sure if it's that but there's an equally unhelpful message if you forget that
<nikunj> jbjnr: That error also appears when you try to access HPX functionality without initializing HPX (I faced it quite a lot while implementing). So I got confused :/
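For context, a minimal sketch of the initialisation pattern nikunj is describing (illustrative only, not taken from the failing program): HPX facilities such as the resource partitioner are only used from hpx_main, after hpx::init has parsed the command line.

    #include <hpx/hpx_init.hpp>
    #include <hpx/include/iostreams.hpp>

    int hpx_main(int argc, char* argv[])
    {
        // HPX facilities (including the resource partitioner) are only touched
        // here, after hpx::init has parsed the command line.
        hpx::cout << "hello world\n" << hpx::flush;
        return hpx::finalize();
    }

    int main(int argc, char* argv[])
    {
        // Parses --hpx:threads etc., then invokes hpx_main on the HPX runtime.
        return hpx::init(argc, argv);
    }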
<jbjnr> I think I will add a CMake check that greps /proc/cpuinfo and gives a warning just in case. It won't be reliable because of login/compute node differences, but at least it might help
<M-ms> does someone know why we can't have the masks be dynamic? was it for a performance reason at some point?
<jbjnr> yes. for <64 we use an int with bit masks, for larger we use a bitset. I doubt there's much difference in speed though
<heller___> shouldn't
<jakub_golinowski> M-ms, could you tell me what I should expect from perf? Is there an option to have it tell me how much time was spent in each function or sth like this?
<heller___> we used to use a dynamic bitset once. that was a performance hit
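A rough sketch of the scheme jbjnr and heller___ describe (illustrative only, not the actual HPX headers): the CPU mask type is fixed at compile time from the configured maximum CPU count, which is why a node with more PUs than HPX was configured for causes trouble. HPX_MAX_CPU_COUNT_SKETCH below is a hypothetical stand-in for the value of the HPX_WITH_MAX_CPU_COUNT CMake option.

    #include <bitset>
    #include <cstdint>

    // Hypothetical stand-in for the configured HPX_WITH_MAX_CPU_COUNT value.
    #ifndef HPX_MAX_CPU_COUNT_SKETCH
    #define HPX_MAX_CPU_COUNT_SKETCH 64
    #endif

    #if HPX_MAX_CPU_COUNT_SKETCH <= 64
    // Up to 64 PUs: a single integer word used as a bit mask.
    using mask_type = std::uint64_t;
    #else
    // More than 64 PUs: a fixed-size bitset, still sized at compile time
    // (a dynamic bitset was tried once and cost performance).
    using mask_type = std::bitset<HPX_MAX_CPU_COUNT_SKETCH>;
    #endif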
<jbjnr> balls.
<jbjnr> fixing CPU count did not make error go away
quaz0r has joined #ste||ar
<M-ms> jbjnr: ?
<jbjnr> ?
<M-ms> just to be sure, you set it high enough? because it has something like 8 hyperthreads, no?
<jbjnr> grrrr
<jbjnr> lstopo gives me 160 PUs and I set it to 256
<jakub_golinowski> M-ms, I got as far as this: Overhead Command Shared Object Symbol
<jakub_golinowski> 42.07% opencv_perf_dnn libopencv_dnn.so.4.0.0 [.] _ZN2cv3dnn8opt_AVX28fastConvEPKfmS
<M-ms> jakub_golinowski: yeah, so I usually run "perf record -F99 --call-graph dwarf my_program"
<M-ms> -F sets the sampling frequency, call-graph dwarf was useful for some reason
<M-ms> it then outputs a file which you can view with perf report
<hkaiser> jbjnr: I still think this is a problem in the resource_manager
<hkaiser> we need to correct things there, using the cmake option just prevents it from happening
<M-ms> ah, yeah, so you probably need to recompile with RelWithDebInfo for it to be useful
<jakub_golinowski> M-ms, but actually this is from the debug build :/
<jbjnr> M-ms: hwloc 1.x or 2.0 on power? opinion?
<M-ms> note that hpx is going to have a lot of entries there while opencv is doing single threaded stuff
<jbjnr> no difference I would hope
<hkaiser> M-ms: any opinion on that?
<M-ms> hrm
<M-ms> jbjnr: I think neither was significantly worse when we tested with mathieu (but some affinity test was giving problems)
<jbjnr> I was just looking for possible reasons why the RP was choking.
<hkaiser> jbjnr, M-ms: I believe the problem you're seeing is at least diagnosable, if not preventable
<M-ms> hkaiser: you mean checking properly in the resource manager if hpx is configured with a bad max cpu count?
<hkaiser> M-ms: at least, if not find a way for the resource manager to use only as many cores as hpx was configured with
<jbjnr> it might not be an RP problem. it might be some unrelated error
<hkaiser> jbjnr: there is also the hpx_main snafu on master nikunj is trying to resolve, currently
<hkaiser> could be resolved now, could be not
<M-ms> mmh, yeah if it's cpu count related I'm sure we could do something better there, if it's something else depends on what that something else is...
<hkaiser> results in a similarily useless error message
<M-ms> jakub_golinowski: so are you getting more lines of output than that one line?
<hkaiser> M-ms: well, you can work around the issue by specifying the cpu count, so it looks to be related
<nikunj> jbjnr: could you please tell the environment you're on?
<hkaiser> jbjnr: last resort: use HPX_WITH_DYNAMIC_HPX_MAIN=OFF
<jakub_golinowski> M-ms, ah yes, of course - but I still get the somewhat obfuscated Symbols
<M-ms> jakub_golinowski: another approach would be to just time the parallel_for loop to see if that's where the time is spent or if it's somewhere else
<hkaiser> M-ms, jakub_golinowski: have you considered using APEX? it allows you to collect traces, timings, etc.
<jbjnr> nikunj: I'm on a powerpc node. OS is RedHat Enterprise Server 7.5. 160 cores. Shitload of memory. etc etc
<M-ms> hkaiser: ah, ok
<hkaiser> shows every hpx thread in the end
<jbjnr> Clang 7.0 compiled from source today
<hkaiser> jbjnr: is it one of those summit nodes?
<jbjnr> nikunj: hkaiser I did not realize that nikunj was working on something related to this. I will pull from master again now
<hkaiser> 6 GPUs?
<jbjnr> hkaiser: similar
<hkaiser> cool
<jbjnr> it's a local node here we are using for summit type testing
<hkaiser> nice
<nikunj> jbjnr: redhat does come with glibc. So my code should work. Please try with the current merged commit. If it's related to my implementation then it should (most likely) get fixed
<M-ms> hkaiser: that's a good idea, wanted to start off easy though
<hkaiser> k
hkaiser has quit [Quit: bye]
<jbjnr> clang-7: error: unable to execute command: Segmentation fault (core dumped)
<jbjnr> grrrrrr
<jbjnr> tried to recompile jemalloc
<jbjnr> (again)
<jakub_golinowski> M-ms, so basically I should not see Symbols like this: _ZN3hpx7threads10coroutines6detail2lx10trampolineINS2_14coroutine_implEEEvPT_
<jakub_golinowski> but sth more readable?
<nikunj> jakub_golinowski: are you trying to see assembly for a code?
<jakub_golinowski> nikunj, no no :D I am trying to profile a cpp application. Specifically one of the OpenCV tests
<jakub_golinowski> using perf for the first time and trying to first get to know what should I expect from it
<nikunj> jakub_golinowski: oh my bad then. Those symbols come in assembly so I was a bit curious
<jbjnr> nikunj: using latest master I get the same exception
<jbjnr> terminating with uncaught exception of type std::invalid_argument: hpx::resource::get_partitioner() can be called only after the resource partitioner has been allowed to parse the command line options.
<jbjnr> Aborted (core dumped)
<nikunj> jbjnr: It might not be related to my code then. As a last resort can you try building with -DHPX_WITH_DYNAMIC_HPX_MAIN=OFF
<jbjnr> building now
<nikunj> jbjnr: just to be sure, it comes with glibc right?
<jbjnr> nikunj: -DHPX_WITH_DYNAMIC_HPX_MAIN=OFF fixes it.
<jbjnr> Thanks
<jbjnr> I have hello world from 160 cores
<nikunj> jbjnr: so it is related to my implementation. Could you please tell me the result of: ldd --version
<jbjnr> openpower01:~/build/hpx$ ldd --version
<jbjnr> -bash: otool: command not found
<jbjnr> we are missing some bintools by the looks of it
<nikunj> jbjnr: that explains it!
<jbjnr> how?
<nikunj> My implementation requires glibc. It was not found in your machine
<nikunj> that is why it did not work. I could not find a macro that specifically targeted glibc, so I used linux in general (thinking most of them come with glibc by default)
<jbjnr> ok. makes sense. I am not much of an expert with the sysadmin side of stuff.
<jakub_golinowski> M-ms, not sure if I'm doing sth wrong but other libraries have more readable Symbols (including the .so from OpenCV), however HPX still has this style:
<jakub_golinowski> _ZNSt8_Rb_treeIPKvSt4pairIKS1_N3hpx4util6detail9lock_dataEESt10_Select1stIS8_ESt4lessIS1_ESaIS8_EE4findERS3_
<zao> Judging by paths mentioned earlier, it's running RHEL/CentOS?
<jbjnr> M-ms: building all tests on power now, will confirm Matthieu's findings. Can you remember if he had many fails?
<jbjnr> nikunj: Thanks very much for helping. I can actually get some work done now.
<jbjnr> zao: I love doing a make -j32 test :)
<jbjnr> I could try more, but ....
<nikunj> jbjnr: I will find a suitable solution to fix it out of the box. until then please use -DHPX_WITH_DYNAMIC_HPX_MAIN=OFF or install glibc
<zao> Lovely.
<zao> We only have 72 cores on the largemem nodes, 260+ on the KNLs but that's cheating.
<nikunj> woah! that's a lot of cores
<nikunj> and I'm yet to work on an 8 core machine :p
<zao> Had to go for four sockets to support 3TB of memory.
<zao> So many sticks :)
<nikunj> xD
<nikunj> K-ballo: yt?
<M-ms> jakub_golinowski: I've had more readable names at some point, but seem to have similar names when I'm checking now
<M-ms> I'll check at home as well
<M-ms> you should still be able to see if there's something obvious that sticks out
<M-ms> but consider timing just the parallel for loops as well, that's bound to work
<M-ms> jbjnr: I have a list somewhere but I think it was two or three tests that failed consistently
<jakub_golinowski> M-ms, also in the perf docu they use the term build-id - maybe it has to be somehow explicitly enabled for hpx?
<jakub_golinowski> M-ms, as for timing parallel for loops, you mean also with the use of perf?
<M-ms> jakub_golinowski: no, for that I mean just with a timer, I don't think perf can be made to look at just a section of code
<jakub_golinowski> M-ms, a timer like in the opencv_mandelbrot, just by changing the source code (moving the instrumentation into the binary)?
<M-ms> yep, the "dumb but simple" way
<M-ms> jbjnr: 455 - tests.unit.parcelset.distributed.tcp.put_parcels_with_coalescing (Failed)
<M-ms> 506 - tests.unit.threads.thread_affinity (Failed)
<M-ms> I'm not sure if the first one always failed, the second one was the main problem
<jbjnr> thanks. thread_affinity is a biggy
<jbjnr> that's what I look at now.
<jbjnr> [100%] Built target tests
<M-ms> I think it was with hwloc 1.X and 2.0.0, 2.0.1 may have fixed something
galabc has joined #ste||ar
<K-ballo> nikunj: I'm here now
<nikunj> K-ballo: do you know a way (probably a macro) to know if the code is linked with glibc?
<zao> predef.sf.net is (used to be) the holy grail for detection defines.
<K-ballo> sorry, I froze over linked there
<K-ballo> `__GLIBC__` should be fine to see if it was included
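A minimal sketch of the check K-ballo suggests: on glibc systems <features.h> (pulled in by essentially every libc header) defines __GLIBC__ and __GLIBC_MINOR__, so code that depends on glibc's startup path can be guarded with it.

    #include <cstdio>   // any standard header pulls in <features.h> on glibc

    int main()
    {
    #if defined(__GLIBC__)
        std::printf("glibc %d.%d\n", __GLIBC__, __GLIBC_MINOR__);
    #else
        std::printf("not glibc\n");
    #endif
        return 0;
    }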
hkaiser has joined #ste||ar
<jbjnr> M-ms: I think my build wins the epic fail contest https://gist.github.com/biddisco/10f1e782c86a0483bc752753c3d1c056
<jbjnr> interesting that the affinity looks like it passed. but there are clearly problems with parallel algorithms
<K-ballo> that's an odd collection of algorithms.. why those and not others?
<nikunj> K-ballo: let me try
<M-ms> impressive!
<M-ms> which hwloc?
<K-ballo> jbjnr: what's the output for 553?
<nikunj> zao: I too saw that link, the problem was the inclusion of that header `features.h`
<M-ms> and how many of those fail only because the timed out tests don't properly die?
<jbjnr> K-ballo: 553: /users/biddisco/src/hpx/tests/unit/util/function/function_arith.cpp(30): test 'f(5, 3) == 5.f/3' failed in function 'int hpx_startup::user_main()': '1.66667' != '1.66667'
<jbjnr> useful error message!
<jbjnr> M-ms: hwloc 1.11.10
<K-ballo> what kind of floating point you have over there?
<jbjnr> I think
<jbjnr> K-ballo: no idea. I'd better check
<K-ballo> -fast-math?
<jbjnr> whatever they give us on PowerPC
<jbjnr> yes I used fast-math
<K-ballo> ok
<nikunj> hkaiser: could you please retrigger the Phylanx PR
<hkaiser> nikunj: has hpx passed now?
<nikunj> yes
<hkaiser> done
<jbjnr> K-ballo: https://gist.github.com/biddisco/c05978233004587758b2abca1c954477 I'm going to stick my neck out and surmise that we have a bug in our find implementation ...
<zao> **gasp**
<jbjnr> __kernel_sigtramp_rt64 is new to me
<zao> Assumedly involved in failure handling on your platform.
<zao> I can't find the logs, but I could swear that that test failed the other day for me.
<jbjnr> yes. I suppose that catches something, then the stack backtrace dies and segfaults. Not sure what the original error is
<zao> Huh... didn't we change all these tests over to something not rand()/srand()?
<zao> Ah, some of the failing ones use generators, probably not that then.
mbremer has joined #ste||ar
<nikunj> hkaiser: seems like all tests have passed!
<jakub_golinowski> M-ms, hkaiser: I built HPX in RelWithDebInfo mode and still get the somewhat "obfuscated" perf Symbols:
<jakub_golinowski> _ZN3hpx7threads6detail15scheduling_loopINS0_8policies30local_priority_queue_schedulerISt5mutexNS3_13lockfree_fifoES6_NS3_13lockfree_lifoEEEEEvmRT_RNS▒
<M-ms> jakub_golinowski: I don't know why that's happening
<M-ms> some more basic things you could try are change the number of threads hpx uses, and play with the idling settings
<jakub_golinowski> M-ms, to probe the performance increase
<jakub_golinowski> ?
<M-ms> you'll get 8 threads by default and it might be that the serial parts of the tests are slowed down significantly by a second thread spinning in the scheduling loop on the same core
<M-ms> just to see if it makes some difference to start with, don't care about exact numbers yet
<hkaiser> jakub_golinowski: have you tried to feed those symbols through c++filt?
<jakub_golinowski> hkaiser, no, what is that?
galabc has quit [Quit: Leaving]
<jakub_golinowski> Ah I see
<jakub_golinowski> hkaiser, it works :D
<hkaiser> jakub_golinowski: you can pass the whole file through it, it will ignore everything but the symbols
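For completeness, the demangling that c++filt performs can also be done programmatically through the compiler's ABI helper; a small sketch (the mangled string is one of the names from the perf output above):

    #include <cstdlib>
    #include <cxxabi.h>
    #include <iostream>
    #include <string>

    std::string demangle(const char* mangled)
    {
        int status = 0;
        // abi::__cxa_demangle allocates the result with malloc; status 0 means success.
        char* demangled = abi::__cxa_demangle(mangled, nullptr, nullptr, &status);
        std::string result = (status == 0 && demangled) ? demangled : mangled;
        std::free(demangled);
        return result;
    }

    int main()
    {
        std::cout << demangle(
            "_ZN3hpx7threads10coroutines6detail2lx10trampolineINS2_14coroutine_implEEEvPT_")
                  << '\n';
    }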
<nikunj> hkaiser: all tests have passed with #519. This should fix things for now.
<hkaiser> nikunj: let me merge things and retrigger all the other PRs
david_pfander has quit [Ping timeout: 260 seconds]
hkaiser has quit [Quit: bye]
mcopik has quit [Ping timeout: 265 seconds]
mcopik has joined #ste||ar
<jakub_golinowski> M-ms, hkaiser: I found out that the package perf for ubuntu is built with NO_DEMANGLE
<jakub_golinowski> I rebuilt the package manually, changing NO_DEMANGLE to 0, and now it works without the c++filt trick
jakub_golinowski has quit [Quit: Ex-Chat]
nikunj has quit [Quit: Leaving]
jakub_golinowski has joined #ste||ar
nikunj has joined #ste||ar
anushi has joined #ste||ar
<nikunj> jbjnr: yt?
<jbjnr> nikunj: just logged in from home
<nikunj> ohk, jbjnr could you share the options that ld gives you in Redhat (run: man ld and share a gist) the next time you operate on it.
<jbjnr> jakub_golinowski: you should try --hpx:threads=4 --hpx:bind=balanced to use only one PU per core
<jbjnr> nikunj: I'll try now
<jakub_golinowski> jbjnr, in order to boost performance?
<nikunj> jbjnr: thanks!
<jbjnr> jakub_golinowski: yes, if you have 4 cores, 8 hyperthreads, you often find that performance drops when you use both PUs on a core compared to just one
hkaiser has joined #ste||ar
<jakub_golinowski> ok, thanks for the hint!
<jakub_golinowski> jbjnr, I will run the mandelbrot benchmark with this config
<nikunj> jbjnr: could you run this code as well and tell me if it prints anything: https://gist.github.com/NK-Nikunj/f13dfc69e3deb580b421984c64fe8147
<jbjnr> use N/2 or however many actual cores you have
<jakub_golinowski> jbjnr, right
<jakub_golinowski> but actually sth went wrong and I cannot build opencv any more :/
<jbjnr> nikunj: I have added GLIBC to an existing check I have and the output is here https://gist.github.com/biddisco/4b9080a805775c4a9858b649a556bad5
<jbjnr> near the bottom
<nikunj> so glibc is defined
<nikunj> jbjnr: did you add glibc today or was it already present?
<jbjnr> I did not add it
<jbjnr> I compiled a small test using the same compiler settings as I use for HPX
<zao> The libc of choice on a system tends to be rather fixed in stone.
<zao> Unless you go extremely out of your way to build a separate one, but it won't be able to interop much with the world.
<nikunj> then this would mean glibc has defined its startup code differently for powerpc
anushi has quit [Remote host closed the connection]
<nikunj> in that case, I'll have to look up the source code for powerpc as well
<jbjnr> nikunj: https://github.com/biddisco/cpptest/blob/master/pi-lockfree.cxx is a test I wrote when I had problems with HPX on raspberry pi, I just added __GLIBC__ to it and ran it
<zao> jbjnr: Did you mention what kind of hardware and distro this was?
* zao prays it's not a Cray.
<jbjnr> gtg dinner is ready
<jbjnr> back in 25mins
<zao> enjoy!
<jakub_golinowski> nikunj, can this error be connected with ongoing changes: https://pastebin.com/Ln5VfHYW
<jakub_golinowski> hmm I think it might be not
<nikunj> jakub_golinowski: it is linked with my errors
<nikunj> jakub_golinowski: could you try building HPX from current master
<nikunj> that should fix things
<jakub_golinowski> nikunj, ok
<nikunj> jakub_golinowski, what environment are you using?
<jakub_golinowski> nikunj, ubuntu 16.04
<jakub_golinowski> gcc 5.4.0
<jakub_golinowski> but this error seems to only emerge when I try to build opencv against hpx in RelWithDebInfo mode
<nikunj> oh yes, I can understand. That's coz I mistakenly exported CMAKE_EXE_LINKER_FLAGS, which is the root cause of all the disruption you see
<nikunj> It added my linker flag (savior for hpx, killer for others) to it, so anything but an hpx executable would fail to build
<nikunj> I changed it today and it is now merged, so things should run fine now
<jakub_golinowski> nikunj, ok, rebuilding hpx
<nikunj> jbjnr: could you please share the assembly of a simple hello world program
anushi has joined #ste||ar
<hkaiser> nikunj: yt?
<hkaiser> nikunj: things are much better now, but not entirely good yet
<hkaiser> please look at the phylanx buildbot here: http://ktau.nic.uoregon.edu:8020/#/
<hkaiser> this might be a better view: http://ktau.nic.uoregon.edu:8020/#/builders
<hkaiser> three (out of nine) platforms pass now
anushi has quit [Remote host closed the connection]
<hkaiser> nikunj: better yet here: http://ktau.nic.uoregon.edu:8020/#/console
<nikunj> hkaiser: I'll have a look
<nikunj> hkaiser: are these separate architectures?
jakub_golinowski has quit [Ping timeout: 256 seconds]
<hkaiser> yes, powerpc, knl (XeonPhi) and a x86 system
<nikunj> hkaiser: ok
<nikunj> I'll have a look
<hkaiser> thanks
jakub_golinowski has joined #ste||ar
anushi has joined #ste||ar
<nikunj> hkaiser: do we have a similar buildbot for hpx?
jakub_golinowski has quit [Ping timeout: 256 seconds]
jakub_golinowski has joined #ste||ar
<hkaiser> nikunj: no, but hpx is being built there as well (no tests are run, though)
<nikunj> hkaiser: ok
anushi has quit [Remote host closed the connection]
jakub_golinowski has quit [Ping timeout: 256 seconds]
jakub_golinowski has joined #ste||ar
anushi has joined #ste||ar
<nikunj> hkaiser: I see only one failing test everywhere, am I missing anything?
<jbjnr> if you need extra debug symbols etc - I can recompile - think this was release
<nikunj> jbjnr, no that's enough
<nikunj> I only wanted to see how they handle call to __libc_start_main
<jbjnr> watching england-croatia now, so won't be replying much
<nikunj> jbjnr: England will win
nikunj has quit [Quit: Leaving]
<jbjnr> wow - what a goal!!!!
nikunj has joined #ste||ar
<M-ms> is phylanx using some fancy new version of buildbot?
<M-ms> jakub_golinowski: good that you got perf working
<M-ms> I'm going to rebuild all my stuff with latest masters as well now
<M-ms> try not just the mandelbrot benchmark with 4 threads but also the opencv perf tests
<nikunj> jbjnr: Please run the executable generated from the makefile here (https://github.com/NK-Nikunj/GSoC-experimental-codes/tree/master/powerpc) whenever you get time. If the output is not the same as written in README.md then please notify me about it.
hkaiser has quit [Quit: bye]
<jaafar> Someone challenged me on Twitter so I'm going to pull out some HPX on them https://twitter.com/lemire/status/1017058602161463296
<jaafar> Don't let me down, y'all
eschnett has quit [Quit: eschnett]
anushi has quit [Ping timeout: 264 seconds]
anushi has joined #ste||ar
<zao> :)
<nikunj> so __libc_start_main does not get wrapped!
<nikunj> That's the root cause of non-working hpx build on Powerpc then
<nikunj> jbjnr: could you provide me with the assembly of main?
<nikunj> that would help me decipher things
jakub_golinowski has quit [Ping timeout: 256 seconds]
jakub_golinowski has joined #ste||ar
hkaiser has joined #ste||ar
<jbjnr> nikunj: added to gist at bottom
<jbjnr> <nikunj> so __libc_start_main does not get wrapped!
<jbjnr> <nikunj> That's the root cause of non-working hpx build on Powerpc then
<jbjnr> <nikunj> jbjnr: could you provide me with the assembly of main?
<jbjnr> <nikunj> that would help me decipher things
<jbjnr> ◀━━ Quits: jakub_golinowski (~jakub@2a00:23c0:3201:c601:8c09:1667:2f74:f10b) (Ping timeout: 256 seconds)
<jbjnr> ━━▶ Joins: jakub_golinowski (~jakub@host31-52-138-38.range31-52.btcentralplus.com)
<jbjnr> ━━▶ Joins: hkaiser (~hkaiser@2600:1700:a50:99a0:cc2c:a554:2a09:bd8d)
<jbjnr> ❮▲❯ ChanServ gives channel operator status to hkaiser
<jbjnr> oops. keyboard error
<jbjnr> not sure what happend. Football is back on. bbiab
<nikunj> Now it's getting even stranger. __libc_start_main is getting wrapped but still the program is not working T-T
<nikunj> jbjnr, could you try building it again: https://github.com/NK-Nikunj/GSoC-experimental-codes/tree/master/powerpc
<nikunj> hkaiser, yt?
<hkaiser> here
<nikunj> hkaiser: there is one test failing in all the builds
<hkaiser> why is it red, then?
<nikunj> hkaiser, also from the stack frame I can see HPX_WITH_DYNAMIC_HPX_MAIN=OFF
<nikunj> anything you did?
<hkaiser> ok
<hkaiser> I'll look
<hkaiser> ok
<hkaiser> even that does not prove anything ;-)
<nikunj> it does not
<nikunj> I'm happy that most of the errors have passed
<hkaiser> nikunj: :D
<hkaiser> yah, good job!
<nikunj> jbjnr, I have changed the function signature, could you try it again: https://github.com/NK-Nikunj/GSoC-experimental-codes/tree/master/powerpc
<diehlpk_work> jbjnr, Exciting game :)
<nikunj> diehlpk_work, very exciting!
<jbjnr> nikunj: pulled from master but it gave me a conflict - I assume you force pushed. I reset --hard origin/master and recompiled. Same output, just main
<nikunj> jbjnr, yes it was a force push
<nikunj> jbjnr, i see
<nikunj> jbjnr, could you tell if it's a powerpc or a powerpc64?
<jbjnr> 64
<nikunj> also which environment is it (with version)
<jbjnr> what do you mean? what do you want to know
<nikunj> which version of redhat are you using?
mbremer has quit [Quit: Page closed]
mcopik has quit [Ping timeout: 240 seconds]
mcopik has joined #ste||ar
<M-ms> sorry jbjnr
<nikunj> I feel sad for england right now
<nikunj> jbjnr, I made changes specific to powerpc. Could you run make again? (https://github.com/NK-Nikunj/GSoC-experimental-codes/tree/master/powerpc)
jakub_golinowski has quit [Ping timeout: 244 seconds]
<jbjnr> nikunj: /usr/lib/gcc/ppc64le-redhat-linux/4.8.5/../../../../lib64/crt1.o: In function `_start':
<jbjnr> (.text+0x24): undefined reference to `__wrap___libc_start_main'
<jbjnr> clang-7: error: linker command failed with exit code 1 (use -v to see invocation)
<diehlpk_work> hkaiser, Could you finish your gsoc evaluation by today?
<jbjnr> nikunj: I tried adding extern "C" to the __wrap_xxx but it segfaulted when I did that
jakub_golinowski has joined #ste||ar
<nikunj> ok
<nikunj> hkaiser, yt?
<hkaiser> nikunj: here
<nikunj> I have a wonderful idea (which I should have thought of before). Instead of wrapping __libc_start_main, let's wrap main instead. The call to main is the same for every architecture, so it will work flawlessly
<hkaiser> ok
<nikunj> I thought that __libc_start_main would work similarly (so I never bothered changing main) but it turns out a slight difference in its definition won't allow it to work properly on powerpc
<hkaiser> ...or other platforms
<hkaiser> we new it was brittle
<hkaiser> knew*
<nikunj> yes
<nikunj> main should not be brittle
<nikunj> jbjnr, could you try the current master?
<nikunj> just to check if my main hypothesis is correct
<jbjnr> wrap_libc.cpp:(.text+0x44): undefined reference to `__real_main'
<nikunj> oh wait
<nikunj> I forgot to add a few things
<nikunj> jbjnr, try now, I've changed makefile
<jbjnr> nikunj: openpower01:~/src/GSoC-experimental-codes/powerpc (master<>)$ ./main
<jbjnr> __wrap_main
<jbjnr> main
<jbjnr> looks good
<nikunj> jbjnr, perfect!
<nikunj> so my hypothesis was right!
<jbjnr> what was the hypothesis
<nikunj> jbjnr, the --wrap option (from ld) provides a way to wrap a symbol and define your own wrapper for it. I chose __libc_start_main to wrap initially, thinking that we might be able to initiate the HPX runtime pretty early on. That failed pretty miserably, but the entry point was still changed to our own custom entry point.
<nikunj> Thinking that the solution was still portable enough, I implemented it, only to find out later that it won't work on powerpc
<nikunj> So now we are wrapping main instead, since that will be our new portable entry point
<nikunj> that should do things nicely
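A minimal sketch of the wrapping nikunj describes, assuming a standalone test program rather than the actual HPX sources: linking with -Wl,--wrap=main makes the startup code's reference to main resolve to __wrap_main, while __real_main resolves to the original main, which matches the "__wrap_main" / "main" output jbjnr reports below.

    // build (assumption): g++ wrap_main.cpp -Wl,--wrap=main
    #include <cstdio>

    extern "C" int __real_main(int argc, char** argv);  // resolves to the user's main

    extern "C" int __wrap_main(int argc, char** argv)
    {
        // In the real implementation this is where the HPX runtime would be
        // set up before any user code runs.
        std::printf("__wrap_main\n");
        return __real_main(argc, argv);
    }

    int main(int, char**)
    {
        std::printf("main\n");
        return 0;
    }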
<jbjnr> good work
<nikunj> jbjnr, thanks!
<nikunj> hkaiser: working on a pr now. Will try to get this merged, then rebase apple to merge it as well. sounds good?
<hkaiser> let's focus on one thing at a time
<nikunj> hkaiser: what should I focus on?
<hkaiser> finish linux support
<nikunj> hkaiser: the pr was to make the linux support better. This would prevent us from using multiple versions of __libc_start_main for different platforms
<hkaiser> nikunj: do you have ppc support under control now?
<nikunj> if we wrap main, we will have it under control. Things work on jbjnr's machine
<nikunj> hkaiser: changing the wrapper symbol will help us gain better overall control over the linux platform
<hkaiser> nikunj: isn't that what I suggested? finish linux support?
<nikunj> hkaiser: ohk, on it then, pr will arrive soon
<hkaiser> nikunj: is main a weak symbol too?
<nikunj> if it's letting us wrap it then yes it is as well
<nikunj> hkaiser: i'm not too sure though
<hkaiser> also, pls be careful, hpx_init defines its own main (which would have to be wrapped with a pp constant anyways in your case, now that I think about it)
<hkaiser> it even defines several versions of it
<nikunj> hkaiser: actually the thing with --wrap is it lets you wrap any function (weak or strong)
<hkaiser> does it?
<nikunj> Basically it tells the linker to resolve calls to the symbol to __wrap_<symbol_name> instead of the actual one
<nikunj> It's very much like the dlsym method. But the plus point here is that it can be extended to static executables as well
<nikunj> in the case of dlsym it is limited to dynamic executables only
<nikunj> that was the primary reason I chose wrap over dlsym. dlsym had one thing done right: if you chose to call dlsym when the symbol does not exist, then it would simply not refuse to work. In the case of wrap we had to add checks to get things right
<nikunj> *simply refuse to work
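For contrast, the dlsym route nikunj mentions would look roughly like the sketch below: an interposer built as a shared object and injected with LD_PRELOAD, which only works for dynamically linked executables. The exact __libc_start_main prototype varies slightly between glibc versions, so treat the signature as an assumption.

    // build (assumption): g++ -shared -fPIC interpose.cpp -o interpose.so -ldl
    // run   (assumption): LD_PRELOAD=./interpose.so ./my_program
    #include <dlfcn.h>   // RTLD_NEXT is a GNU extension (_GNU_SOURCE, default for g++)
    #include <cstdio>

    extern "C" int __libc_start_main(
        int (*main_fn)(int, char**, char**), int argc, char** argv,
        int (*init)(int, char**, char**), void (*fini)(void),
        void (*rtld_fini)(void), void* stack_end)
    {
        using start_main_t = int (*)(int (*)(int, char**, char**), int, char**,
            int (*)(int, char**, char**), void (*)(void), void (*)(void), void*);

        // Look up the next (real) definition of the symbol, i.e. the one in glibc.
        auto real_start = reinterpret_cast<start_main_t>(
            dlsym(RTLD_NEXT, "__libc_start_main"));

        std::printf("interposed __libc_start_main\n");
        // A real wrapper would substitute its own main here instead of main_fn.
        return real_start(main_fn, argc, argv, init, fini, rtld_fini, stack_end);
    }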
<hkaiser> k
<nikunj> hkaiser, check here to read about --wrap. I read it from here as well: https://linux.die.net/man/1/ld
<hkaiser> k
<jaafar> OK transform_reduce with par execution policy does pretty well, scales nicely
mcopik has quit [Ping timeout: 240 seconds]
<nikunj> hkaiser, build passes. Tests working out fine too
<hkaiser> jaafar: HPX?
<hkaiser> nikunj: nice
<jaafar> hkaiser: you got it :)
<hkaiser> jaafar: cool
<jaafar> It was just a dot product, I thought "there's an algorithm for that" :)
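A hedged sketch of the kind of measurement jaafar mentions: a dot product expressed with HPX's parallel transform_reduce and the par execution policy. Header names and namespaces follow the HPX 1.1-era API and are assumptions here, not jaafar's actual benchmark.

    #include <hpx/hpx_main.hpp>
    #include <hpx/include/parallel_transform_reduce.hpp>

    #include <cstddef>
    #include <iostream>
    #include <vector>

    int main()
    {
        std::size_t const n = 1 << 24;
        std::vector<double> a(n, 1.5), b(n, 2.0);

        double const dot = hpx::parallel::transform_reduce(
            hpx::parallel::execution::par,
            a.begin(), a.end(), b.begin(),
            0.0,                                         // initial value
            [](double x, double y) { return x + y; },    // reduction
            [](double x, double y) { return x * y; });   // element-wise product

        std::cout << "dot = " << dot << "\n";            // expect n * 3.0
        return 0;
    }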
mbremer has joined #ste||ar
<jaafar> If you want to follow on Twitter: https://twitter.com/lemire/status/1017058602161463296
<hkaiser> jaafar: thanks!
<hkaiser> jaafar: do you have your results posted somewhere?
quaz0r has quit [Ping timeout: 256 seconds]
<nikunj> hkaiser, phylanx builds and runs perfectly as well.
jbjnr has quit [Read error: Connection reset by peer]
<jaafar> hkaiser: ah no I didn't actually post them but I have them in a terminal window right now :)
mcopik has joined #ste||ar
<github> [hpx] NK-Nikunj opened pull request #3375: Replacing wrapper for __libc_start_main with main (master...Linux_better_impl) https://git.io/fNIXC
nikunj has quit [Quit: goodnight]
<jakub_golinowski> M-ms, for your reference and to start a discussion about how to read the perf logs: https://pastebin.com/8NueJTEa
mcopik has quit [Ping timeout: 260 seconds]
quaz0r has joined #ste||ar
quaz0r has quit [Ping timeout: 268 seconds]
hkaiser has quit [Read error: Connection reset by peer]
diehlpk has joined #ste||ar
quaz0r has joined #ste||ar
jakub_golinowski has quit [Ping timeout: 240 seconds]