<Yorlik>
That while is ... erm ... lol? But it fixed the nullptr issue
<Yorlik>
So the updater starts working before the on_start lambda has finished
<Yorlik>
hkaiser - I think this is an issue: The lambda should be finished before the tasks start. After all I'm using it to actually limit task creation. Not sure if I'm doing something wrong here.
<Yorlik>
Might be setting the task data in the wrong way
<Yorlik>
So that was stupid - it never leaves the loop.
<Yorlik>
But it means I am setting my data wrong
<Yorlik>
hkaiser: Is this how it's supposed to be set in the on_start lambda? (unsafe method)
<hkaiser>
Yorlik: yes, that's how it is supposed to be used
<Yorlik>
Using it inside is always giving me a nullptr
<Yorlik>
task_data_p = nullptr
<hkaiser>
you set it in on_start?
<Yorlik>
Yes
<Yorlik>
With the code I showed
<hkaiser>
and where do you call get?
<hkaiser>
you sure it's the same thread that called on_start?
<Yorlik>
inside update()
<Yorlik>
I am just using hpx::this_thread - might be wrong
<hkaiser>
print the id to check
<Yorlik>
Doing that
<hkaiser>
hpx::this_thread::get_id()
<Yorlik>
Yup - on it
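[Editor's note: a minimal sketch of the check being suggested here. The on_start/update hooks stand in for Yorlik's own code, and the include path is an assumption for the HPX version of the time.]

```cpp
#include <hpx/include/threads.hpp>
#include <iostream>

// Print the HPX thread id in both places and compare the output to see
// whether set and get really run on the same worker thread.
void on_start()   // hypothetical: the executor's on_start hook
{
    std::cout << "on_start on thread " << hpx::this_thread::get_id() << '\n';
}

void update()     // hypothetical: the per-object update function
{
    std::cout << "update on thread " << hpx::this_thread::get_id() << '\n';
}
```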
<Yorlik>
The lambda obviously isn't being called for whatever reason
<Yorlik>
I might have done something bad to the executor, I'm afraid ... checking
<Yorlik>
hkaiser: It is executing the Executor Constructor 4 times (one per thread I guess), but never calling any of the interface functions, especially not bulk_async_execute
<hkaiser>
so you're constructing an executor for each core, that's fine I guess
<hkaiser>
also, you use for_loop, right?
<Yorlik>
Yes
<Yorlik>
I wonder if I have a silent fail somewhere in the call chain
<Yorlik>
It used to work. But I didn't touch the executor
<hkaiser>
the thing is that auto_chunk_size() might run (part of) the iterations directly, which circumvents using the on_start/on_end
<hkaiser>
sorry, I forgot about that
<Yorlik>
Ow ..
<Yorlik>
Lemme switch that off a moment
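[Editor's note: a sketch of what "switching that off" amounts to, assuming the HPX API of the period: attach the custom executor and force a static chunker so auto_chunk_size cannot run iterations inline, bypassing on_start/on_end. my_executor and num_objects are placeholders for Yorlik's code.]

```cpp
#include <hpx/include/parallel_for_loop.hpp>
#include <hpx/include/parallel_executor_parameters.hpp>
#include <cstddef>

void run_update(my_executor& exec, std::size_t num_objects)
{
    // par.on(exec) routes the work through the custom executor;
    // .with(static_chunk_size()) replaces the default auto chunker.
    hpx::parallel::for_loop(
        hpx::parallel::execution::par.on(exec)
            .with(hpx::parallel::execution::static_chunk_size()),
        std::size_t(0), num_objects,
        [](std::size_t i) {
            // per-object update, e.g. the call into Lua
        });
}
```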
<hkaiser>
back to the drawing board, then
<Yorlik>
You want to rework the executor?
<Yorlik>
With static chunk size it works like a charm. I guess it's development :)
<hkaiser>
yah
<hkaiser>
sorry for all the trouble
<Yorlik>
Naw - it's fun actually.
<Yorlik>
And I'm learning a lot.
<Yorlik>
And after all - you made this very nice executor after I needed it - so - that's simply great !
<Yorlik>
If I make the static chunk size larger than the loop - will it run single-threaded or still chop it at least into the number of worker threads?
<hkaiser>
whatever the static chunker decides will happen, look at the code
<Yorlik>
OK
<Yorlik>
I guess it's get_chunk_size - it gives sensible defaults when 0 is passed
<Yorlik>
Updating 100,000 objects with 100,000 calls into Lua in ~1.5 - 2.0 seconds
<Yorlik>
Not too bad as a start
<Yorlik>
Can't wait to run that on my threadripper next week :D
<Yorlik>
12 cores :)
<hkaiser>
Yorlik: we need to make the autochunker run the function using the executor instead of running it directly
<hkaiser>
that's a bit involved (requires API changes) but not difficult
<Yorlik>
Will it cost performance?
<hkaiser>
but this would fix an obvious oversight in the initial design
<hkaiser>
no perf impact, I think
<Yorlik>
Alright.
<Yorlik>
I can wait for it.
<Yorlik>
I'll just use the static chunker with 0 as the param until then
<hkaiser>
autochunker would use executor::sync_execute to run the function
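[Editor's note: a sketch of the customization point hkaiser refers to. my_executor is hypothetical; the member signature follows the HPX executor concepts of that period.]

```cpp
#include <utility>

struct my_executor
{
    // The reworked auto chunker would call this instead of invoking the
    // function directly, so on_start/on_end style hooks are not bypassed.
    template <typename F, typename... Ts>
    auto sync_execute(F&& f, Ts&&... ts) const
    {
        return std::forward<F>(f)(std::forward<Ts>(ts)...);
    }

    // ... bulk_async_execute etc. as before
};
```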
<Yorlik>
It gives some nice defaults depending on core count
<hkaiser>
k
<hkaiser>
Yorlik: static chunker with zero as its argument is the default, I think - no need to use a chunker at all in this case
<Yorlik>
OK
<Yorlik>
So just don't call .with()
<hkaiser>
I might create the PR for this later today or tomorrow
<Yorlik>
Great. It's fun working like this. I feel supported :)
<Yorlik>
BTW - with the removal of another lock by using the task data, the exceptions in Debug Mode are gone (for now)
<Yorlik>
Alright - time to sleep - it's 4:00 A.M. here ... Good Night!
hkaiser has quit [Quit: bye]
<jbjnr>
what's the oldest version of boost that we support in hpx
<jbjnr>
anyone remember?
<heller1>
should be documented
<jbjnr>
I couldn't be arsed looking it up, thought someone might remember it
<jbjnr>
1.61 or newer
<rori>
it is set to 1.61
<jbjnr>
turns out to be easy to find
<jbjnr>
Thanks rori
<heller1>
1.67.0 or newer
<heller1>
oh, recommended ;)
<jbjnr>
I'm setting up spack on rostam to build all boost versions, with all compilers etc etc
<heller1>
cool
<zao>
(eeew)
<heller1>
flamewars incoming?
<zao>
Nah, just contractually obligated as an EasyBuild maintainer to react :P
<heller1>
;)
jaafar has quit [Ping timeout: 244 seconds]
jaafar has joined #ste||ar
<jbjnr>
zao: from my limited experience, spack seems to be a bit better than easybuild for us users. I'm able to do more with fewer errors and less pain, though I'm still struggling a bit with some of the module-related issues.
<zao>
jbjnr: Yeah, it's friendlier toward individual developers/users, while EB is more about setting up a whole cluster site up-front.
<jbjnr>
I thought we were requiring hwloc 2.0. Can we also bump our requirements up for other tools? llvm from 3.8 to 10.0 covers a very large range, and gcc from 4.9 to 9.3 is a lot
<jbjnr>
I will get spack to automatically install all versions of all of these for our rostam build matrix
<jbjnr>
(I made up the jemalloc and perftools versions because we don't require them)
<heller1>
do we really support clang 3.8 and gcc 4.9 still?
<heller1>
yeah, that's quite a number of configurations
<ms[m]>
jbjnr: it's minimum gcc 7 on master now
<ms[m]>
don't remember which clang
<ms[m]>
but that's a massive list in any case
<ms[m]>
the only wish I have is that we at least have some set of configurations that we always test (you can have random configurations on top of that if you'd like)
<jbjnr>
ok gcc 7 it is
<jbjnr>
Need to reduce the clang versions. I'll pick 7.0 as the lowest test value unless told otherwise
<jbjnr>
can we insist on hwloc 2 as well?
<jbjnr>
and raise the boost number?
<jbjnr>
maybe I should ignore the versions superseded by a point release of everything too
<jbjnr>
actually spack is doing that for me already, I think
<ms[m]>
it's clang 5 at the moment
<ms[m]>
boost will probably be minimum 1.64 at the next release (10 versions)
<ms[m]>
we might be able to bump clang as well
<ms[m]>
one version per major version is enough for compilers (there are some regressions within the minor versions but fixes for those are case-by-case anyway)
<heller1>
jbjnr: I think we should only test > recommended
<jbjnr>
good idea, then we can bump hwloc recommended to 2.0
<jbjnr>
and bump clang up a bit for recommended maybe ...
<heller1>
yeah, maybe the prior to the latest released version?
nikunj97 has joined #ste||ar
<ms[m]>
I removed the recommended versions from the latest docs because either we support a version or we don't, and most of the time there are no practical differences for the user
<ms[m]>
minimum and latest version should at least be tested
<zao>
ms[m]: Sorry if I was a bit of a butt the other day, everything was bad :D
hkaiser has joined #ste||ar
<ms[m]>
zao: lol
<ms[m]>
you weren't and even if you had been, hpx can be a bit of a butt so it would only be fair ;)
<ms[m]>
do feel free to bug us about all the issues you had if they're still relevant
<ms[m]>
and that reminds me that I should fix that message about ignoring build types...
<zao>
I'll probably give HPX a few tries, made some progress on it and found how executors were inherited or not.
<zao>
Is there any way to `maybe_get()` a future? Give it a bit of a push and if it resolves fully get the thing, but if it doesn't, continue with the task you were on?
<zao>
Right now I hpx::this_thread::yield(), but I don't necessarily want to give up my whole time slice forever.
<zao>
As little as possible, if it makes sense, as my normal loop work on it is time-sensitive.
<hkaiser>
zao: future::is_ready() ?
<hkaiser>
or future::ready() for that matter, don't remember
<hkaiser>
'giving it push' is not possible :/
<zao>
I was trying to use ready(), but that didn't help it make progress.
<zao>
(on a single-thread executor, with the future on the same executor)
<zao>
If I was using Asio, I'd do a bunch of `io_context::poll_one()` to process any eventual pending main thread work.
mcopik has joined #ste||ar
<hkaiser>
we don't have a way of doing this except yielding
<hkaiser>
but then you could simply call get() on the future and be done with it
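[Editor's note: a sketch of the polling pattern under discussion. do_main_loop_work stands in for the caller's time-sensitive work; future::is_ready() is the HPX extension hkaiser mentions.]

```cpp
#include <hpx/include/async.hpp>
#include <hpx/include/threads.hpp>

int poll_for_result()
{
    hpx::future<int> f = hpx::async([] { return 42; });

    // Non-blocking check: only call get() once the result is there; until
    // then keep doing the main-loop work and yield so other tasks can run.
    while (!f.is_ready())
    {
        do_main_loop_work();        // hypothetical: the caller's own work
        hpx::this_thread::yield();
    }
    return f.get();
}
```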
mcopik has quit [Client Quit]
<jbjnr>
I just did a test on rostam and my simple boost examples worked if I installed boost with c++14 and compiled my tests with either c++14 or c++17 - do we really believe you can mix c++14 and c++17 together? It has never worked for me before, but maybe my env was messed up
<jbjnr>
if I can just build one flavour of boost (per compiler) instead of 2 - it is a big saving
<hkaiser>
jbjnr: gcc claims to be ABI compatible, no idea how much this is actually true
<jbjnr>
I guess I can try installing boost with cxx14 and just see what gives ....
<zao>
I should see if I can get my gosh-darn Matrix server to federate some day.
rtohid has joined #ste||ar
<hkaiser>
jbjnr: btw, from the list of combinations you plan to run on rostam - I think we shouldn't run more than 10 different variations, otherwise we will overwhelm that machine
<hkaiser>
jbjnr: also we should avoid duplicating configurations with daint
<jbjnr>
pycicle can pick random combinations of libs and flags
<hkaiser>
how would that be reproducible?
<jbjnr>
I'm not planning on building all of them, all of the time
<jbjnr>
no way!
<jbjnr>
it generates a string of flags and libs that can be pasted back into the launch command to reproduce a failed build
<ms[m]>
jbjnr: we do want a fixed subset though that's always run
<ms[m]>
also gsoc meeting with weilewei in 35 minutes, right?
<jbjnr>
That's not a problem - it's all the stuff that fails constantly the rest of the time that annoys me
<weilewei>
ms[m] Right
<jbjnr>
the meeting was arranged to talk about dca, we'll just go over gsoc at the end because we're already online
<ms[m]>
👍️ (feel free to ping me if you're done earlier with the dca stuff)
<weilewei>
ms[m] will do!
kale_ has joined #ste||ar
<diehlpk_work_>
Our application for Season of Docs was not successful :(
gonidelis has joined #ste||ar
kale_ has quit [Ping timeout: 260 seconds]
nan11 has joined #ste||ar
Nikunj__ has joined #ste||ar
bita_ has joined #ste||ar
kale_ has joined #ste||ar
nikunj97 has quit [Ping timeout: 272 seconds]
karame_ has joined #ste||ar
<weilewei>
jbjnr ms[m] hkaiser I just sent a GSoC weekly meeting invite - see if that works for your schedule; it runs from next week to August 24 (the end of GSoC)
<ms[m]>
weilewei: thanks!
<hkaiser>
weilewei: I can't make it at that time, at least until mid June
<weilewei>
hkaiser or would you please suggest an alternative time?
<hkaiser>
weilewei: pls coordinate with Katie
nikunj97 has joined #ste||ar
<weilewei>
hkaiser ok
<hkaiser>
I could meet 30 minutes earlier, i.e. Mondays 8:30am
<weilewei>
jbjnr ms[m] does 30 mins earlier work for you? that will be 3:30pm in your time...
<jbjnr>
fine for me
Nikunj__ has quit [Ping timeout: 252 seconds]
<ms[m]>
weilewei: fine for me as well
<Yorlik>
hkaiser: YT?
<hkaiser>
Yorlik: here
<Yorlik>
You actually had implemented sync_execute
<Yorlik>
But it doesn't work.
<hkaiser>
what doesn't work?
<Yorlik>
I compiled the PR and switched back to the autochunker
<Yorlik>
Same error as before
<hkaiser>
hmm
<hkaiser>
does it invoke sync_execute on your executor?
<Yorlik>
Nope
<Yorlik>
I get no output
<hkaiser>
what executor traits do you define? one_way, two_way?
<hkaiser>
ahh, you probably copied things from the example
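[Editor's note: the traits in question. A sketch of the specializations an executor of that period would provide so HPX selects the two-way/bulk code paths; my_executor is hypothetical.]

```cpp
#include <type_traits>

struct my_executor { /* bulk_async_execute etc. */ };

namespace hpx { namespace parallel { namespace execution {

    // Without these, HPX may treat the type as a plain one-way executor
    // and never call bulk_async_execute.
    template <>
    struct is_two_way_executor<my_executor> : std::true_type {};

    template <>
    struct is_bulk_two_way_executor<my_executor> : std::true_type {};
}}}
```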
<gonidelis>
How can I run/check the tests under `hpx/libs/algorithms/tests`? I get that I need to use `ctest`...
kale_ has quit [Ping timeout: 260 seconds]
<gonidelis>
?
<hkaiser>
ctest allows you to specify the targets to run
<ms[m]>
I got it on one of my prs and I think I'm up to date with master...
<gonidelis>
1. there is no modules dir under `tests/unit/...`. So what's that supposed to mean?
kale_ has joined #ste||ar
<ms[m]>
gonidelis: tests/unit/... is where all the tests used to be
kale_ has quit [Remote host closed the connection]
<hkaiser>
ms[m]: sec
<Yorlik>
hkaiser: I'm getting a strange runtime error with the new build - I remember that was the reason why I had switched back quickly and forgotten about it. To make sure the error is not on my side I'll recompile the PR, though I think I had my local source tree right.
<rori>
gonidelis: the tests related to a module are located in the module directory, i.e. `libs/<module_name>/tests/unit`
<ms[m]>
we've been moving things piecewise into libs/modulename/tests/unit
<ms[m]>
but there are still quite a few tests left in tests/unit
<ms[m]>
they just don't belong to any particular module yet
<ms[m]>
hkaiser: np, irc overload!
<rori>
gonidelis: and the tests.unit.modules.<> is just a cmake target
<gonidelis>
So we write the tests in the HPX source at `libs/<module_name>/tests/unit` and then these tests are compiled through the build directory, as this is where their target is specified?
<gonidelis>
Also I can see that some headers lie at `libs/algorithms/tests/unit/container_algorithms` while there exists a `libs/algorithms/include/...` directory which leads to corresponding headers... (???)
<rori>
And if you follow the call hierarchy you see that we add the test target, for example `tests.unit.modules.affinity`, and that we add the dependency to `tests.unit.modules` so that it's built if you just `make tests.unit.modules` (see [here](https://github.com/STEllAR-GROUP/hpx/blob/master/cmake/HPX_AddTest.cmake#L164-L167))
<rori>
gonidelis: Not sure I understood your second question ^^
<hkaiser>
ms[m]: I didn't fix this warning, just ignored it ;-)
<Yorlik>
hkaiser: It works - don't ask what I did wrong - no one knows ...
<hkaiser>
Yorlik: \o/
<Yorlik>
I'm in the process of overhauling our build system - it's actually getting better, but I think some bone splinters got spilled while the sausage was made :)
<gonidelis>
rori WOW!!!! Great! Thank you. The spaghetti unwrapping just amazes me so much... thanks. As for the second question: I can see that there are some hpp's under `/libs/algorithms/tests/unit/container_algorithms` for example. But there is also a directory that I reckon is used for includes (headers) at
<gonidelis>
`/libs/algorithms/include/hpx/parallel/container_algorithms`... what's the difference? Sorry if I confused you
<ms[m]>
hkaiser: sneaky ;) but thanks
<ms[m]>
I'll see if I understand where it's coming from, otherwise we should exclude that file...
<hkaiser>
ms[m]: I think it's a problem in cmake-format itself
<hkaiser>
Yorlik: could you comment on the PR, please?
<Yorlik>
Yep - sry
<ms[m]>
ok
<Yorlik>
Did test in RelWithDebInfo only
<rori>
gonidelis: so as the path indicates, if you find some headers under `/tests/unit/container_algorithms` it means they are headers used for testing; there is usually the word `test` in them to avoid confusion
<rori>
The others are the headers of the `algorithms` module
<Yorlik>
hkaiser: Done !
<hkaiser>
thanks
<Yorlik>
heller1: yt?
karame_ has quit [Quit: Ping timeout (120 seconds)]
<gonidelis>
rori ah my bad... stupid overlook. Help appreciated a lot...
<hkaiser>
heller1: would you have some time to talk at some point?
<heller1>
hkaiser: sure! once Yorlik is done?
<hkaiser>
any time
<Yorlik>
Done ;)
<heller1>
hkaiser: ready whenever you are
<bita_>
hkaiser, 1158 relies on 1159. So I will rebase that after 1159 is merged, if that is okay
<heller1>
hkaiser: if you want, we can use the same link as above
nikunj97 has quit [Read error: Connection reset by peer]
<heller1>
hkaiser: ping?
<heller1>
calling it a day now...
<hkaiser>
bita_: sure
<Yorlik>
And another bunch of leaks cleared ... seems I'm getting orderly today :)
rtohid has left #ste||ar [#ste||ar]
<bita_>
hkaiser, can I ask a question?
<hkaiser>
sure
<bita_>
I am trying to find where we decide that we can store to slice but not to slice_column/slice_row..., so I would be able to redirect them to slice_assign too. Would you please guide me on where I should look?
<hkaiser>
bita_: not sure I understand
<bita_>
I even looked into variable, but cannot see where it happens
<hkaiser>
ahh, I think I know what you mean
<hkaiser>
the store primitive understands slicing
<bita_>
store(slice(a,1,0),val) works but store(slice_column(a,0),val) doesn't
<hkaiser>
and the physl compiler is doing some trickery to make it happen
<bita_>
yes...
<hkaiser>
store(slice(a,1,0),val) is translated to store(a, val, 1, 0)
<bita_>
which file does this translation?
<hkaiser>
aren't store_row/store_column just shortcuts for slice?
<bita_>
yes, but unfortunately they are only able to extract, not assign
nikunj has quit [Remote host closed the connection]
<hkaiser>
sec
nikunj has joined #ste||ar
<hkaiser>
I mean, shouldn't it be possible to rewrite slice_row(...) as store(..., ???) ?
<hkaiser>
similarly for slice_column
<bita_>
if I know where the translation is , I would be happy to write it :")
<hkaiser>
slice_column/_row could be handled similarly
<bita_>
uhum, thanks
<bita_>
got it
nan11 has joined #ste||ar
<hkaiser>
bita_: let me repeat, instead of writing slice_column(centroids, 0) in the kmeans code you should be able to write slice(centroids, nil, 0), correspondingly, instead of slice_row(x, 1) you should be able to write slice(x, 1, nil)
<hkaiser>
those are equivalent
<hkaiser>
same when used with store()
<Yorlik>
Seems I'm almost leak-free now. I let 100k objects update for some minutes and only have some single-count leaks from stuff that is not significant or not my responsibility.
<hkaiser>
at least as long as the variable is 2d - but for others slice_column/row are undefined anyway
<hkaiser>
Yorlik: the C library leaks things as well, mostly objects in global scope
<Yorlik>
I'll comb through the remaining leaks later - I'm pretty happy with the state of the leak situation now
<hkaiser>
bita_: might be the easiest - but feel free to dig around in the compiler if you like ;-)
<bita_>
I am trying to make slice distributed, so it was fun reading those files. I am changing it in 1167 and if it was a performance issue, we can change that later
<hkaiser>
ok
<Yorlik>
I'm getting a " boost::wrapexcept<boost::bad_any_cast>" exception when trying to use --hpx:print-counter=/threads{locality#0/total}/idle-rate
<Yorlik>
Counters are on in my compile settings (which I didn't change anyway)
<Yorlik>
Is something wrong with the param syntax?
<Yorlik>
I wonder if it's somehow conflicting with my use of program-options - need to check this
nan11 has quit [Remote host closed the connection]