#ste||ar on 2017-07-31 — irc logs at irclog.cct.lsu.edu

2017-05-17 13:54 aserio changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/

00:00 <K-ballo> maybe something more elaborated, like this https://github.com/llvm-mirror/libcxx/blob/master/cmake/Modules/CheckLibcxxAtomic.cmake

00:00 <hkaiser_> cool

00:10 diehlpk has quit [Ping timeout: 246 seconds]

00:12 Matombo has quit [Remote host closed the connection]

00:33 <K-ballo> there's no license in that file, does that make it public domain?

00:45 diehlpk has joined #ste||ar

01:26 <hkaiser_> K-ballo: the whole repo has a license, no?

02:02 K-ballo has quit [Quit: K-ballo]

02:04 deep-book-gk has joined #ste||ar

02:06 deep-book-gk has left #ste||ar [#ste||ar]

02:15 hkaiser_ has quit [Quit: bye]

02:47 diehlpk has quit [Ping timeout: 240 seconds]

02:48 mars0000 has joined #ste||ar

03:42 mars0000 has quit [Quit: mars0000]

04:29 mars0000 has joined #ste||ar

04:44 mars0000 has quit [Ping timeout: 240 seconds]

05:22 vamatya has joined #ste||ar

05:24 vamatya_ has joined #ste||ar

05:26 vamatya has quit [Ping timeout: 246 seconds]

06:49 vamatya_ has quit [Ping timeout: 276 seconds]

07:56 bikineev has quit [Remote host closed the connection]

07:57 <jbjnr> Is anyone onlibne this morning?

07:57 <jbjnr> ^online

07:57 <heller> jbjnr: yes!

07:58 <jbjnr> aha. I've just seen your reply on slack to. I was doing a test to see which would get a reply faster. You blew it by using slack before I tried IRC.

07:58 <jbjnr> So I have to check slack too now :(

08:00 bikineev has joined #ste||ar

08:08 <heller> ;)

08:08 <heller> just a coincidence

08:10 mcopik has joined #ste||ar

08:25 <jbjnr> heller: I've just been asked to confirm that we are both available for the HPX course on 5-6 Oct before they send out the formal announcement. You're still in yes?

08:26 <jbjnr> (You cannot say no)

08:26 <heller> I am still in, yes

08:26 <jbjnr> great. thanks

08:27 <jbjnr> So announcement of course should go out today then

08:27 <jbjnr> Just been talking to people about replacing octotiger

08:38 Matombo has joined #ste||ar

08:43 <jbjnr> Announcement just went oot for hpx course

08:49 Matombo has quit [Remote host closed the connection]

08:50 Matombo has joined #ste||ar

08:50 Matombo has quit [Remote host closed the connection]

08:51 Matombo has joined #ste||ar

08:56 Matombo has quit [Remote host closed the connection]

08:56 Matombo has joined #ste||ar

09:05 <heller> jbjnr: replacing octotiger?

09:06 <jbjnr> heller: On the last GB call they told us that dominic was moving to another position and his time on octotiger would be limited. We can continue developing, it, or we can start looking for a new flagship HPX HPC app to push.

09:06 <github> [hpx] StellarBot pushed 1 new commit to gh-pages: https://git.io/v78BN

09:06 <github> hpx/gh-pages c4a57e5 StellarBot: Updating docs

09:07 <heller> jbjnr: ahh, makes sense

09:08 <jbjnr> Since my group is involved in a large project that has just been funded, potentially one of the apps in there might want to be an HPX flagship project.

09:08 <jbjnr> So I was chatting about possibilities

09:08 <jbjnr> no takers yet

09:10 <jbjnr> I'm going to approach these guys at some point http://icc.dur.ac.uk/swift/

09:10 <jbjnr> they have written their own task based runtime.

09:10 <jbjnr> They should use hpx instead.

09:19 <heller> jbjnr: sounds good!

09:20 <heller> "SWIFT: Using task-based parallelism, fully asynchronous communication, and graph partition-based domain decomposition for strong scaling on more than 100,000 cores"

09:20 <heller> this sounds like our turf

09:20 <jbjnr> They are also one of the projects that are funded to work on our machine as part of the next big set of projects

09:21 <jbjnr> so ideal from all bureaucratic angles too

09:21 <heller> except mine

09:22 <jbjnr> ?

09:22 <jbjnr> well, we could invite them to join this FET project?

09:22 <heller> I have no relation to it except it being yet another cool simulation :)\

09:22 <heller> ha

09:22 <heller> yes

09:22 <jbjnr> then all angles are covered

09:22 <jbjnr> since we need a new application anyway

09:22 <heller> good thinking

09:23 <jbjnr> I need to speak to a chap here at CSCS, but he's on vacation currently, so it may be a few weeks before I do anything about this.

09:24 <heller> the FET thingy needs to be acted upon rather quickly

09:25 <heller> and also: top priority getting the thesis done

09:45 Matombo has quit [Ping timeout: 248 seconds]

09:58 Matombo has joined #ste||ar

10:00 <jbjnr> How quickly does the FET thing need doing?

10:05 <heller> submission date is 24th september

10:05 <jbjnr> shit, that's soon.

10:05 <jbjnr> ok. I will send some emails out ...

11:21 bikineev has quit [Remote host closed the connection]

11:33 K-ballo has joined #ste||ar

11:38 diehlpk has joined #ste||ar

12:00 taeguk[m] has quit [Ping timeout: 258 seconds]

12:00 thundergroudon[m has quit [Ping timeout: 240 seconds]

12:07 <zao> FET?

12:08 <zao> Ah, "Future and Emerging Technologies"

12:08 <zao> Only ever knew what the NLA part of NLAFET meant :)

12:08 <zao> (my lads are targetting StarPU, IIUC)

12:09 <jbjnr> https://ec.europa.eu/research/participants/portal/desktop/en/opportunities/h2020/topics/fethpc-02-2017.html

12:11 hkaiser has joined #ste||ar

12:14 hkaiser has quit [Read error: Connection reset by peer]

12:20 hkaiser has joined #ste||ar

12:31 diehlpk has quit [Ping timeout: 246 seconds]

12:56 eschnett has quit [Quit: eschnett]

13:17 eschnett has joined #ste||ar

13:30 diehlpk_work has joined #ste||ar

13:35 diehlpk_work has quit [Ping timeout: 246 seconds]

13:35 diehlpk_work has joined #ste||ar

13:37 <hkaiser> jbjnr: I'm planning to all I can to get new money into octotiger

13:38 <hkaiser> but having more than one HPX flagship application is good as well

13:38 <jbjnr> hkaiser: new money would be great, but without dominic ....

13:38 <hkaiser> with money comes dominic

13:38 <jbjnr> so has he changed depts or something?

13:38 <hkaiser> as said on the call we have 2 proposals in the pipeline

13:38 <hkaiser> not even changed depts, just another group

13:40 <hkaiser> jbjnr: and there is still that big project looming where a postdoc is a marginal expense

13:40 <jbjnr> ok. when he said "new boss" I assumed it meant new dept or something. Why doesn't his new boss want him to work on octobaby

13:40 <hkaiser> shrug, have not talked to him, but planning to

13:41 <jbjnr> when will I be able to tell my bosses that hpx has new funding = big project?

13:41 <hkaiser> well, we have got the promise ;)

13:41 <jbjnr> $$$ > promise

13:41 <hkaiser> indeed!

13:42 <zao> Watch out, the future may throw!

13:42 <hkaiser> lol

13:42 <hkaiser> anyways

13:42 <hkaiser> gtg

13:42 hkaiser has quit [Quit: bye]

13:44 mcopik has quit [Ping timeout: 255 seconds]

13:47 aserio has joined #ste||ar

13:56 <diehlpk_work> aserio, yt?

14:04 <aserio> Yes

14:04 <aserio> diehlpk_work: ^^

14:04 <diehlpk_work> We set up a skype meeting today wiht Hartmut or?

14:05 <aserio> yes

14:05 <aserio> but that will take place in an hour

14:05 <diehlpk_work> At 10 my time?

14:06 <diehlpk_work> Ok, my fault. I had 10 my time in my calendar

14:16 <jbjnr> heller: yt?

14:17 <heller> jbjnr: yes

14:17 <jbjnr> quick question ...

14:17 <heller> shoot

14:18 <jbjnr> I've noticed something strange - when I dump out the topology info (on the resource partitioner branch, so possibly dodgy), it shows the correct information. 72 pus on 36 cores, 2 numa domains. etc. Later when the code is running, I dump out the topology info and it tells me 24 core, 2 domains, 48 pus.

14:18 <jbjnr> any idea why hwloc might 'change it's mind' mid way though a program?

14:19 <heller> strange indeed

14:19 <jbjnr> yes indeedy

14:19 <heller> what happens in between those two calls?

14:19 <jbjnr> some matrix stuff

14:19 <jbjnr> nothing earth shattering.

14:19 <jbjnr> I'm looking for numa related issues and found this oddity

14:19 <heller> are the information outputted using the same functionality? Or different code to print that information?

14:19 <jbjnr> same everything. even same 'this' pointer on the topo class

14:20 <heller> 72 PUs, you might want to check your HPX_MAX_CPU_COUNT cmake variable

14:20 <jbjnr> set to 96, but tried 256 previously

14:20 <heller> ok

14:21 <heller> no idea why it changed its mind ...

14:21 hkaiser has joined #ste||ar

14:21 <heller> could it be that the RP changes some internal, global bitmasks?

14:21 <jbjnr> I'll rebuild hwloc just in case a new vresion fixed anything

14:22 <hkaiser> heller: what happens ?

14:22 <heller> [16:18:29] <jbjnr> I've noticed something strange - when I dump out the topology info (on the resource partitioner branch, so possibly dodgy), it shows the correct information. 72 pus on 36 cores, 2 numa domains. etc. Later when the code is running, I dump out the topology info and it tells me 24 core, 2 domains, 48 pus.

14:22 <jbjnr> well it does, but nothing that would casue this. I've been poking around at it over the weekend

14:23 <heller> ok, that's the only plausible explanation I have, that you somehow mess with the bitmasks

14:23 <jbjnr> yes. I was looking for changes that affect the thread id, pu_masks everything else, but have not uncovered anything unusual

14:24 <hkaiser> jbjnr: I might have screwed up

14:24 <jbjnr> lets blame heller anyway though

14:24 <hkaiser> ok

14:24 <jbjnr> more likele me than you

14:24 <jbjnr> though

14:25 <jbjnr> hkaiser: if there's a comit where you made hwloc related changes, please let me know and I'll have a look

14:25 <hkaiser> none, afair

14:25 <heller> blaming me always works

14:26 <hkaiser> how to reproduce this?

14:26 <jbjnr> hkaiser: not easy. I found it by accident whilst dumping info out from the matrix code. looking for reasons why my numa related changes didn't help. Turned out to be something else, but this is bothering me

14:27 <hkaiser> well, tell me how to reproduce and I'll look into it for you ;)

14:35 <heller> woah, why does everything I touch lead to excessive compile times?

14:36 <hkaiser> the world wants you not to procrastinate

14:37 <heller> a full compilation of my thesis takes about 2 minutes :/

14:38 <jbjnr> heller: sing to the tune of beyonce - "if you like it then you'd better put a template on it"

14:38 <heller> :D

14:38 thundergroudon[m has joined #ste||ar

14:38 <K-ballo> the heller's touch

14:38 <jbjnr> correct

14:39 <K-ballo> heh, the irony...

14:39 <zao> Simply defer everything until modules or the heat death of the universe.

14:39 <zao> Whichever happens first.

14:40 <heller> but: https://pasteboard.co/GDuKOhy.png

14:41 <jbjnr> nice. 160 pages so far. very good

14:44 taeguk[m] has joined #ste||ar

14:47 <github> [hpx] hkaiser closed pull request #2788: Adapt parallel::is_heap and parallel::is_heap_until to Ranges TS. (master...tg_is_heap_range) https://git.io/v7lCt

14:57 aserio has quit [Ping timeout: 246 seconds]

14:58 diehlpk has joined #ste||ar

15:07 Reazul has quit [Quit: Page closed]

15:11 Reazul has joined #ste||ar

15:14 <Reazul> Hello. :) As suggested previously I started by writing a chain of tasks in shared memory. I am not being able to use the future.then() correctly, can you please suggest the right way of doing that. https://pastebin.com/03BhnndN

15:17 <jbjnr> Reazul: future.then takes an argument that is the result type of the future becoming ready. Have a look though some of the examples and unit tests for ideas - for example https://github.com/STEllAR-GROUP/hpx/blob/master/examples/future_reduce/rnd_future_reduce.cpp#L78

15:18 <Reazul> @jbjnr: Thanks :)

15:18 <jbjnr> that example is a bit more complicated than you need, cos it;s a vector of futures...

15:18 <jbjnr> try this in stead ...

15:19 <Reazul> So in that example if there's no lambda we call future.then like this hpx::future<int> result = all_ready.then(reduce);

15:20 <Reazul> That is what I was trying to replicate

15:20 <Reazul> In my case I am trying on a single future instead of a vector.

15:21 <jbjnr> hold on, let me look at your example again

15:21 <Reazul> Thanks

15:23 vamatya has joined #ste||ar

15:26 <heller> Reazul: line 39

15:26 <heller> Reazul: futures[id-1].then( hpx::async<action_type>(here, id) )

15:26 <Reazul> yes

15:26 <heller> that's not how it works

15:27 <Reazul> ok

15:27 <heller> .then expects something callable

15:27 <heller> hpx::async returns a future

15:27 <heller> which is not callable

15:27 <jbjnr> an_action, here, id)

15:27 <Reazul> right

15:27 <jbjnr> https://gist.github.com/biddisco/0a899c6ba705e3e7033f0a731f18233b

15:27 <jbjnr> sorry. pasted wrong thing

15:27 <jbjnr> Reazul: try something more like that

15:27 <heller> for your simple prototype, you wouldn't even need actions

15:28 <heller> note to self: get rid of this horrible horrible hello world example

15:28 <jbjnr> indeed. if you only call action on 'here' no need for them

15:28 <jbjnr> just call the worker function directly

15:28 <Reazul> I tried without the aaction, couldn't get it to compile ( I am very new to c++)

15:28 <Reazul> I use C most of the time so debugging what I am doing wrong will take forever

15:29 <heller> also, task_worker would need to accept a future<void>

15:29 <jbjnr> (lambda capture would need to use id in the [id] capture)

15:29 mars0000 has joined #ste||ar

15:29 <jbjnr> ah. I forgot to add the function param in the lambda. I ive up

15:29 <heller> Reazul: since you are new to C++, it would be great to have a prototype without those tasks, which are at this point just noise

15:30 <heller> try to model the problem you are trying to solve in plain C++ first

15:30 <jbjnr> heller: we need an online compiler that has hpx built into it like the code pastebin sites that allow you to use differnt boost/gcc etc

15:30 <heller> or even C...

15:30 <heller> jbjnr: we do need that indeed

15:30 EverYoung has joined #ste||ar

15:30 <jbjnr> then we could type code snippets without trying to compile them in our head

15:30 EverYoung has quit [Remote host closed the connection]

15:30 <heller> jbjnr: ah come one, where would be the fun then?

15:31 <heller> on*

15:31 <Reazul> @heller: I have prototype of what I am trying to achieve in other runtimes that are in C

15:31 EverYoung has joined #ste||ar

15:31 <heller> ok, do you also have a prototype of a plain serial version?

15:32 <Reazul> @heller: for hpx?

15:32 <heller> for your algorithm you want to parallelize

15:32 <Reazul> or just calling 4 functions serially?

15:33 <heller> if it's just calling 4 functions serially, sure

15:33 <heller> once you have that

15:33 diehlpk has quit [Remote host closed the connection]

15:33 <heller> you can start by invoking the first function asynchronously and wait for it explicitly

15:34 <heller> once you got that to work

15:34 <jbjnr> OFG heller hkaiser guess who is today's donkey of the week winner?

15:34 <heller> you can add a continuation to call the second function etc.

15:34 <heller> jbjnr: you?

15:34 <jbjnr> correct

15:34 <heller> what was the problem?

15:34 <jbjnr> so 72 (pus) in hex is 48, and 36 cores inhex is 24.

15:35 <hkaiser> lol

15:35 <hkaiser> good one

15:35 <jbjnr> my std::cout had the hex flag set, so the numbers are in hex not dec

15:35 <hkaiser> we shuld print to base 13

15:35 <heller> ;)

15:35 <heller> Reazul: https://github.com/STEllAR-GROUP/tutorials/blob/master/examples/03_fibonacci/fibonacci_futures.cpp <--- you might be interested in that one

15:37 <Reazul> ok, Honestly I have gone though the examples, not fibo but hello_world, reduction, ag etc. Since I am having a hard time understanding the syntax I am trying to replicate the examples and alter them.

15:38 <heller> yes

15:38 <Reazul> It would be super helpful if there was examples of how to acieve simple things like create a task, wait for a task, create links between tasks etc

15:38 <heller> hello_world and ag are really bad examples

15:38 <heller> I agree

15:39 <Reazul> I apologize for bothering you all so much :), I am just trying to learn :)

15:39 <heller> http://de.cppreference.com/w/cpp/thread/async

15:39 <heller> FFS

15:39 <heller> http://en.cppreference.com/w/cpp/thread/async

15:40 <heller> that one

15:40 <heller> at the bottom, there is an example

15:40 <heller> replace std:: with hpx:: and you have a winner

15:40 <Reazul> :) thanks, let me tyr that

15:40 <heller> http://en.cppreference.com/w/cpp/thread/future

15:40 <heller> future is here, but that lacks the continuation extensions

15:41 <heller> you should get it to work without that first though ;)

15:41 <Reazul> this is what I got, try calling the same function(task) 4 times with async. (first step)

15:42 <Reazul> then try to use future to link them

15:42 <Reazul> I will try doing that and get back. Thank you for the suggestions

15:42 <heller> so everything but future.then works?

15:43 <Reazul> I tried with out the then it worked, with multiple threads I get different order of tasks

15:43 <heller> https://github.com/STEllAR-GROUP/hpx/blob/master/examples/quickstart/fibonacci_futures.cpp#L90

15:43 <heller> look here

15:43 <Reazul> ok

15:44 <jbjnr> Reazul: https://gist.github.com/biddisco/0a899c6ba705e3e7033f0a731f18233b

15:45 <zao> /usr/include/c++/6/atomic:236: undefined reference to `__atomic_load_16'

15:45 <jbjnr> that's a version of your example that compiles. I did not run it

15:45 <zao> Woo-hoo!

15:45 <jbjnr> gtg

15:45 <zao> I guess you people haven't fixed the std-atomic branch yet, eh? :P

15:47 <Reazul> @jbjnr: Thanks

15:47 <K-ballo> zao: the std-atomic branch is on hold, try the std-atomic-lite one

15:47 <K-ballo> last build on circle caused an ICE due to deprecated algorithms somehow.......... need to look into that

15:49 david_pfander has quit [Ping timeout: 276 seconds]

16:07 pree__ has joined #ste||ar

16:17 akheir has quit [Remote host closed the connection]

16:36 EverYoung has quit [Ping timeout: 246 seconds]

16:41 <zao> The warnings for siple_central_tuplespace_client_exe still scare the crap out of me.

16:42 <hkaiser> zao: yah

16:42 <hkaiser> they are valid but harmless ;)

16:43 <hkaiser> not sure what to do about them

16:43 <K-ballo> is that the simple_base_component one?

16:43 <K-ballo> copy constructor calling the base's default?

16:43 <zao> goddammit, gist down.

16:43 <hkaiser> no, the warning issued by the small object optimization

16:43 <zao> http://paste.ubuntu.com/25214388/

16:43 <K-ballo> oh, yeah..

16:43 <K-ballo> could use constexpr if

16:44 <hkaiser> good one

16:44 <hkaiser> well, I could factor it out, I guess

16:44 <hkaiser> zao: let me see what I can

16:44 <hkaiser> do

16:44 <K-ballo> I decided not to back then, but back then there was neither the warning nor the constexpr if

16:47 patg[[w]] has joined #ste||ar

16:47 bikineev has joined #ste||ar

16:47 <zao> https://paste.ubuntu.com/25214409/

16:48 <zao> I like the huge wall of instantiations followed by "there's const and stuff"

16:48 EverYoung has joined #ste||ar

16:52 mars0000 has quit [Quit: mars0000]

16:52 <zao> /home/zao/slask/stellar/hpx/tests/unit/component/copy_component.cpp:159:1: fatal error: error writing to /tmp/ccTgRLNU.s: No space left on device

16:53 <zao> That's it for today, I'm going home :D

16:56 <zao> I guess that's what I deserve for running the OS and HPX build off a 74G HDD.

16:59 zbyerly_ has joined #ste||ar

17:03 <github> [hpx] hkaiser created fixing_any_warning (+1 new commit): https://git.io/v744w

17:03 <github> hpx/fixing_any_warning c55df05 Hartmut Kaiser: Circumvent scary warning about placement new creating object of larger size than space is available

17:04 <hkaiser> zao: ^^

17:05 <zao> Yay.

17:05 <zao> I need to rebuild this machine, system disk is too small to hold HPX :)

17:08 mcopik has joined #ste||ar

17:27 akheir has joined #ste||ar

17:31 aserio has joined #ste||ar

17:33 hkaiser has quit [Quit: bye]

17:42 <patg[[w]]> pree__: wash I can attend just ping me

17:49 <patg[[w]]> parsa[w]: yt??

17:49 <patg[[w]]> parsa[w]: ping

17:51 <pree__> parsa[w] , wash : in #ste||ar-gsoc channel right in 15 minutes

17:51 <pree__> Thanks

17:52 <patg[[w]]> 8 minutes

17:53 <pree__> sorry

17:53 <pree__> in 8 minutes

17:59 hkaiser has joined #ste||ar

17:59 pree__ has quit [Ping timeout: 240 seconds]

18:01 <wash> pree__?

18:01 <hkaiser> aserio: yt?

18:01 <wash> I'm in that channel, looks like he disconned...

18:02 <aserio> hkaiser: yes

18:02 <hkaiser> see pm, pls

18:10 pree_ has joined #ste||ar

18:16 pree_ has quit [Ping timeout: 246 seconds]

18:32 pree_ has joined #ste||ar

18:47 mars0000 has joined #ste||ar

18:54 bikineev has quit [Remote host closed the connection]

19:22 hkaiser has quit [Read error: Connection reset by peer]

19:24 hkaiser has joined #ste||ar

19:24 aserio has quit [Ping timeout: 246 seconds]

19:30 eschnett has quit [Quit: eschnett]

19:38 aserio has joined #ste||ar

19:44 <Reazul> Is there any example showing how I can create data, distribute it in HPX? I have this program that seems to work in the way I want https://pastebin.com/jELqZWQX . I tried it with openmpi on 6 nodes and it behaves correctly. Now I want to allocate data and distribute it.

19:51 <wash> Reazul what do you mean by tried it with openmpi?

19:51 <wash> Reazul: there's a few ways to distribute data..

19:52 <wash> Reazul: you can use something like partitioned_vector, or hpx::new_. Or you can write your own component types and use them to build distributed entities

19:53 <wash> hkaiser: is there a good partitioned_vector example he can look at?

19:53 <Reazul> @wash: I meant I compiled HPX to use MPI and then tried the example I posted on 6 nodes.

19:55 bikineev has joined #ste||ar

19:56 aserio has quit [Quit: aserio]

19:58 <wash> Reazul: just making sure :).

19:59 <Reazul> :D

19:59 bikineev has quit [Ping timeout: 258 seconds]

20:04 pree_ has quit [Quit: AaBbCc]

20:04 patg[[w]] has quit [Quit: Leaving]

20:17 <github> [hpx] hkaiser force-pushed resource_partitioner from 9b8e3e1 to 8eee17e: https://git.io/v7lfK

20:17 <github> hpx/resource_partitioner 8eee17e Hartmut Kaiser: Adding pool specific performance counters...

20:17 <hkaiser> Reazul: do you have your local example working now?

20:18 <hkaiser> jbjnr: see #2789!

20:18 <github> [hpx] hkaiser opened pull request #2789: Resource partitioner (master...resource_partitioner) https://git.io/v74AS

20:20 <github> [hpx] hkaiser force-pushed fixing_any_warning from c55df05 to c8d310e: https://git.io/v74xk

20:20 <github> hpx/fixing_any_warning c8d310e Hartmut Kaiser: Circumvent scary warning about placement new creating object of larger size than space is available

20:22 eschnett has joined #ste||ar

20:22 <jbjnr> hkaiser: noted #2789 and commented.

20:23 <hkaiser> I don't see any comments saying 'WIP: FIX THIS PROPERLY'

20:24 <hkaiser> jbjnr: ^

20:27 <Reazul> @hkaiser: Yes, I have a working version of chains with each task mapped in different node working now.

20:29 <hkaiser> ok, so now simply apply a couple of changes: shared_ptr<T> --> hpx::id_type, make_shared --> hpx::new_, derive your C++ object from hpx::components::component, and turn the member functions of this object which you want to call remotely into actions

20:33 <hkaiser> Reazul: IOW, the C++ object which you want to access remotely needs to be turned into a hpx 'component'

20:33 <hkaiser> that allows for instances of this C++ type to be instantiated remotely and you can call member functions of this object remotely as well

20:34 aserio has joined #ste||ar

20:36 <hkaiser> Reazul: also read this: https://stellar-group.github.io/hpx/docs/html/hpx/manual/applying_actions.html

20:37 <hkaiser> and this https://stellar-group.github.io/hpx/docs/html/hpx/manual/components.html

20:38 <Reazul> @hkaiser: thanks, let me read the material, your first instructions are not completely clear to me

20:39 akheir has quit [Remote host closed the connection]

20:40 <jbjnr> https://github.com/STEllAR-GROUP/hpx/pull/2789/commits/518b992d693b37b03cd8dad184634f780151e431 hkaiser about 10-12 commits back from tip of branch

20:40 <aserio> hkaiser: see pm

20:42 <hkaiser> jbjnr: ok, will look

20:43 <jbjnr> I can have a look. from my point of view, there ought not to be any great rush to merge this. A few more days won't hurt

20:43 <hkaiser> jbjnr: I agree

20:43 <hkaiser> just wanted to move this forward

20:45 aserio has quit [Quit: aserio]

20:45 <hkaiser> jbjnr: the docs need more work as well

20:45 <github> [hpx] hkaiser force-pushed resource_partitioner from 8eee17e to a6ed025: https://git.io/v7lfK

20:45 <github> hpx/resource_partitioner a6ed025 Hartmut Kaiser: Adding pool specific performance counters...

20:45 <hkaiser> (perf-counter docs)

20:45 <jbjnr> if you have time, feel free to fix the WIP, if not, I will do so. Yes, docs. correct. need to do more there

20:45 <jbjnr> and the PP stuff. :(

20:46 <jbjnr> docs are falling behind

20:46 <hkaiser> yah, that as well

20:49 <Reazul> @hkaiser: are there any example showing how to create data and move them around node boundary using action and componenets? (other than ag)

20:51 <hkaiser> Reazul: why do you want to move data around?

20:52 <Reazul> Well I mean, HPX will do that for me but I need to express that, right>

20:52 <Reazul> ?*

20:52 <hkaiser> why ?

20:52 <hkaiser> you want to avoid moving data as much as possible, right?

20:52 <Reazul> That is the program I am trying to come up with

20:52 <Reazul> of course.

20:53 <hkaiser> hpx has no explicit data movement API, you 'move' data by passing it as an argument to an action invocation or you get it back when returned from such

20:53 <hkaiser> you can think of an action as a form of a remote procedure call

20:54 <Reazul> actions are also representing tasks right?

20:55 <hkaiser> an action represents a function you can call remotely

20:55 <hkaiser> if you invoke an action usually a task is created

20:55 <Reazul> ok

20:56 bikineev has joined #ste||ar

20:57 <Reazul> Let me rephrase my query, is there any example (simple) for distributed memory other than ag?

20:57 <hkaiser> Reazul: what do you want to do?

20:58 <hkaiser> all examples use distributed memory

20:58 <Reazul> I am trying to assess HPX

20:58 <Reazul> with simple benchmark

20:58 <Reazul> I am trying to make sure I am executing what I plan to

20:58 <hkaiser> I think you're trying to reproduce something resembling an MPI application

20:59 <hkaiser> I'd suggest you forget for a moment that your application should run in distributed

20:59 <Reazul> ok

21:00 <hkaiser> an hpx application is very similar to a non-distributed application, no special 'data movement'

21:00 <Reazul> right

21:01 <hkaiser> the only things which are different compared to a non-distributed application are a couple of things caused by the limits of the C++ memory model requiring things like components and actions

21:01 bikineev has quit [Ping timeout: 240 seconds]

21:01 <hkaiser> Reazul: an action is the same as a normal function except that you can invoke it remotely

21:02 <hkaiser> a component is the same as a normal c++ object except that you can instantiate it remotely

21:02 <Reazul> ok

21:02 <hkaiser> an hpx::id_type is the same as a void* referring to something ina virtual gloabl address space

21:03 <hkaiser> you create an instance of a component using hpx::new_<> and you invoke an action using hpx::async

21:03 <hkaiser> both give you a future<> representing the result of the possibly remote operation

21:04 <hkaiser> so in the end your distributed program looks like a local one

21:04 <hkaiser> MPI forces you to write explicit code to send the data and to receive it (i.e. code on both ends of the wire)

21:05 <Reazul> right

21:05 <hkaiser> hpx does this by invoking an action, data is 'passed' as the argument of that action - the receiver does not explicitly 'wait' for this

21:05 <Reazul> right, makes sense

21:06 <hkaiser> so it's the same as calling a function, the function 'receives' the data whenever it is invoked

21:07 <Reazul> I see

21:07 <hkaiser> Reazul: what type of applications do you plan to write?

21:07 <hkaiser> scientific simulations? graphics? financial?

21:08 <Reazul> very simple

21:08 <Reazul> not specific application

21:08 <Reazul> nothing complex

21:08 <hkaiser> I mean eventually, once you 'evaluated' HPX

21:08 <Reazul> I am trying to survey runtimes

21:08 <hkaiser> to what end?

21:09 <Reazul> to give a evaluation of all the contemporary runtimes

21:09 <hkaiser> just distributed runtimes? or local ones as well?

21:09 <Reazul> just distributed

21:09 <hkaiser> ok

21:10 <hkaiser> I don't thik that you will get a good understandin gof things by trying to implement a trivial use case

21:10 <Reazul> why not?

21:10 <Reazul> I also plan to implement simple stencil operations as well

21:10 <hkaiser> because no real-world distributed application is based on trival use cases

21:11 <hkaiser> ahh, ok

21:11 <Reazul> I am trying to evaluate the dependency resolution, task overhead and communication overhead of the runtimes

21:11 <hkaiser> we have a series of 2d stencil examples here: https://github.com/STEllAR-GROUP/tutorials

21:12 <Reazul> Thanks

21:12 <hkaiser> Reazul: ok, that makes sense

21:12 <Reazul> I stated with embarrassingly parallel tasks for shared memory and moving to distributed with chains of task and ultimately come up with stencil as gereal benchmark

21:13 <Reazul> Should give some idea about the implementation efficiency

21:13 <hkaiser> ok

21:13 <hkaiser> with hpx the distributed code will almost look like local code

21:13 <hkaiser> if properly done, that is

21:13 <Reazul> yes

21:14 <Reazul> I will make sure by pasting the code here.

21:14 <Reazul> you guys have been very helpful. Thank you for that :)

21:14 <hkaiser> Reazul: let us know if we can help in any way

21:15 <Reazul> absolutely

21:24 zbyerly_ has quit [Quit: Leaving]

21:29 <github> [hpx] hkaiser force-pushed resource_partitioner from a6ed025 to 39d8aee: https://git.io/v7lfK

21:29 <github> hpx/resource_partitioner 39d8aee Hartmut Kaiser: Adding pool specific performance counters...

21:35 <github> [hpx] jfbastien opened pull request #2790: Fix OSX build (master...build-fix) https://git.io/v7BLg

21:37 <K-ballo> woa

21:38 <hkaiser> woa indeed

21:44 jfbastien has joined #ste||ar

21:45 <jfbastien> wash like parallel STL compute benchmarks. I see is_heap, partition_copy, unique_copy. Not enough!

21:45 <wash> hkaiser: jfbastien is looking for a "good HPX compute benchmark"

21:46 <wash> jfbastien: something flop bound or mem banwidth bound

21:46 <jfbastien> right, with zero network, and as little external dependency as possible. Like fielsystem is a PITA.

21:47 <jfbastien> Just threads, and numbers. In a Zen state together.

21:47 <jfbastien> Locks are cool, atomics are cooler.

21:47 <wash> jfbastien: lemme think a sec

21:47 mars0000 has quit [Quit: mars0000]

21:50 <wash> jfbastien: the 1d_stencil examples, like 1d_stencil_8, would be good. Lemme send you some slides on them, one sec

21:50 <wash> examples/1d_stencil

21:51 <wash> err 1d_stencil_8 might be the distributed one

21:52 <jfbastien> wash they all just do 1 thread

21:52 <wash> jfbastien: -t8

21:52 <wash> to the command line

21:53 <jfbastien> wash ah cool

21:53 <wash> hkaiser: which slide deck has the slides on 1d_stencil?

21:53 <hkaiser> : uhh

21:53 <jfbastien> wash lol hpx::init: std::exception caught: Requested more than 64

21:53 <hkaiser> jfbastien : we've just recently changed it to use all cores by default (on a branch somewwere)

21:53 <hkaiser> lol

21:54 <jfbastien> hkaiser OK, I pulled ToT

21:54 <hkaiser> : yah

21:54 <hkaiser> jfbastien: how many cores do you have?

21:54 <jfbastien> hkaiser some :)

21:54 <wash> jfbastien: you gotta build with a cmake flag for more than 64 cores

21:54 <hkaiser> if you have more than 64 you need to rebuild with a special option

21:54 <jfbastien> wash yeah the error message said what to do

21:54 <hkaiser> KNLs for instance

21:54 <jfbastien> wash I was just amused

21:55 <hkaiser> 64 bit system have a limit of 64 bits in the simplest bitmask ;)

21:55 <jfbastien> :)

21:56 <jfbastien> wash hkaiser so I'm looking for code that has nice compute, but also lots of atomics or locks.

21:56 <jfbastien> looks like 1d_stencil_8 has locks

21:57 <jfbastien> dunno if they're hot tho

21:57 <hkaiser> just local code or distributed?

21:57 <jfbastien> local only

21:57 <wash> jfbastien: this isn't the one I was looking for

21:57 <wash> https://github.com/STEllAR-GROUP/tutorials/blob/master/lbl2016/HPX%20Workshop%20(Berkeley%20C++%20Summit)%20-%204.pptx

21:57 <wash> jfbastien: it has locks but it's not all-to-all

21:57 <jfbastien> OK

21:57 <hkaiser> then you don't need to run stencil_8, stencil_4 is the local only version

21:57 <wash> jfbastien: you want something with a heavily contended lock?

21:57 <jfbastien> wash not jsut one lock tho!

21:58 <jfbastien> like... real parallelism with close collaboration, thin locks or atomics

21:58 <wash> jfbastien: a reduction, maybe. parallel sort or fibonacci

21:58 <hkaiser> jfbastien: we try to write code which has no locks ;)

21:58 bikineev has joined #ste||ar

21:58 <hkaiser> but our schedulers have locks, those are quite hot, actually

21:58 <jfbastien> hkaiser I'm trying to measure lock / atomics perf and improve it :)

21:59 <hkaiser> so in the end any application with a decent amount of work should hit those

21:59 <jfbastien> ah cool

21:59 <jfbastien> oh hmm, I have to run to a meeting, bbiab

21:59 <wash> jfbastien: fibonacci might be good, or the thread overhead benchmarks.

21:59 <hkaiser> the used queues are lockfree, though - not sure if compswaps are interesting to you as well

22:01 <jfbastien> hkaiser yes, <3 cmpxchg

22:01 <wash> jfbastien: run the task benchmarks then or fibonacci

22:02 <wash> fibonacci will give you more contention

22:02 bikineev has quit [Ping timeout: 240 seconds]

22:06 <github> [hpx] hkaiser pushed 3 new commits to master: https://git.io/v7BOw

22:06 <github> hpx/master 320f515 Denis Blank: Fix a parsing error with Visual Studio 2015 which occurred in unwrap...

22:06 <github> hpx/master bff4854 Hartmut Kaiser: Merge pull request #2787 from Naios/unwrap_hotfixes...

22:06 <github> hpx/master c442ec0 Denis Blank: Fix a potential unconditial moves in hpx::util::tuple_cat...

22:06 <18VABUTKO> [hpx] hkaiser closed pull request #2787: Unwrap hotfixes (master...unwrap_hotfixes) https://git.io/v7WNE

22:37 <jfbastien> wash (back) so that's the fib from quick start?

22:43 <jfbastien> wash looks like fibonacci_future is closer to what I want... but it's just pegging one core on my system :p

22:43 <jfbastien> wash hpx::async is sad maybe

22:55 Matombo has quit [Remote host closed the connection]

23:13 <hkaiser> jfbastien: use -tN to run it on N cores

23:13 <wash> yah

23:13 EverYoun_ has joined #ste||ar

23:14 <jfbastien> hkaiser oh right, all those tests like -tN :)

23:14 <jfbastien> yeah that pegs the core

23:14 <jfbastien> *cores

23:14 <hkaiser> nod, those options work for any hpx app

23:14 <hkaiser> do a --hpx:help for all of them

23:16 EverYoung has quit [Ping timeout: 240 seconds]

23:19 EverYoun_ has quit [Ping timeout: 276 seconds]

23:23 EverYoung has joined #ste||ar

23:23 EverYoung has quit [Remote host closed the connection]

23:25 <github> [hpx] K-ballo force-pushed std-atomic-lite from baf4deb to f92abdc: https://git.io/v7tLl

23:25 <github> hpx/std-atomic-lite df2f6a6 Agustin K-ballo Berge: Replace boost::atomic with std::atomic (where possible)

23:25 <github> hpx/std-atomic-lite f92abdc Agustin K-ballo Berge: Add deprecated checks for Boost.Atomic

23:26 mcopik has quit [Ping timeout: 260 seconds]

23:58 bikineev has joined #ste||ar