hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC2018: https://wp.me/p4pxJf-k1
diehlpk_mobile has quit [Quit: Yaaic - Yet another Android IRC client - http://www.yaaic.org]
hkaiser has quit [Quit: bye]
K-ballo has quit [Quit: K-ballo]
nikunj1997 has joined #ste||ar
nikunj97 has quit [Ping timeout: 240 seconds]
nikunj1997 has quit [Quit: Leaving]
nikunj has joined #ste||ar
anushi has quit [Ping timeout: 248 seconds]
jaafar has quit [Ping timeout: 264 seconds]
Anushi1998 has joined #ste||ar
jakub_golinowski has quit [Ping timeout: 256 seconds]
anushi has joined #ste||ar
david_pfander has joined #ste||ar
anushi has quit [Ping timeout: 255 seconds]
anushi has joined #ste||ar
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
david_pfander has quit [Ping timeout: 255 seconds]
anushi has quit [Ping timeout: 276 seconds]
anushi has joined #ste||ar
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
jakub_golinowski has joined #ste||ar
david_pfander has joined #ste||ar
david_pfander1 has joined #ste||ar
mcopik has joined #ste||ar
Anushi1998 has quit [Ping timeout: 245 seconds]
david_pfander has quit [Ping timeout: 256 seconds]
jakub_golinowski has quit [Ping timeout: 256 seconds]
jakub_golinowski has joined #ste||ar
<marco_>
Hello, I am back. Sorry, but unfortunately I cannot see the forest for the trees. I need a parallel::for_each in which the elements are started in sequence, and I cannot find the appropriate execution policy, ...
<zao>
Still a bit unsure about what the requirements are there.
<zao>
"starting" an action doesn't mean much, even if you could control it.
<heller>
marco_: can you give your specific usecase?
<zao>
If you've got four OS threads servicing HPX, would you kick off the first four items and start additional ones as the first ones start to complete?
<zao>
Unless you never enter a point which can switch contexts, having started before something else doesn't mean much in terms of completion order.
<zao>
(I forget what the HPX term for a possible thread switch is)
<zao>
Some sort of dependency where a task won't start until the N tasks before it have completed, where you've determined N to be the number of threads servicing HPX?
<zao>
(oh wait, that'd be serial, blargh)
<zao>
I'm with heller here :)
<zao>
(and gonna leave it to him)
<jakub_golinowski>
M-ms, yt?
<marco_>
zao: I've a list of independent jobs, and they should start in order, not in segmented chunks per thread.
<jakub_golinowski>
marco_, why should they start in order?
<M-ms>
jakub_golinowski: here
<M-ms>
good that the build works now! could you make a PR to change HPX_LIBRARIES?
<jakub_golinowski>
M-ms, I was going to ask whether I should do it
<jakub_golinowski>
M-ms, did you look at the face-recognition app?
<M-ms>
as for the tests, could you try disabling the tests that fail and collect a list of them, maybe there's something in common to them
<M-ms>
then we can see how many fail across all of opencv
<jakub_golinowski>
M-ms, this is a good idea, I will look into that
<M-ms>
I haven't had time to look at the app yet
<M-ms>
I'll try to do so tonight or tomorrow
<M-ms>
if I understood correctly it's working pretty well for you already, no?
<M-ms>
and for the opencv PR, you can open a new one and just reference the old one in the description
<M-ms>
heller: yt?
<marco_>
jakub_golinowski: *g*, it is not a technical or algorithmic requirement, it is more a business requirement.
<heller>
M-ms: hey
<heller>
marco_: what is meant by "in order"? the start time should be in order? One after the other?
<heller>
marco_: are the different tasks allowed to overlap execution? Do you just need a specific sequence number or something for your task?
<M-ms>
hey, how far did you get with your kokkos explorations? any conclusions yet about how feasible it would be to have an HPX backend for kokkos?
<heller>
M-ms: IMHO, it doesn't make a lot of sense to have a HPX backend for kokkos
<M-ms>
heller: I'm asking because we've been discussing how to make some progress on this cuda business
<heller>
i think this is orthogonal
<heller>
it should just work to use the kokkos CUDA backend inside a HPX application
<M-ms>
yeah, that makes sense
<M-ms>
we would have support to go work with the kokkos people on something, but it's not clear yet if "something" is useful and what exactly it would be
anushi has quit [Read error: Connection reset by peer]
<M-ms>
cuda graphs doesn't look like it's happening soon enough (for us at least)
<heller>
yes
<jbjnr>
marco_: I believe that if you want a parallel::for_each that starts each element in sequence, then what you want is a serial for_each. You can use the sequenced/serial execution policy for that.
<heller>
well, the idea of implementing a kokkos backend is tempting
<M-ms>
it'd also be a shame to have to reimplement all the data layout management that kokkos is doing on top of their backends
<heller>
yes
<heller>
but that's orthogonal as well
<heller>
what would be great is, if we could factor out those data structures
<M-ms>
yes, it is
<M-ms>
and not first priority either, cuda is
<M-ms>
jbjnr: I didn't get so far with the kokkos task pdf, is there something about dags on gpus there?
<heller>
well, the nice thing about the Kokkos CUDA thingy is that they directly embedded the hierarchical memory model into their programming model
<heller>
I am just not yet sure how well that maps to tasking ;)
<jbjnr>
M-ms: heller 1) We have a problem with very small tasks, and implementing a back end that supports the kokkos model would get us closer. This might mean having a special kokkos_executor and implementing their scheduling loop in some form on top of our threads. I do believe this would be a lot of work, but mapping their thread teams onto our schedulers/executors might help us to redesign our own internals in such a way that we improve our small task
<jbjnr>
performance.
<jbjnr>
2) The data layout (array views) + cuda reordering of loops to make thread blocks work better is something we need in HPX regardless of whether we integrate with kokkos, so we might as well use theirs. This can be as simple as just having a cuda-only kokkos that we forward stuff to.
<jbjnr>
3) Ideally we work WITH the kokkos guys to say, you do this well, and we do this well, can we make an API that both of us can use so that we get the nice task API of HPX, alongside the hierarchical mode they use.
<jbjnr>
futures on GPU is the problem we don't have a real solution for. They have a 'hack' that kind of works, but it introduces a shite task methodology we don't want.
<jbjnr>
working with them (for me) is just a way of trying to leverage the best of both worlds to move forward with use and adoption and performance.
<M-ms>
is it as much of a hack as the hpx cuda support? it's obviously not great for lots of small kernel launches but until cuda graphs is here and executors/futures are changed (again) there's not much we can do there
<jbjnr>
If I could find the pdf of the cuda graphs stuff I'd send it to you
<jbjnr>
which doc are you reading?
<M-ms>
did you mean cuda graphs is a hack or what kokkos is doing to have dags on the gpu?
<M-ms>
kokkos task dag capabilities
<jbjnr>
the kokkos dags on GPU is a 'hack' a good one, but it doesn't map well to our task model
<M-ms>
I'll read for a while, let's discuss later again
<marco_>
heller: yes, the start times should be in order, one after the other. The tasks are allowed to overlap execution; there are no dependencies between them. I don't need a sequence number or anything else.
K-ballo has joined #ste||ar
<heller>
marco_: ok, you can achieve that with the parallel execution policy and a chunk size of 1
<jbjnr>
that won't guarantee the ordering though
<heller>
that still doesn't guarantee that the start times will be in order (our execution model doesn't account for that), only the task creation times
<jbjnr>
the tasks will be added round robin to thread queues
<heller>
right
<jbjnr>
and might be pulled off in the wrong order
<jbjnr>
marco_: we can't do what you ask for without changes to hpx. We can impose a dependency on the 'ending' of a task, via a future, but not on the 'starting' of a task. If the tasks were added in order to a queue and taken off the queue in order in the scheduler, it would work, but we could not guarantee it without changes to the scheduler currently.
<jbjnr>
unless an idea like heller's could be tweaked somehow
<heller>
the real question would be, why you need to order the start time?
<heller>
that would impose quite a ton of synchronization between the task queues
hkaiser has joined #ste||ar
<heller>
if they run in parallel, there's nothing wrong with them starting execution at the same time
<jbjnr>
he probably has some dodgy counter access at the start of each task and wants to make sure they happen 'in order'
<jbjnr>
but you are right. Address the problem of why, and then the real answer will become clear
<jbjnr>
does make parallel::for_each a bit useless; however, we could 'wrap' it in some template magic to create one
<hkaiser>
could end up being too fine-grained
<jbjnr>
indeed
<heller>
that's what he asked for ...
<jbjnr>
correct.
<jbjnr>
back to work now ...
<github>
[hpx] biddisco force-pushed guided_pool_executor from 349ca7b to 31faced: https://git.io/vxkTv
<github>
hpx/guided_pool_executor b085914 Thomas Heller: Changing the coroutine implementations to do a lazy init...
<github>
hpx/guided_pool_executor 24ec144 John Biddiscombe: Remove staged queue from thread map and run_now param from create_thread api...
<github>
hpx/guided_pool_executor fdc1a6c John Biddiscombe: Remove wait_or_add_new from scheduling loop, thread_queue and schedulers
jaafar has joined #ste||ar
nikunj has quit [Remote host closed the connection]
nikunj has joined #ste||ar
<marco_>
Ok, Thank you very much for the explanation, I will then create my own loop with async.
<Guest71870>
[hpx] Jakub-Golinowski opened pull request #3365: Fix order of hpx libs in HPX_CONF_LIBRARIES. (master...fix_lib_order) https://git.io/fbhmA
<jbjnr>
marco_: you should consider very carefully whether you really _need_ to do what you have asked for. If your for_each is large, then the cost of an async for_each will become an issue that is probably bigger than the problem you are trying to solve by launching tasks in order. If you can tell us why you need to start them in order, then we might be able to suggest an alternative strategy.
anushi has joined #ste||ar
nikunj97 has joined #ste||ar
nikunj has quit [Remote host closed the connection]
<nikunj97>
hkaiser: yt?
<marco_>
jbjnr: Ok, I will write a brief review of my application. I will contact you tomorrow or later.
diehlpk_mobile has joined #ste||ar
<nikunj97>
diehlpk_mobile: is the deadline for submitting our blog links and PR links the 6th, or by the 6th?
diehlpk_mobile has quit [Read error: Connection reset by peer]
diehlpk_mobile2 has joined #ste||ar
diehlpk_mobile has joined #ste||ar
diehlpk_mobile3 has joined #ste||ar
anushi has quit [Remote host closed the connection]
<nikunj97>
diehlpk_work: yt?
diehlpk_mobile has quit [Ping timeout: 240 seconds]
diehlpk_mobile2 has quit [Ping timeout: 260 seconds]
anushi has joined #ste||ar
diehlpk_mobile3 has quit [Read error: Connection reset by peer]
<hkaiser>
nikunj97: here
aserio has joined #ste||ar
<hkaiser>
aserio: see pm, pls
<nikunj97>
hkaiser: I had an idea to resolve the global object situation.
<nikunj97>
Since _init() has the responsibility of initializing all the global objects, why don't we have our own _init() as well?
<hkaiser>
ok
<nikunj97>
so in case a user wishes to use HPX functionality he could do something like hpx.add_object("struct/class_name object_name")
<nikunj97>
or something similar. This way a user can create initialization routine specific to HPX
<hkaiser>
how would you prevent those constructors from being called twice? what about destruction?
<nikunj97>
I'm currently trying to create a model to handle it
<nikunj97>
hkaiser: Do you think something like this is feasible?
<hkaiser>
worth a try, definitely
<nikunj97>
then I'll investigate further
<nikunj97>
Actually, I have been trying to implement it without getting into initialization sequencing issues. I found the exact function that initializes the global objects, but it is not exported from libc.so, so I can't wrap it in any way
<nikunj97>
To implement it I would have to implement the _init function myself, but that itself contains symbols that are not exported, so I would be forced to implement those as well, and so on recursively. This was creating portability issues.
<nikunj97>
I tried implementing it myself and, on failing to do so, came up with the idea of HPX's own init function to initialize HPX-related global objects
<hkaiser>
nod, figures
<nikunj97>
hkaiser: did you review my pr?
<hkaiser>
nikunj97: not yet, sorry
<hkaiser>
working towards it ;-)
<nikunj97>
hkaiser: ok, actually I wanted to add the link to my macOS PR to the mid-term evaluation as well.
<hkaiser>
nikunj97: pls go ahead and create that pr
<nikunj97>
but there will be an overlap of code between these 2 PRs. I'll be adding code to hpx_wrap.cpp, so I thought that I should wait for this PR to get merged.
<hkaiser>
nikunj97: ok, I'll try to get it done today
<nikunj97>
hkaiser: thanks, once it's merged I will then add the PR and its link to my mid-term evaluation as well.
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
<diehlpk_work>
nikunj97, yes
<nikunj97>
diehlpk_work: is the deadline for submitting our blog links and PR links the 6th, or by the 6th?
<diehlpk_work>
nikunj97, I'd like to have them by the 6th, so I can prepare a blog post over the weekend
<nikunj97>
diehlpk_work: will it be fine if I send them over on 6th?
<diehlpk_work>
Sure, I'd like to have them on Saturday, my local time zone
<nikunj97>
ok, thanks :)
akheir has joined #ste||ar
jakub_golinowski has quit [Quit: Ex-Chat]
jakub_golinowski has joined #ste||ar
hkaiser has quit [Quit: bye]
david_pfander has quit [Remote host closed the connection]
david_pfander has joined #ste||ar
galabc has joined #ste||ar
hkaiser has joined #ste||ar
Anushi1998 has quit [Quit: Bye]
anushi has quit [Ping timeout: 248 seconds]
anushi has joined #ste||ar
_bibek_ has joined #ste||ar
hkaiser_ has joined #ste||ar
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
hkaiser has quit [Ping timeout: 248 seconds]
bibek has quit [Ping timeout: 276 seconds]
hkaiser_ has quit [Client Quit]
aserio has quit [Ping timeout: 260 seconds]
anushi has quit [Ping timeout: 260 seconds]
<jakub_golinowski>
M-ms, yt?
Anushi1998 has joined #ste||ar
nikunj97 has quit [Quit: bye]
nikunj has joined #ste||ar
hkaiser has joined #ste||ar
<M-ms>
jakub_golinowski: uhm, half here
galabc has quit [Read error: Connection reset by peer]
galabc has joined #ste||ar
gabriel_ has joined #ste||ar
galabc has quit [Ping timeout: 256 seconds]
K-ballo has quit [Quit: K-ballo]
anushi has joined #ste||ar
mcopik has quit [Ping timeout: 265 seconds]
aserio has joined #ste||ar
aserio1 has joined #ste||ar
nikunj has quit [Quit: Leaving]
aserio has quit [Ping timeout: 265 seconds]
aserio1 is now known as aserio
gabriel_ has quit [Ping timeout: 256 seconds]
mcopik has joined #ste||ar
anushi has quit [Read error: Connection reset by peer]
anushi has joined #ste||ar
galabc has joined #ste||ar
mcopik has quit [Ping timeout: 240 seconds]
K-ballo has joined #ste||ar
anushi has quit [Ping timeout: 256 seconds]
parsa[[w]] has joined #ste||ar
parsa[w] has quit [Ping timeout: 260 seconds]
<aserio>
_bibek_: yt?
<hkaiser>
diehlpk_work: yt?
<diehlpk_work>
yes
<hkaiser>
see pm, pls
galabc has quit [Ping timeout: 268 seconds]
eschnett has joined #ste||ar
aserio has quit [Remote host closed the connection]
aserio has joined #ste||ar
<heller>
aserio: Hey, what's the password?
<aserio>
stellargroup
<heller>
Thanks
katywilliams has joined #ste||ar
khuck has joined #ste||ar
<khuck>
aserio: phylanx meeting?
katywilliams has quit [Client Quit]
anushi has joined #ste||ar
khuck has quit []
<M-ms>
jakub_golinowski: trying the face detection now, getting cascadedetect.cpp:1694: error: (-215:Assertion failed) !empty() in function 'detectMultiScale'
<M-ms>
have you had something similar? at least it doesn't seem like hpx messing up
<jakub_golinowski>
this is the ML classifier
eschnett has quit [Quit: eschnett]
<jakub_golinowski>
M-ms, check if the .xml files are correctly pointed to -> it depends on where exactly your build dir is
<jakub_golinowski>
the app assumes you have a binary in the /hpx_opencv_webcam/build directory
anushi has quit [Remote host closed the connection]
anushi has joined #ste||ar
RostamLog has joined #ste||ar
jakub_golinowski has quit [Remote host closed the connection]
jakub_golinowski has joined #ste||ar
anushi has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
anushi has joined #ste||ar
akheir has quit [Quit: Leaving]
K-ballo has quit [Quit: K-ballo]
<jakub_golinowski>
M-ms, I am working on the tests in opencv and all the failing ones seem to be somehow connected with OCL
anushi has quit [Ping timeout: 268 seconds]
anushi has joined #ste||ar
aserio has quit [Quit: aserio]
anushi has quit [Ping timeout: 264 seconds]
anushi has joined #ste||ar
quaz0r has quit [Quit: reboot]
anushi has quit [Ping timeout: 240 seconds]
anushi has joined #ste||ar
K-ballo has joined #ste||ar
jakub_golinowski has quit [Ping timeout: 245 seconds]