aserio changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
vamatya has joined #ste||ar
EverYoung has quit [Ping timeout: 246 seconds]
mcopik has quit [Remote host closed the connection]
hkaiser has quit [Read error: Connection reset by peer]
eschnett has joined #ste||ar
parsa has joined #ste||ar
parsa has quit [Quit: Zzzzzzzzzzzz]
parsa has joined #ste||ar
parsa has quit [Quit: Zzzzzzzzzzzz]
vamatya has quit [Ping timeout: 240 seconds]
K-ballo has quit [Quit: K-ballo]
parsa has joined #ste||ar
jfbastien_ has quit [Ping timeout: 248 seconds]
thundergroudon[m has quit [Ping timeout: 240 seconds]
parsa has quit [Quit: Zzzzzzzzzzzz]
thundergroudon[m has joined #ste||ar
david_pfander has joined #ste||ar
bikineev has joined #ste||ar
bikineev has quit [Ping timeout: 248 seconds]
bikineev has joined #ste||ar
<github> [hpx] sithhell created throttle_cores (+2 new commits): https://git.io/v5wnv
<github> hpx/throttle_cores ed72bb5 Thomas Heller: Merge remote-tracking branch 'origin/fix_rp_again' into throttle_cores
<github> hpx/throttle_cores 8abb710 Thomas Heller: Fixing RP to enable resource throttling...
<heller> jbjnr: ^^ just committed my changes
Matombo has joined #ste||ar
bikineev has quit [Remote host closed the connection]
<github> [hpx] sithhell pushed 1 new commit to cuda_clang: https://git.io/v5wc0
<github> hpx/cuda_clang 1d53663 Thomas Heller: Properly exporting the CUDA clang flags
<github> [hpx] sithhell force-pushed throttle_cores from 8abb710 to 55429da: https://git.io/v5wWj
<github> hpx/throttle_cores 55429da Thomas Heller: Fixing RP to enable resource throttling...
bikineev has joined #ste||ar
<github> [hpx] StellarBot pushed 1 new commit to gh-pages: https://git.io/v5w0k
<github> hpx/gh-pages f18218a StellarBot: Updating docs
mcopik has joined #ste||ar
bikineev has quit [Ping timeout: 240 seconds]
bikineev has joined #ste||ar
bikineev has quit [Ping timeout: 240 seconds]
Matombo has quit [Ping timeout: 260 seconds]
Matombo has joined #ste||ar
david_pfander has quit [Ping timeout: 246 seconds]
K-ballo has joined #ste||ar
mcopik has quit [Ping timeout: 246 seconds]
mcopik has joined #ste||ar
david_pfander has joined #ste||ar
hkaiser has joined #ste||ar
<jbjnr> hkaiser: error: no member named 'unwrapped' in namespace 'hpx::util'; did you mean 'unwrapping'?
<jbjnr> do I want unwrapping?
<hkaiser> yes
<hkaiser> jbjnr: unwrapped has been split into unwrap and unwrapping, depending on the use case
<jbjnr> thanks
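For reference, a minimal sketch of the split API jbjnr ran into: hpx::util::unwrapping adapts a callable so it receives the futures' values, while hpx::util::unwrap extracts values from futures directly. The umbrella headers used here are an assumption and may differ between HPX versions:

    #include <hpx/include/lcos.hpp>
    #include <hpx/include/util.hpp>
    #include <utility>

    int add(int a, int b) { return a + b; }

    void example()
    {
        hpx::future<int> f1 = hpx::make_ready_future(1);
        hpx::future<int> f2 = hpx::make_ready_future(2);

        // unwrapping: wraps 'add' so the futures' values are passed as ints
        hpx::future<int> sum =
            hpx::dataflow(hpx::util::unwrapping(&add), std::move(f1), std::move(f2));

        // unwrap: pulls the value out of a (ready) future directly
        int three = hpx::util::unwrap(sum);
        (void) three;
    }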
eschnett has quit [Quit: eschnett]
pree has joined #ste||ar
pree has quit [Remote host closed the connection]
pree has joined #ste||ar
pree has quit [Read error: Connection reset by peer]
eschnett has joined #ste||ar
parsa has joined #ste||ar
pree has joined #ste||ar
<heller> is there a way to get the thread pool on which I am running on?
<heller> hpx::get_worker_thread_num() returns the local thread number; I have two pools, and I need the global one to figure out which NUMA domain I am running on
<heller> more precisely, I need a way to figure out which NUMA node I am currently on
<heller> another question: Is the pool location being preserved when I just do something like "hpx::async([](){});"
<heller> no it is not :/
<heller> it all ends up on the default_pool
akheir has joined #ste||ar
aserio has joined #ste||ar
patg[[w]] has joined #ste||ar
<patg[[w]]> aserio: is there a meeting today?
<jbjnr> heller: if you use async() without a pool executor, it goes on the default pool
<aserio> patg[[w]]: There is no Operation Bell Meeting today
<jbjnr> if you want to know which pool you are on at the moment, we should add an api call
<patg[[w]]> aserio: ok thanks
<jbjnr> inside the scheduler, we know, but inside the task we do not
<jbjnr> so we'd need to add get_current_pool to RP and we can use the global thread number to look up the pool
<jbjnr> because the TLS knows the thread id (global)
Matombo has quit [Ping timeout: 248 seconds]
Matombo has joined #ste||ar
<heller> jbjnr: get_self_id()->get_scheduler_base()->get_parent_pool()
<jbjnr> yes, inside the scheduler that's fine
<heller> jbjnr: hpx::get_worker_thread_num() returns the local index, doesn't it?
<jbjnr> the TLS has the global thread num, whatever fetches that
hkaiser has quit [Quit: bye]
<heller> ok, good to know
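A minimal sketch of the lookup discussed above, built around the call chain heller quotes; the accessor names follow that snippet, and the exact types and headers depend on the HPX version (the resource partitioner work was still in flight here), so treat this as illustrative only:

    #include <hpx/include/threads.hpp>
    #include <cstddef>

    void where_am_i()
    {
        // pool executing the current HPX thread (only valid inside an HPX task)
        auto* pool =
            hpx::threads::get_self_id()->get_scheduler_base()->get_parent_pool();

        // worker thread number, local to the current pool
        std::size_t local_num = hpx::get_worker_thread_num();

        (void) pool;
        (void) local_num;
    }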
<github> [hpx] sithhell pushed 3 new commits to throttle_cores: https://git.io/v5rfo
<github> hpx/throttle_cores 9ffd4d9 Thomas Heller: Implementing hpx::threads::get_numa_node_number
<github> hpx/throttle_cores ad5bcfa Thomas Heller: Exporting minimal_deadlock_detection variable
<github> hpx/throttle_cores 0c4b3b5 Thomas Heller: Respecting the current pool we run on
<heller> jbjnr: pool->get_worker_thread_num will return local or global now?
<heller> I think it returns the local one though
<github> [hpx] sithhell pushed 1 new commit to throttle_cores: https://git.io/v5rfN
<github> hpx/throttle_cores 4cc9e6f Thomas Heller: Fixing thread number selection...
mbremer has joined #ste||ar
<heller> jbjnr: thanks! Good to know
<heller> jbjnr: any rationale for always falling back to the default pool?
<heller> I think staying inside the pool is a saner option
<github> [hpx] K-ballo created sfinae-expression-complete (+2 new commits): https://git.io/v5rTd
<github> hpx/sfinae-expression-complete b96965d Agustin K-ballo Berge: Drop special case config macro for "complete" SFINAE support
<github> hpx/sfinae-expression-complete d53fcff Agustin K-ballo Berge: Remove empty on-framework feature test cmake functionality
patg[[w]] has quit [Quit: Leaving]
<jbjnr> heller: default pool - backwards compatibility; it was the first major check-in. Probably a decent default would be to spawn the task on the same pool as the current task unless otherwise requested.
<heller> Yes, I agree
<jbjnr> ^yes saner option, easy to fix if we want
<heller> Already did
<jbjnr> ok
<heller> Will have to check the global vs local nonsense once more
<heller> Overall, I'm pretty happy though
<heller> One pool per numa domain is giving nice results for me so far
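A sketch of the behaviour being changed here: a plain hpx::async falls back to the default pool, so work meant for a NUMA-specific pool has to be launched through a pool executor. The pool_executor name and its string constructor are assumptions based on the resource partitioner work and may not match the final API:

    #include <hpx/include/async.hpp>
    #include <hpx/include/parallel_executors.hpp>

    void spawn_on_pool()
    {
        // "numa-0" is a hypothetical pool name registered with the resource partitioner
        hpx::threads::executors::pool_executor exec("numa-0");

        // plain hpx::async([]{}) would land on the default pool;
        // passing the executor keeps the task on the chosen pool
        auto f = hpx::async(exec, []{ /* work pinned to the numa-0 pool */ });
        f.get();
    }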
patg[[w]] has joined #ste||ar
david_pfander has quit [Ping timeout: 252 seconds]
zbyerly_ has quit [Ping timeout: 240 seconds]
<jbjnr> K-ballo: is there a code snippet in hpx somewhere I can reuse that would move a list of variadic args into a tuple, so that I can later use them for async?
parsa has quit [Quit: Zzzzzzzzzzzz]
<K-ballo> auto tup = make_tuple(...) on one side, for decaying the arguments.. then move(tup) whenever it is used
<K-ballo> there might be something along those lines in the parallel algorithm implementations
<K-ballo> the "task" algorithms, the other policies would just forward, not decay
<K-ballo> jbjnr: do you want to expand the tuple when calling async?
<jbjnr> K-ballo: thanks. yes, I will need to call func_1(...) and store the args in a vector of tuples, then in func_2, I will need to pass all the tuples though. I might not need to expand them, not sure yet until I work out what I want to do
<jbjnr> similar to queuing tasks up for later execution
<K-ballo> deferred_call ?
<jbjnr> essentially yes
<K-ballo> why essentially and not exactly?
<jbjnr> we want to queue N tasks, and then send them to cuda instead of to hpx::async
<jbjnr> so it's the same mechanism sort of, but instead of N tasks, we need an array of tasks we can spawn in one go (or something like that)
<jbjnr> haven't really got that far yet, but wanted to work on the mechanism
<jbjnr> to store the args in a std::array<arg_tuple,N> until later
<jbjnr> (or similar if array is no good)
<jbjnr> but all args need to be sent to gpu, so an array might be nice
<K-ballo> so close to array<deferred_call, N> but without storing the function object separately for each element
<jbjnr> great yes
<K-ballo> a slightly more general answer: if the arguments are guaranteed to stay alive, then the argument tuple pack is whatever forward_as_tuple returns; if they don't, as is the case for async calls, then make_tuple is needed for decaying those arguments
<K-ballo> when it is time to execute the task, in either case invoke_fused(fun, move(tuple)) is the easiest way
<K-ballo> invoke_fused explodes the tuple and passes the elements as arguments to a callable
<jbjnr> wow. nice.
<jbjnr> ok thanks very much. I will now play for a bit with those ideas
<K-ballo> deferred_call packages all that up for just one call, so if in doubt go look under its covers
<jbjnr> yes thanks. In fact for the cuda call, I might just want to pass an array of tuples and not expand them, no real need to use them as args. I will look
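A minimal sketch of the pattern K-ballo outlines: decay-copy the arguments with make_tuple now, store the tuple, and explode it with invoke_fused when the task finally runs. The helper names queue_args/run_later are purely illustrative and the header paths may differ between HPX versions:

    #include <hpx/util/invoke_fused.hpp>
    #include <hpx/util/tuple.hpp>
    #include <utility>

    // capture the arguments by value (decayed), so they outlive the call site
    template <typename... Ts>
    auto queue_args(Ts&&... ts)
    {
        return hpx::util::make_tuple(std::forward<Ts>(ts)...);
    }

    // later: expand the stored tuple back into individual arguments for f
    template <typename F, typename Tuple>
    void run_later(F&& f, Tuple&& args)
    {
        hpx::util::invoke_fused(std::forward<F>(f), std::forward<Tuple>(args));
    }

An array of such tuples (e.g. std::array<decltype(queue_args(...)), N>) gives the "array of tasks" jbjnr describes, with the function object stored once rather than per element.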
<jbjnr> heller: does cuda support tuples - do I have to use thrust headers for those?
<K-ballo> hpx::util::tuple is littered with HPX_HOST_DEVICE and even some __CUDACC__ workarounds
jaafar has joined #ste||ar
hkaiser has joined #ste||ar
<diehlpk_work> hkaiser, I shortened the introduction for the paper. Can you read the introduction and change the things you would like to have changed?
jaafar has quit [Ping timeout: 248 seconds]
parsa has joined #ste||ar
<heller> jbjnr: yes, tuple is supported
<jbjnr> variadic ones?
david_pfander has joined #ste||ar
parsa has quit [Quit: *yawn*]
<heller> jbjnr: sure thing
<heller> jbjnr: invoke is supported as well
hkaiser has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
bikineev has joined #ste||ar
aserio has quit [Ping timeout: 240 seconds]
bikineev has quit [Remote host closed the connection]
bikineev has joined #ste||ar
Matombo has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
EverYoun_ has joined #ste||ar
EverYoung has quit [Ping timeout: 246 seconds]
<github> [hpx] hkaiser pushed 1 new commit to master: https://git.io/v5ruu
<github> hpx/master 60a0111 Hartmut Kaiser: Merge pull request #2888 from STEllAR-GROUP/fixing_2881...
mcopik has quit [Ping timeout: 248 seconds]
<jbjnr> hkaiser: undefined reference to `hpx_exported_plugins_list_hpx_factory' - any idea what I'm missing. some macro ....
<heller> jbjnr: debug vs. release build
<jbjnr> balls. thanks
vamatya has joined #ste||ar
<pree> Why are tasks that are created and executed with the help of (service) executors run on OS threads and not on HPX threads? Is this true only for service executors or for all executors (sequenced & parallel)?
<pree> Thanks
<heller> only for the service executors
<pree> Could you kindly explain why ?
Matombo has joined #ste||ar
pree has quit [Remote host closed the connection]
pree has joined #ste||ar
pree_ has joined #ste||ar
<heller> pree: the service executor is used for blocking network calls. those are not executed on an HPX thread
pree has quit [Ping timeout: 240 seconds]
pree__ has joined #ste||ar
pree_ has quit [Ping timeout: 240 seconds]
hkaiser has quit [Ping timeout: 248 seconds]
pree__ has quit [Ping timeout: 240 seconds]
pree__ has joined #ste||ar
hkaiser has joined #ste||ar
pree__ is now known as pree
<pree> heller : thanks : )
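To illustrate heller's answer, a hedged sketch of sending blocking work to the IO service pool (OS threads) instead of an HPX worker thread; io_pool_executor is taken from the service executor family, and the exact name and headers may differ per HPX version:

    #include <hpx/include/async.hpp>
    #include <hpx/include/threads.hpp>

    void blocking_call()
    {
        // executor backed by the runtime's IO (service) pool
        hpx::threads::executors::io_pool_executor exec;

        // the lambda runs on an OS thread of the service pool, so it may
        // block without stalling one of the HPX schedulers
        auto f = hpx::async(exec, []{ /* blocking network/file call here */ });
        f.get();
    }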
<github> [hpx] hkaiser force-pushed fixing_2885 from 7eac243 to 67af823: https://git.io/v5V1P
<github> hpx/fixing_2885 67af823 Hartmut Kaiser: Adapt broadcast() to non-unwrapping async<Action>...
Matombo has quit [Read error: Connection reset by peer]
Matombo has joined #ste||ar
Matombo has quit [Remote host closed the connection]
hkaiser has quit [Quit: bye]
Matombo has joined #ste||ar
bikineev has quit [Ping timeout: 252 seconds]
aserio has joined #ste||ar
Matombo has quit [Remote host closed the connection]
<github> [hpx] sithhell opened pull request #2891: Resource Partitioner Fixes (master...throttle_cores) https://git.io/v5rX1
aserio has quit [Ping timeout: 264 seconds]
akheir has quit [Remote host closed the connection]
Matombo has joined #ste||ar
aserio has joined #ste||ar
hkaiser has joined #ste||ar
<hkaiser> aserio: could you send me the link for the meeting again, pls?
EverYoun_ has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
mcopik has joined #ste||ar
<mbremer> @hkaiser: Do you have some time to chat? I'm a little curious about what things I need to do to get vtune to work properly
<hkaiser> mbremer: would be a bit later still ok?
<mbremer> Yeah sure! I'm flexible. When works for you?
diehlpk_work has quit [Ping timeout: 255 seconds]
diehlpk_work has joined #ste||ar
<hkaiser> mbremer: I'm in meeting right now, I'll ping you afterwards
jaafar has joined #ste||ar
pree has quit [Ping timeout: 240 seconds]
StefanLSU has joined #ste||ar
pree has joined #ste||ar
<diehlpk_work> What is the parameter again to run several hpx applications on one node?
StefanLSU has quit [Quit: StefanLSU]
akheir has joined #ste||ar
<github> [hpx] sithhell created resource_partitioner at 519b074 (+0 new commits): https://git.io/v7lfK
pree has quit [Quit: AaBbCc]
<hkaiser> mbremer: you there?
<mbremer> yup
<hkaiser> want to talk now?
<mbremer> Sure. I'll just call via gchat. Give me like 2 or 3 minutes
mbremer_ has joined #ste||ar
eschnett has quit [Quit: eschnett]
aserio has quit [Ping timeout: 246 seconds]
aserio has joined #ste||ar
<K-ballo> didn't we used to have some reserve_if_possible utilities?
<K-ballo> did I refactor them out of existence?
<K-ballo> reserve_if_reservable :|
<hkaiser> K-ballo: there are still somewhere
<K-ballo> yeah, they just got renamed
<K-ballo> reserve_if_vector -> reserve_if_reservable I get, reserve_if_random_access -> reserve_if_random_access_by_range is a mystery
<K-ballo> furthermore, reserve_if_random_access_by_range should have been updated to work with "reservables" and not just vector
mbremer_ has quit [Ping timeout: 260 seconds]
bikineev has joined #ste||ar
<hkaiser> K-ballo: sorry, I don't know anything about this
<K-ballo> anyways, they've got duplicated inside pack_traversal, we should consolidate
<heller> hkaiser: guess we're running out of circle resources
<heller> Having buildbot_travis would be nice
<mcopik> heller: a quick question on CUDA compute - do you remember what should happen to CUDA copy for non trivially copyable types? there's no copy_helper specialization for respective pointer tag, won't it create undefined behavior when host and device ptrs are passed? https://github.com/STEllAR-GROUP/hpx/blob/050d1f17b6b7c77d9ca8fe399ca7e3ddea5ef6f8/hpx/compute/cuda/transfer.hpp#L56
StefanLSU has joined #ste||ar
StefanLSU has quit [Client Quit]
StefanLSU has joined #ste||ar
<hkaiser> heller: yes
zbyerly_ has joined #ste||ar
<hkaiser> K-ballo: I agree
<heller> mcopik: no idea there. There are no docs on how marshalling is handled
<hkaiser> heller: yah, we should move things like octotiger off our circleci account
<hkaiser> somebody has enabled full Vc tests in the octotiger scripts
<hkaiser> luckily that times out after 2 hours
<hkaiser> aserio: yt?
zbyerly_ has quit [Ping timeout: 240 seconds]
<mcopik> heller: but won't a lack of specialization for e.g. cuda_copyable_pointer_tag_to_host lead to an obviously wrong call to std::copy?
<heller> Yes
<heller> What would you suggest?
akheir has quit [Remote host closed the connection]
<hkaiser> well, for a start disable the Vc tests
<hkaiser> heller: ^^
<aserio> hkaiser: yes
<aserio> sorry
<hkaiser> aserio: np
<hkaiser> is Rohid still around?
<aserio> I can check
<hkaiser> Rod*
StefanLSU has quit [Quit: StefanLSU]
<aserio> He is still here
<aserio> does he have IRC
<aserio> or should I tell him to check his email?
<mcopik> heller: would you consider having a static assert or anything similar to disallow the user to use a non trivially copyable type T for compute allocator? I think it's better than a segfault
<mcopik> I'm asking since I'm genuinely surprised
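A sketch of the guard mcopik proposes, not HPX's actual code: reject non-trivially-copyable value types at compile time instead of letting the raw host/device transfer fall through to an unsuitable std::copy:

    #include <type_traits>

    // hypothetical check that a compute::cuda allocator (or transfer helper)
    // could carry for its value type T
    template <typename T>
    struct require_trivially_copyable
    {
        static_assert(std::is_trivially_copyable<T>::value,
            "cuda target memory is transferred with a raw memory copy; "
            "T must be trivially copyable");
    };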
<hkaiser> aserio: just tell him to look at his circleci stuff ;)
<aserio> hkaiser: he said that he thinks he knows what the issue is
<hkaiser> :D
david_pfander has quit [Ping timeout: 252 seconds]
patg[[w]] has quit [Quit: Leaving]
rod_t has joined #ste||ar
jaafar has quit [Ping timeout: 248 seconds]
StefanLSU has joined #ste||ar
StefanLSU has quit [Client Quit]
EverYoun_ has joined #ste||ar
StefanLSU has joined #ste||ar
EverYoung has quit [Ping timeout: 255 seconds]
StefanLSU has quit [Client Quit]
aserio has quit [Quit: aserio]
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
rod_t has joined #ste||ar
Matombo has quit [Remote host closed the connection]
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
mbremer has quit [Quit: Page closed]
EverYoun_ has quit [Remote host closed the connection]
bikineev has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 255 seconds]
parsa has joined #ste||ar
<parsa> hkaiser: ping
<hkaiser> parsa: here
<parsa> hkaiser: can you take a quick look at my paper's abstract and conclusion sections?
<hkaiser> parsa: in the repo?
<parsa> yeah
<hkaiser> will do
<hkaiser> parsa: doesn't compile for me
<parsa> i can compile on mac and windows and the pdf is right there in the repo with the code
<hkaiser> parsa: got your pdf
<hkaiser> parsa: an abstract should contain 3 parts: problem statement, solution applied, and results
<hkaiser> your abstract is more of an introduction
<hkaiser> also, why do the conclusions come before section 4.2 etc.?
<parsa> what do you mean? the conclusions section is section 6 on page 9
<hkaiser> not in the pdf you linked
<hkaiser> ok, nvm, my fault
<hkaiser> parsa: let's talk tomorrow about the paper
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
<parsa> hkaiser: okay, what time would be convenient?
<hkaiser> any time
EverYoun_ has joined #ste||ar
EverYoung has quit [Ping timeout: 246 seconds]
rod_t has joined #ste||ar
parsa has quit [Quit: Zzzzzzzzzzzz]
parsa has joined #ste||ar
EverYoun_ has quit [Remote host closed the connection]
mcopik has quit [Ping timeout: 246 seconds]
parsa has quit [Quit: Zzzzzzzzzzzz]