hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoD: https://developers.google.com/season-of-docs/
hkaiser has quit [Quit: bye]
<Yorlik> How would I start an action and demand to start execution right away, like realtime priority? I could still wait for the future, since it's a long running action, but I don't want any delay in startup.
<Yorlik> NVM - found it: hpx::async<agns::game::Controller::start_action>(hpx::launch::sync, gcId);
<Yorlik> Argh - crap, that's wrong ... fork maybe?
<jbjnr_> Yorlik: fork will suspend the current task and immediately switch to the new one - but this would only work for a local action. If you wanted to do it remotely, you need to look at direct_action
<Yorlik> How would I start a task immediately but async - is there a way to control that ?
<jbjnr_> fork policy should work, (I tend to use a high priority task and am prepared to wait)
<Yorlik> But you said fork suspends the current task. I want it to be non blocking but guaranteed to start immediately
<Yorlik> Or asap
<jbjnr_> "non blocking but guaranteed to start immediately" - that's a contradiction unless you spawn a completely new worker thread to launch it on
<jbjnr_> can you reserve a core for a special thread pool for these realtime tasks?
<jbjnr_> (one at a time), or more cores in the pool if you need multiple
<Yorlik> I am starting this out of hpx_main and its essentially a task running for the lifetime of the program
<jbjnr_> Then the best thing to do would be to create a separate thread pool with 1 core in it and only run that task on that pool
<Yorlik> OK. Is there a way to iterate over my worker threads to initialize thread local objects?
<jbjnr_> then normal tasks will use the 'default' thread pool and your special task will be in its own sandbox
<Yorlik> Makes sense. My default pool would have a lua engine in each OS thread. I'd like to initialize all of them at start. How could I iterate over them?
<jbjnr_> scroll down to the end of here - https://github.com/STEllAR-GROUP/hpx/blob/master/tests/unit/resource/named_pool_executor.cpp does this help you?
<Yorlik> And start an init task on each
<jbjnr_> aha. That's slightly different
<Yorlik> I just want to call a function at startup on each thread that creates the static thread local lua engine
<jbjnr_> yup. let me think a moment
<jbjnr_> I'm looking for an example to help you
<Yorlik> We don't keep state in the lua engines - they just run one off tasks that store data on the C++ side
<Yorlik> So we can run any gameobject message handlers on any lua state
<jbjnr_> if you are running this from main(), then just loop over the number of threads and async spawn a task for each lua engine/interpreter; if no other tasks are running then each core will take one task automagically
<Yorlik> That I could do.
<Yorlik> I have a separate init function that would just do it
<jbjnr_> to be 100% certain that nothing else runs there, create a pool named "lua" or something and allocate N cores to it, then start N lua tasks on that pool. All other tasks will go on the "default" pool
<jbjnr_> the default pool must have at least one core assigned to it
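A minimal sketch of the per-thread initialization idea described above (pseudocode only: the HPX call names are approximations from this discussion, not verified against a specific HPX version, and `lua_engine` is a hypothetical stand-in for Yorlik's interpreter):

```cpp
// Pseudocode sketch: spawn one init task per worker thread so each
// OS thread constructs its thread_local lua state up front.
thread_local lua_engine engine;           // created once per OS thread

void init_lua() { engine.initialize(); }  // touches the thread_local

void init_all(std::size_t num_threads)
{
    std::vector<hpx::future<void>> futures;
    for (std::size_t i = 0; i != num_threads; ++i)
        futures.push_back(hpx::async(init_lua));  // one task per idle core
    hpx::wait_all(futures);
    // Caveat from the discussion: this only lands one task per core if no
    // other tasks are running; a dedicated "lua" pool makes it certain.
}
```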
<Yorlik> How would I pin a task to a thread?
<jbjnr_> we plan to allow pools to coexist on the same cores (so on a 4-core machine you could have a 4-core lua pool and a 1-core default pool and 1 core would have to run two worker threads, but we haven't enabled it yet)
<Yorlik> That might for example make sense for pipelining scenarios
<jbjnr_> step 1, create a lua pool (this example creates an MPI pool)
<jbjnr_> step 2, create an executor that is bound to that pool
<jbjnr_> step 3, async(executor, task)
<jbjnr_> step 1 must be done at startup, steps 2,3 can be done any time
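The three steps above might look roughly like this (pseudocode only: the resource-partitioner and executor names are approximations based on the discussion and the linked named_pool_executor test, not verified API):

```cpp
// step 1: at startup, register a pool with the resource partitioner
rp.create_thread_pool("lua");
rp.add_resource(rp.numa_domains()[0].cores()[0], "lua");  // give it one core

// step 2: any time later, create an executor bound to that pool
hpx::threads::executors::pool_executor lua_exec("lua");

// step 3: launch work through the executor
hpx::async(lua_exec, &my_task);
```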
<Yorlik> So I can pin to a pool, but not a thread in that pool? Would I need single thread pools then?
<jbjnr_> you could create N thread pools each with one core, but if the task never suspends, once it starts running it will stay on the core it started on, so it is sort of pinned by default.
<jbjnr_> we do support launching on a single core within a pool, but there isn't an example and the API is 'in flux'
<jbjnr_> so it would be better to use N pools like the first example I showed you
<Yorlik> So I cannot iterate over the threads within a pool, but just have several single threaded pools I could then iterate over?
<jbjnr_> (named_pool_executor test)
<jbjnr_> yes.
<Yorlik> Which would spoil any scheduling ofc.
<jbjnr_> it is possible to iterate over threads, but the API is a bit broken and you're better off using N pools and iterating over them with an executor for each
<Yorlik> But what about my scheduling then?
<Yorlik> I still would like to just give a task to the pool and not care where it's running
<Yorlik> I simply want a guaranteed initialization at start.
<jbjnr_> but you said these lua tasks run forever, so no other tasks can run there anyway?
<Yorlik> Nono
<Yorlik> The lua tasks are short
<Yorlik> But the task doing the main event loop runs forever
<Yorlik> It does the task construction
<jbjnr_> video chat?
<Yorlik> Like batching 100 objects for update into a task
<Yorlik> Sure
<jbjnr_> appear.in?
<Yorlik> Ya
<jbjnr_> I'll just make a coffee. Started meeting in stellar room of appear.in, join when you like
<Yorlik> OK
<jbjnr_> sorry cut you off abruptly at the end then
<Yorlik> NP - thanks for the time you took !
<Yorlik> Starting to talk about this project easily becomes a bit too open ended ;)
rori has joined #ste||ar
<zao> Yorlik: Can't scope-creep if you have no scope :P
<Yorlik> If only my English went far enough to understand that ...
<zao> Scope-creep being the act of growing something well-defined and simple far past its intended boundaries.
<Yorlik> Naw - it wasn't scope creep - it's just a big project. :)
<Yorlik> Many aspects from different directions
<zao> If you aim to do everything, nothing is out of bounds :)
<Yorlik> Technology, Art, Psychology, Philosophy, ...
<Yorlik> it's just complex, not infinite :)
hkaiser has joined #ste||ar
<jbjnr_> hkaiser: got a moment?
<hkaiser> jbjnr_: hey
<jbjnr_> the changes you made a month or two ago to the schedulers for fast_idle and scheduler mode...
<jbjnr_> every thread has its own 'mode' flags, so they check things like stealing independently - is there a use case for this? it seems like all the threads in a pool are likely to be controlled the same - is this used currently?
<hkaiser> this is to avoid false sharing
<jbjnr_> but false sharing only matters if the cache line is modified - if they all share the flags and are not changing frequently - then it doesn't matter
<jbjnr_> no?
<hkaiser> jbjnr_: is it really just a problem when things are modified?
<jbjnr_> well if 10 threads all use a var in a location, but it never changes value, then the cache isn't invalidated (as long as we use the cache_line_padding) so there's no need to reload the cache line etc etc
<hkaiser> jbjnr_: wikipedia: "When a system participant attempts to periodically access data that will never be altered by another party, but those data share a cache block with data that are altered, the caching protocol may force the first participant to reload the whole unit despite a lack of logical necessity."
<jbjnr_> it might get evicted from local cache - but that would happen anyway
<jbjnr_> the key phrase is "that are altered"
<jbjnr_> if the flags are mostly consts, then it doesn't matter
<hkaiser> it says 'will never be altered'
<jbjnr_> I don't think we need the duplication.
<jbjnr_> bbiab
<hkaiser> shrug, feel free to remove it if it has no perf effect
<jbjnr_> as long as nothing that shares the cache line is changed, then there is no need for any core to reload the line, so I think it's enough to pad it to the line and that's that. If the flag is changed after a parallel_for loop, then it will trigger a reload on each thread, but that would have happened anyway
<jbjnr_> I also noticed that you made a few tweaks to return values from wait_or_add_new and other functions. Could I ask you to do me a favour ...
<hkaiser> I don't think I did those changes, at least can't remember that I did
<jbjnr_> I'm still worried that I do the wrong thing in wait_or_add_new and I don't believe the comments around it are very clear, could you please have a look at the comment for wait_or_add_new and reword it to make it a bit more clear what should happen. then send me the text and I'll add it to my next commit/PR?
<jbjnr_> oh^ if it wasn't you then no matter, I'll make the changes and clean up as and when ...
<hkaiser> ok
<K-ballo> no those look rather old
<jbjnr_> https://github.com/STEllAR-GROUP/hpx/blob/6739ba0003e4429db2877f958574b840c1c07e5c/hpx/runtime/threads/policies/static_queue_scheduler.hpp#L107 the comment there. Then the queue is called wait_or_add_new and inside there is some stealing code and that kind of thing. Now I've modified this stuff heavily in my scheduler, but I have a nagging doubt that I am not behaving quite the way expected and I would like to know the expected actions in this function
<hkaiser> jbjnr_: I think the comment says it all, I'm not aware of any other things that should be done there
<jbjnr_> should stealing be done inside wait_or_add_new, or done in get_next_thread
<jbjnr_> (IYHO)
<hkaiser> jbjnr_: we do steal from ready queues in get_next_thread, but steal from staged queues in wait_or_add_new
<hkaiser> jbjnr_: but that's some heller has implemented, he will know best
<hkaiser> something*
<hkaiser> simbergm: I don't know how to make the transition to hpx::program_options smooth for the user
<hkaiser> no idea how to make this a gradual process
<hkaiser> if users have code that relies on boost::program_options it will simply fail to compile
<hkaiser> I could let the user decide at configure time whether to use boost, but that wouldn't enable us to issue any deprecation warnings
<jbjnr_> hkaiser: ok thanks. That seems fine then. I will check carefully what's in the other schedulers - I have a lot of conflicts that I think I've messed up in my stuff and I need to spend some time double checking everything.
<hkaiser> ok
<jbjnr_> simbergm: merge that threads stuff asap to give me time to clean up my stuff rebased onto it.
<jbjnr_> (when you're happy it doesn't break anything)
<hkaiser> or we issue the deprecation warning at configure time - the user might not relate that to his compilation problems, though
<K-ballo> templates..? duck typing?
<simbergm> jbjnr_: gotcha, it was clean before my last rebase so hopefully I can merge it once pycicle is ready (tomorrow)
<K-ballo> if it looks like a boost::program_options::description object, warn/reject?
<simbergm> hkaiser: can we alias hpx::program_options to boost::program_options for one release? have it off in the next release, switch it to the "real" hpx::program_options the next one
<simbergm> it should be compatible after all if we're copying it over
<simbergm> (not sure if it's changed significantly between 1.61 and 1.71)
<jbjnr_> hkaiser: or simbergm do any of the thread_xxx_executors work properly with the new thread pools if I want to put a thread on core 0, core 1, core 2, etc
<jbjnr_> yorlik was asking about this and I want to make a demo that uses them if they are functional. I'm hoping they can convert their thread numbers into hints that are passed into the schedulers properly and I can disable stealing
<simbergm> jbjnr_: afaik they support it as much as any other executor
<jbjnr_> ok, which executor can I say, bind to thread 0 on this pool, thread 1 on the pool etc?
<simbergm> I never remember how we pass schedule hints to async/apply but as long as the scheduler actually cares about hints, the executors will pass them along
<hkaiser> simbergm: we can do that, but we can't warn the user
<simbergm> jbjnr_: maybe it's not exposed... (I distinctly remember there was a way to do it, but I can't find an example now)
<simbergm> hkaiser: can we not?
<jbjnr_> we have a pool_executor, but I'm not aware of a 'core_executor'
<hkaiser> simbergm: how can we?
<jbjnr_> I would like to create one that works cleanly with the new RP stuff, but if one of the existing ones works, then it should be cleaned up and repurposed
<hkaiser> simbergm: if the user includes boost::po in his code and calls HPX using those objects
<jbjnr_> I'm not quite sure what the thread_pool_executor and os_executors are for
<hkaiser> jbjnr_: construct a pool that uses some cores only
<simbergm> hkaiser: right, I'm dumb... you're right
<K-ballo> we can tell boost::po from hpx::po
<hkaiser> simbergm: I would like to avoid replicating all of the init stuff for both libraries
<hkaiser> K-ballo: how so?
<simbergm> hkaiser: yep, that'd be a pain
<K-ballo> subtypes?
<hkaiser> derive our own types from the boost types, hmm, that might work
<K-ballo> only as far as detecting those, the subtypes won't show in their chained call interfaces or anything
<hkaiser> sure
<hkaiser> I see what you mean
<hkaiser> make the derived types implicitly constructable from the bases
<hkaiser> we could attach the deprecation warnings to those constructors
<K-ballo> even simpler
<hkaiser> that would create some unneeded copies, but hey
aserio has joined #ste||ar
<jbjnr_> did anyone answer my executor questions? I think they were lost in the other conversation
hkaiser has quit [Quit: bye]
<simbergm> the default_executor has other problems (slow) but that's in principle the easiest way to pass a hint to the scheduler
<jbjnr_> I never did narrow down what was causing the default executor to run slow. An allocation somewhere ....
<jbjnr_> still no one knows what they are for! you've been using them too!
K-ballo1 has joined #ste||ar
K-ballo has quit [Ping timeout: 248 seconds]
K-ballo1 is now known as K-ballo
<jbjnr_> heller: yt?
<heller> jbjnr_: hey
<heller> jbjnr_: when stealing happens should be totally irrelevant
<jbjnr_> I wanted to ask about your final comment there
<jbjnr_> can you tell me what those last two executors are supposed to do precisely
<heller> Sure
<heller> I think it's not 100% accurate
<heller> The executors mentioned were supposed to set up their own queues and avoid stealing from other queues
<jbjnr_> are they obsolete?
<heller> IIRC it never worked out as we'd like it to work
<jbjnr_> so that's a "yes, go ahead and delete them"?
<jbjnr_> :)
<heller> this_thread_executor: no
<jbjnr_> no, just the last two
<heller> The other two neither
<jbjnr_> they don't appear to be used
<heller> Well, they are useful
<jbjnr_> for what?
<heller> So we need different alternatives
<heller> Prior to the RM, we used them to decompose our local cores
<heller> While the RM could make them obsolete, there's still the last mile to go there, I think
<jbjnr_> can you tell me what they are supposed to do and then I will try to make them work, or replace them with something that does
<jbjnr_> we have a pool executor for spawning on a pool, but thread_pool_executor ?
<jbjnr_> and thread_pool_attached_executor ?
<jbjnr_> not clear what they want to do
<jbjnr_> or be
<heller> Right
<jbjnr_> the only place they appear in the code is in their definitions and a few unit tests, no examples or other tests of other things use them
<heller> So thread_pool_executor spawns a new thread pool on the specified cores
hkaiser has joined #ste||ar
<heller> The attached one just attaches to a subset of queues that are already part of a pool
<jbjnr_> oh. the embedded scheduler stuff.
<heller> The stream benchmark uses it, no?
<heller> Yes
<jbjnr_> don't see them in there
<heller> Hmmm
<jbjnr_> ok. I see that's the one for the right scheduler. I forgot about that
<heller> Yup
<jbjnr_> ok, so I should probably try to investigate that code too and see if it can be massaged into working with the RP and the thread hints and stuff
<heller> And it's not that it doesn't work...
<heller> Yes, that would be perfect
<Yorlik> I'd suggest to not delete too much stuff (yet). HPX isn't THAT famous yet, and you never know what crazy use cases people come up with. Someone had a reason to create that thing you want to delete in the first place ;).
<Yorlik> IIRC you usually have people from science using it, but now we are coming and trying to make a game with it. New use cases coming up, like "Give me a way to cleanly initialize" or "Give me persistent AGAS IDs" ...
<zao> I tried using HPX for applications once.
<zao> Then I took an arrow to the knee.
<Yorlik> Better an arrow in the knee than an arrow in the know ;)
<Yorlik> Created my first HPX components from Lua today :)
aserio has quit [Ping timeout: 264 seconds]
lsl88 has quit [Read error: Connection reset by peer]
aserio has joined #ste||ar
nikunj has joined #ste||ar
aserio has quit [Ping timeout: 248 seconds]
rori has quit [Quit: bye]
<nikunj> hkaiser: yt?
Yorlik has quit [Quit: Leaving]
<nikunj> is it possible to initialize a lambda capture in C++ 11?
<nikunj> K-ballo: ^^
Yorlik has joined #ste||ar
<hkaiser> nikunj
<hkaiser> nikunj: we need to create that reproducibility appendix for the paper
<nikunj> hkaiser: ohh
<nikunj> which would mean giving the scripts right?
<nikunj> diehlpk_work: thanks
<diehlpk_work> Here is a sample form, but have you ever logged in to the submission system?
<diehlpk_work> There you should see what kind of information they want to have
<nikunj> can we submit this paper as student paper?
<diehlpk_work> Just generate a submission for your workshop and upload what you have, you can update everything till the deadline
<nikunj> I'm not sure about the first point, but I pass all other points
<nikunj> diehlpk_work: I have not logged into the submission system
<diehlpk_work> Please do to check what information is needed for your workshop
<nikunj> let me create an account now
<diehlpk_work> Gabriel and I had several issues to upload our papers
<nikunj> ohh
<diehlpk_work> I had to contact the support to get the SC paper uploaded
<diehlpk_work> Therefore, you should upload in advance so you still have enough time to contact them
<nikunj> what should I use for Company/Institution section of contact info?
<nikunj> IIT Roorkee or STE||AR GROUP?
<diehlpk_work> First
<nikunj> ok thanks
<nikunj> diehlpk_work: Do I need to fill in STE||AR as 2nd Company/Institution?
<diehlpk_work> No, as long you have it on the paper it is fine
<nikunj> alright
<diehlpk_work> As the stellar group is not a company or an institute, it can lead to complications
<nikunj> ohh, makes sense
<diehlpk_work> Once, I had to provide a state and a country for stellar group
<nikunj> hkaiser: I have successfully created an account. I will be able to submit the paper
<nikunj> I will do the same tomorrow
<nikunj> diehlpk_work: thanks a lot for the help!
<nikunj> diehlpk_work: does my work come under student submission?
<nikunj> I'm not sure of the first point which says: "It must be original work by the student, in which the student is the primary contributor (e.g., responsible for at least 50% of the work)."
<diehlpk_work> I would not submit it as a student submission
<nikunj> ohh any reasons?
<diehlpk_work> I am not even sure what the difference is
<heller> diehlpk_work: Betelgeuse, Orion
<diehlpk_work> heller, Yeah, next time I will add it and see what happens
<diehlpk_work> nikunj, discuss with hkaiser about the student submission
<hkaiser> nikunj: submission should be possible again now, I pinged Keita
<nikunj> hkaiser: submission as in?
<nikunj> I didn't quite get what you mean to say
<nikunj> hkaiser: I've made the graph fonts bigger and also made the graph bigger as well
<hkaiser> nikunj: the paper submission site was closed this morning
<hkaiser> now it's open again, so you can go ahead and submit a first version asap
<nikunj> hkaiser: ohh
<nikunj> I didn't know that
<nikunj> hkaiser: I've also made the y axis to start from 0 for 1d stencil
bita has joined #ste||ar
<nikunj> hkaiser: please go through the graphs when you have time. I'll submit the paper post that
aserio has joined #ste||ar
<nikunj> hkaiser: also, I ported most of it to C++11. 2 errors still remain: first, initializing within a lambda capture (which I don't know is possible with C++11), and std::index_sequence. If you have a way to get around them, please let me know and I will make the change
hkaiser has quit [Quit: bye]
maxwellr96 has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
<hkaiser> nikunj: so we should create that appendix, diehlpk_work has given you the template already, has he?
<diehlpk_work> nikunj, Do you have to submit the form or the pdf document?
<diehlpk_work> nikunj, Have you seen that there is a bash script you have to run and provide the output?
<diehlpk_work> https://www.google.com/url?q=https://github.com/SC-Tech-Program/Author-Kit/blob/master/collect_environment.sh&sa=D&source=hangouts&ust=1567112862534000&usg=AFQjCNHewE8oxo5Qxs81KcUL9xGrsh3YYA
<hkaiser> diehlpk_work: thanks for helping with this!
diehlpk_work has quit [Remote host closed the connection]
diehlpk_work has joined #ste||ar
aserio has quit [Ping timeout: 246 seconds]
aserio has joined #ste||ar
aserio has quit [Quit: aserio]