aserio changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
diehlpk has joined #ste||ar
diehlpk has quit [Ping timeout: 248 seconds]
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 258 seconds]
hkaiser has quit [Quit: bye]
K-ballo has quit [Quit: K-ballo]
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 258 seconds]
pree has joined #ste||ar
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 258 seconds]
pree has quit [Remote host closed the connection]
pree has joined #ste||ar
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 255 seconds]
pree has quit [Ping timeout: 240 seconds]
pree has joined #ste||ar
pree has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
<zao> tests.unit.resource.throttle has timed out twice in 104+1 runs, but that's known I believe.
hkaiser has joined #ste||ar
<hkaiser> zao: the executor failures are known - I hope heller works on those
<hkaiser> (at least he's been promising to work on those for a while now)
<zao> Is it possible to run multiple test suites at the same time on a machine?
<zao> w.r.t TCP ports and whatnot?
<hkaiser> hmmm, probably not
<zao> Was considering setting up a single-machine SLURM with oversubscription, so I could run like 2 or 4 nodes on the machine.
<zao> My local Slurm guru tells me it shouldn't be rocket surgery, but then you've got shared port space among nodes.
<hkaiser> could be implemented, though
<zao> Nothing important, just idly wondering.
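A minimal sketch of how concurrent HPX test runs could be kept from colliding on TCP ports: the parcelport port comes from the hpx.parcel.port configuration entry (default 7910), which can be overridden per run when initializing the runtime. The specific port value and the config-vector overload of hpx::init below are assumptions about the HPX version in use, not something taken from the discussion above.

    // Sketch: give each concurrent test run its own parcelport port so several
    // suites can share one machine without fighting over the TCP port space.
    // Assumes the hpx.parcel.port config key (default 7910); the value 7911 is
    // arbitrary and would be varied per run.
    #include <hpx/hpx_init.hpp>

    #include <string>
    #include <vector>

    int hpx_main()
    {
        // ... actual test body would run here ...
        return hpx::finalize();
    }

    int main(int argc, char* argv[])
    {
        std::vector<std::string> const cfg = {
            "hpx.parcel.port=7911"    // unique port for this run
        };
        return hpx::init(argc, argv, cfg);
    }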
<heller> hkaiser: i'm working on those
<heller> I already pushed a partial patch, the only missing piece is the throttle stuff...
<hkaiser> heller: :D
<heller> I'll post a PR once the kids are in bed. Let's not waste too much time on this feature which won't be used for real anyways
<hkaiser> heller: remove the throttle scheduler
<heller> I'm not talking about that scheduler
<heller> I'm talking about the remove_processing_unit function provided by the RP
<hkaiser> huh?
<hkaiser> why do you think it's not needed? and btw, I wasn't even aware that this is a problem
<heller> See tests/unit/resource/throttle.cpp
<heller> Well, it works right now
<hkaiser> ahh, so it's not the throttle scheduler, but the RP functionality to remove PUs from a scheduler
<heller> Yes
<hkaiser> don't remove this, it's essential
<heller> Sure
<hkaiser> you said: 'Let's not waste too much time on this feature which won't be used for real anyways'
<heller> I never planned on removing it. Just on not testing it right now
<hkaiser> I will use it for real
<hkaiser> you lost me
<hkaiser> what are you fixing then?
<heller> I'm trying to fix it
<heller> Then I'm under the impression that I'm the only one using it
<hkaiser> what is 'it'?
<heller> Gtg, I'll get back to you later
<hkaiser> k
<zao> Ooh nice, distributed.tcp.migrate_component has failed outright once instead of sporadically timing out, https://gist.github.com/zao/0148bdf47a7372b17d8baef9eb300946
<zao> Hardware queues - cute
<hkaiser> zao: that's long overdue
pree has joined #ste||ar
pree_ has joined #ste||ar
pree has quit [Read error: Connection reset by peer]
<heller> hkaiser: ok, so the test in tests/unit/resource/throttle.cpp is using the RP to turn cores on and off. This is what I use for the throttling in allscale now, so from my side the special scheduler can go
<heller> the problem is that the changes needed to make that work properly mess with the regular shutdown detection. Reverting the shutdown detection to more or less what we had before breaks this unit test
<heller> for some reason, the background threads aren't shut down properly when removing one specific scheduling loop
<heller> but anything else seems to work properly for now.
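For context, a rough sketch of the pattern tests/unit/resource/throttle.cpp exercises: take processing units of the default pool offline through the resource partitioner machinery and bring them back. Only remove_processing_unit is named above; the pool accessor, the add_processing_unit counterpart, and all signatures below are assumptions and will differ between HPX versions.

    // Hypothetical sketch of turning PUs off and back on at runtime, as the
    // throttle test does. Accessor names and signatures are assumed, not taken
    // from a specific HPX release.
    #include <hpx/hpx_init.hpp>
    #include <hpx/include/resource_partitioner.hpp>

    #include <cstddef>

    int hpx_main()
    {
        // Assumed accessor for the default thread pool managed by the RP.
        auto& pool = hpx::resource::get_thread_pool("default");
        std::size_t const num_pus = pool.get_os_thread_count();

        // Take every PU except the first offline, then bring them back.
        for (std::size_t pu = 1; pu != num_pus; ++pu)
            pool.remove_processing_unit(pu);    // named in the discussion above

        for (std::size_t pu = 1; pu != num_pus; ++pu)
            pool.add_processing_unit(pu);       // assumed counterpart

        return hpx::finalize();
    }

    int main(int argc, char* argv[])
    {
        return hpx::init(argc, argv);
    }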
pree_ has quit [Ping timeout: 255 seconds]
<heller> and I can't find the place where this is happening right now :/
<heller> the thing is, I don't think anyone is actively using this feature right now... we should probably prioritize getting current master working again
pree_ has joined #ste||ar
<K-ballo> master is broken?
<heller> yes
jaafar has joined #ste||ar
pree_ is now known as pree
<github> [hpx] hkaiser created reporting_set_affinity_problems (+1 new commit): https://git.io/vdgJz
<github> hpx/reporting_set_affinity_problems a9079ca Hartmut Kaiser: Making error reporting during problems with setting affinity masks more verbose...
mcopik has joined #ste||ar
pree has quit [Quit: AaBbCc]
mcopik has quit [Ping timeout: 248 seconds]
mcopik has joined #ste||ar
jaafar has quit [Ping timeout: 240 seconds]
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
<github> [hpx] hkaiser force-pushed reporting_set_affinity_problems from a9079ca to 0573ee9: https://git.io/vdgCV
<github> hpx/reporting_set_affinity_problems 0573ee9 Hartmut Kaiser: Making error reporting during problems with setting affinity masks more verbose...