<mdiers[m]>
interesting, i'm also working on tensorflow right now. there is a tensorflow-rocm docker container. i got it running with singularity and made some first tests.
<tarzeau>
(built yourself using bazel, or pypi binaries)?
nikunj has quit [Ping timeout: 260 seconds]
nikunj has joined #ste||ar
<mdiers[m]>
i think it was still 1.15. my problem was to get rocm per rpm running on the system without affecting other things (vnc/mesa)
<mdiers[m]>
<tarzeau "(built yourself using bazel, or "> dockerhub
gonidelis has joined #ste||ar
<gonidelis>
jbjnr it's probably like 05:00 in the morning in Louisiana ;p . He will probably be logged in in about ~2 hours
<mdiers[m]>
I got it running, but I haven't gotten any further at the moment, because almost only nvidia is available and the priority is a c++/python interface.
<Yorlik>
The Data at smallish Object Counts is quite chaotic - not sure it's meaningful - I might have to improve the measurements here
<gonidelis>
As I am reading past PRs I can see that there is a directory called `hpx/parallel/segmented_algorithms`. What was that about? What is its present name?
<hkaiser>
Yorlik: here
<Yorlik>
Hello!
<Yorlik>
Did you see the image of the measurements I made yesterday?
<Yorlik>
I need to understand better what happened here and surely there might be errors
<hkaiser>
doesn't sound right
<Yorlik>
5 seconds single threaded for 200k Objects?
<gonidelis>
hkaiser is there a reason to have segmented_algorithms since ranges have been introduced?
<Yorlik>
That's 200K messages sent and processed and the corresponding calls into Lua
<hkaiser>
gonidelis: segmented algorithms operate on segmented (possibly distributed) data partitions, that's different
<hkaiser>
Yorlik: so this is ok?
<gonidelis>
gonidelis oh ok... thanks
<hkaiser>
Yorlik: so 25 us/object
<hkaiser>
not too bad, true
<Yorlik>
Yes
<Yorlik>
Including a call into a Lua State and running a script there.
<hkaiser>
nod
<Yorlik>
I was more interested about what it tells about our scalability
<hkaiser>
ms[m]: sorry for spamming you with review comments
<ms[m]>
hkaiser: no worries, sorry and thanks for looking through
<Yorlik>
And OFC the measurements have a lot of weaknesses - this is more a rough exploration of the situation than something compliant with scientific standards.
<ms[m]>
I didn't really test anything in the PR yet so that was expected...
<hkaiser>
Yorlik: if you want scaling plots, then plot something like objects/s or objects/frame
<Yorlik>
The numbers at the lower end for low object counts are bonkers
<hkaiser>
I'd plot objects/frame instead
<hkaiser>
because, that's what you're interested in, no?
<Yorlik>
Yes
<Yorlik>
There's a bunch of stuff I could do.
<Yorlik>
Maybe that graph, yes
<hkaiser>
the fps plot doesn't tell you anything as you might idle
<Yorlik>
And then fix some unhandled exceptions I encountered and improve the measurement
<Yorlik>
It's the unbounded updater - it never idles
<hkaiser>
then it doesn't make sense that you level off when going to higher core numbers
<Yorlik>
I think I have measurement errors when the frametime is too low
<hkaiser>
fps should theoretically go up linearly with number of cores
<Yorlik>
The curve for the higher object counts makes sense
<Yorlik>
And FPS is log scale on Y
<hkaiser>
doesn't make sense anyways
<hkaiser>
why is fps getting worse when running on more cores?
<Yorlik>
I think it's an artifact on the low object numbers
<Yorlik>
Might even be rounding errors
<hkaiser>
100 objects on 12 cores, that is ~8 objects per core
<Yorlik>
== a lot of overhead
<hkaiser>
that means the update should take about 200 us per cycle
<Yorlik>
I used the default chunker
<hkaiser>
so you should see scaling (not perfect scaling mind you)
<Yorlik>
I think I'll repeat the measurement with the autochunker
<hkaiser>
shrug
<hkaiser>
something is off with your measurements
<Yorlik>
The default chunker splits it up, even if it doesn't make sense at very low object counts
<Yorlik>
So it gets inefficient in this extreme edge case
<Yorlik>
OFC splitting up 8 objects across 8 cores with short update times doesn't make sense, right?
<Yorlik>
I think that is part of the artifact
<Yorlik>
I'll think of a way to automate the measurement, so I don't have to do it all manually (every data point is a manual run and processing of log data)
<gonidelis>
how can I find the target for compiling just `/algorithms` ?
<hkaiser>
gonidelis: make help | grep algorithms ?
<gonidelis>
hkaiser thank you!
<gonidelis>
hkaiser why do you use fwiterB and fwiterE on the iterators adaptation? I mean what do these letters stand for?
<hkaiser>
forward iterator begin/end
<gonidelis>
oh great! I was searching for sth like A,B or 1,2 but that makes more sense ;D =D
<gonidelis>
hkaiser I can see that in `for_each.hpp`, `HPX_CONCEPT_REQUIRES_` is used in the parameters of the template declaration. While in `reduce.hpp` (which is the newer + better version of iterator based algos) there is `std::enable_if` outside the template parameters. It is actually placed as the return type (??? correct me if I am wrong) of `reduce()`.
<gonidelis>
I remember you saying that we use the latter one in the MACROS to achieve the effect. So do we use `enable_if` vs `HPX_CONCEPT_REQUIRES_` according to the case, or do we just go with `enable_if` from now on as the more modern solution?
<hkaiser>
gonidelis: I don't remember why it's done one way here and another way there
<hkaiser>
the macro expands to enable_if anyways, so I think the reduce is older and has not been changed to use the macros
<gonidelis>
hkaiser ok i totally get it. I shall prefer going with the MACRO then... (do you think that we should gradually try to turn the `enable_if`s into MACROs?)
<hkaiser>
gonidelis: we can do that, the macros help especially if you have more than one condition
<gonidelis>
hkaiser ok I will keep it in mind as soon as I manage to adapt `for_each`. Just one last quite important question (sorry for the spam). We know that the `begin` should be different from the `end` iterator. What should be the type of the `algorithm_result<ExPolicy, Iter>` at the function's result type? I guess it's `IterB`, right?
<jbjnr>
hkaiser: I have a memory somewhere that you recently committed an executor wrapper of some kind. I'd like to see it, but I can't remember what it was called. Is it in master or a branch anywhere?
<hkaiser>
gonidelis: look at the spec (standard), I think it should be the begin iterator
<hkaiser>
ms[m], jbjnr, heller1: I sent a mail wrt sponsoring yesterday - care to respond?
<heller1>
Yorlik: so you're happy with the performance so far?
<Yorlik>
All in all yes - but I feel I need to understand more
<ms[m]>
hkaiser: where to? cscs.ch address?
<hkaiser>
hpx-pmc ml
<Yorlik>
The machine ofc: awesome. But I'd like to automate and improve the measurements
<heller1>
hkaiser: awesome, thanks!
<Yorlik>
heller: what's interesting is that on certain configurations of cores and workload I triggered exceptions - possibly races due to a lock I removed - I needed to reinstall it - will have to try to make it more fine grained.
<ms[m]>
hkaiser: thanks for pinging me, I found a bunch of pmc emails in my spam (sorry if there were some old ones you expected a reply on...)
<hkaiser>
Yorlik: that's expected - races tend to show up with higher core counts
<hkaiser>
mdiers[m]: any time
<Yorlik>
I'll have to investigate more - but first I want to fix some things and automate measuring. 98 manual datapoints tonight was a bit crazy
<Yorlik>
It also is error prone ofc.
<jbjnr>
all my pmc email goes to spam too ms[m]
<hkaiser>
darn, ms[m]: any time
<hkaiser>
jbjnr: that's where it belongs ;-)
<jbjnr>
and gsoc mostly :(
<jbjnr>
hkaiser: I will replace some of my limiting executor with cut'n'paste from your executor wrapper. I like yours better.
<jbjnr>
Mine was not forwarding properly
<hkaiser>
jbjnr: ok
<heller1>
hkaiser: i really like the idea of sponsorship and the general direction
<hkaiser>
heller1: great! just send a +1, then (if you don't mind)
<heller1>
Didn't I?
<hkaiser>
as an email?
<hkaiser>
haven't seen it (yet)
<hkaiser>
ahh got it now
<ms[m]>
hkaiser: just replied, very good initiative!
<hkaiser>
thanks!
<mdiers[m]>
hkaiser: short?
<hkaiser>
mdiers[m]: yah, sorry
<heller1>
How do I join the open collective?
<hkaiser>
register on their website and give me your nick, I'll add you to the hpx project
<hkaiser>
ms[m], jbjnr: same for you ^^
<mdiers[m]>
hkaiser: so go ahead
<hkaiser>
mdiers[m]: I mistyped your nick and accidentally highlighted your name, sorry
<bita_>
I think annotation-wise it is Okay, but I am not sure how to make an empty primitive
<hkaiser>
ok, what can I do?
<hkaiser>
you mean how to return an empty partition?
<bita_>
I get the error of {what}: Invalid array of elements: HPX(unhandled_exception) followed by invalid state: thread pool is not running: HPX(invalid_status)
<bita_>
yes
<hkaiser>
what do you return now?
<hkaiser>
a null-sized vector? nil?
<bita_>
On locality 1 I return annotate_d([], "array_1_sliced/1",
<bita_>
list("tile", list("columns", 0, 0)))
<hkaiser>
well, I'd need to run the code to see what's wrong
<ms[m]>
hkaiser: heller just fyi, daint is unlikely to come back up this week...
<hkaiser>
ms[m]: thanks for letting us know
<heller1>
hkaiser, gonidelis: FYI, 19:00 Fridays is very bad for me
<gonidelis>
heller1 we could change that then...
gonidelis63 has joined #ste||ar
<hkaiser>
heller1: what time would work for you?
gonidelis has quit [Ping timeout: 245 seconds]
gonidelis63 is now known as gonidelis
gonidelis has quit [Remote host closed the connection]
gonidelis has joined #ste||ar
<gonidelis>
oh i am so sorry. had a problem with my connection and missed your `base_iterator` messages =( =( if you could please repeat them i would appreciate it
<gonidelis>
rori
<rori>
sure
weilewei has quit [Remote host closed the connection]
diehlpk_work_ has quit [Remote host closed the connection]
weilewei has joined #ste||ar
diehlpk_work_ has joined #ste||ar
nan111 has quit [Remote host closed the connection]
nan111 has joined #ste||ar
karame_ has quit [Remote host closed the connection]
gonidelis has quit [Ping timeout: 245 seconds]
<heller1>
hkaiser, gonidelis: around 4 would be better
<hkaiser>
heller1: I can do Friday's 9am/4pm
<hkaiser>
rori: how about you?
<rori>
perfect for me
<hkaiser>
gonidelis: what time would that be for you? 6pm?
karame_ has joined #ste||ar
rtohid has quit [Remote host closed the connection]
<hkaiser>
heller1, rori: let's decide when he's back
<rori>
5pm for him I believe
<hkaiser>
k
rtohid has joined #ste||ar
akheir has joined #ste||ar
<bita_>
hkaiser, using primitive_argument_type(ast::nil{true}, attached_annotation) and representing the result with nil works for my problem. I will ask you if there is a better method in the personal meeting. So, debugging that is not a priority, thanks for the offer though
<hkaiser>
bita_: nod, thought so
nan111 has quit [Remote host closed the connection]
weilewei has quit [Remote host closed the connection]
nan111 has joined #ste||ar
rtohid has quit [Remote host closed the connection]
rtohid has joined #ste||ar
nan111 has quit [Remote host closed the connection]
rtohid has quit [Remote host closed the connection]
karame_ has quit [Remote host closed the connection]
akheir1 has joined #ste||ar
akheir has quit [Ping timeout: 240 seconds]
nikunj97 has quit [Quit: Leaving]
nikunj has quit [Read error: Connection reset by peer]
nikunj has joined #ste||ar
nan11 has joined #ste||ar
nikunj has quit [Ping timeout: 265 seconds]
nikunj has joined #ste||ar
nan11 has quit [Remote host closed the connection]
weilewei has joined #ste||ar
nan11 has joined #ste||ar
<K-ballo>
we are getting github sponsors now?
nikunj97 has joined #ste||ar
rtohid has joined #ste||ar
<weilewei>
K-ballo who?
<K-ballo>
STE||AR
<weilewei>
or you mean the Acknowledgements part in hpx github?