#ste||ar on 2021-03-13 — irc logs at irclog.cct.lsu.edu

2020-09-17 16:16 K-ballo changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/

00:26 nanmiao has quit [Quit: Connection closed]

02:25 K-ballo has quit [Quit: K-ballo]

02:37 surya69 has joined #ste||ar

02:38 <surya69> I have cmake version 3.16.3 ,Am id good to go for installing hpx

02:40 <zao> Probably.

02:41 <zao> I think there's a table of minimum and required versions of dependencies and tools in the documentation.

02:41 <zao> https://hpx-docs.stellar-group.org/latest/html/manual/building_hpx.html#prerequisites

02:42 <zao> Best way to find out is to try :D

03:08 diehlpk_work has quit [Remote host closed the connection]

03:31 hkaiser has quit [Quit: bye]

03:33 deepika_birthare has joined #ste||ar

03:36 deepika_birthare has quit [Quit: Connection closed]

03:36 deepika_birthare has joined #ste||ar

03:41 <deepika_birthare> Hello Everyone. I want to contribute to Stellar Project for gsoc 2021. I'm interested in UI Improvements for Performance Visualization, particularly.

03:42 <deepika_birthare> Is this the right place to discuss my queries?

03:43 <srinivasyadav227> yes

03:47 <srinivasyadav227> deepika_birthare: welcome and yes this is the place to discuss abt stellar ;)

03:49 <deepika_birthare> ok, thanks.

03:50 surya69 has quit [Ping timeout: 240 seconds]

03:57 deepika_birthare has quit [Quit: Connection closed]

04:05 <jedi18[m]> What's is_indirect_callable for? I don't need to add it to the tag_fallback_invoke overloads right?

04:53 <jedi18[m]> What I meant is that https://github.com/STEllAR-GROUP/hpx/blob/674f2076b3463c349375763a454851a0fd957e86/libs/parallelism/algorithms/include/hpx/parallel/container_algorithms/minmax.hpp#L174

04:53 <jedi18[m]> Do I need to add the is_indirect_callable only to the overload taking in an ExPolicy and Rng? Or all of them?

06:42 <gonidelis[m]> jedi18: expolicy + F(unction) / Predicate

06:45 <jedi18[m]> Oh ok so you mean only to the overloads which take in an execution policy and a predicate?

06:46 <gonidelis[m]> yes

06:46 <gonidelis[m]> check my older overloads to see what i replace that with (if i do. i think i don't)

06:46 <gonidelis[m]> i am actually 80% percent now that i don't

06:47 <gonidelis[m]> check any algo that takes a predicate

06:47 <gonidelis[m]> hint: the non-container algos seize to support projections (we are going full standard comforming here) and only the ranges ones do now ;)

06:49 <jedi18[m]> Ohh ok thanks, I was getting some bugs which was due to something else but thought it was related to this

07:35 <jedi18[m]> Are there no segmented algorithm tests for min_element/max_element/minmax_element?

07:38 <srinivasyadav227> could someone look at this gist https://gist.github.com/srinivasyadav18/4fff58ce8b6450ee65ef34e0f4eece88#file-cpp-L17

07:39 <srinivasyadav227> can't I pass **(int& i)** on line 17, I want to take each element in range as ref

07:41 <zao> "The signature of this predicate should be equivalent to: <ignored> pred(const Type &a);"

07:41 <zao> Bah, Google giving me ancient versions of HPX docs, but still :)

07:42 <zao> (still holds for current HPX, at least for the non-range algorithm)

07:42 <srinivasyadav227> I want to change the value

07:43 <srinivasyadav227> I mean if I pass const Type &a, its read only right

07:44 <srinivasyadav227> zao: btw, thanks for "2>&1 | tee compile.log", it helped me a lot! ;)

07:46 <zao> Non-ranges has a hpx::for_loop, which provides an iterator to the function object. Not sure if there's a ranges counterpart for that.

07:47 <zao> Oh wait, documentation was just sorted weird.

07:47 <zao> `for_each` takes the dereferenced element by const reference, while a `for_loop` takes an iterator which the function may dereference itself.

07:48 <zao> The signature fore the function object is likely directly inherited from the standard algorithm - see https://en.cppreference.com/w/cpp/algorithm/for_each

07:50 <srinivasyadav227> i saw the same example, there std::for_each(nums.begin(), nums.end(), []**(int &n)**{ n++; });

07:51 <srinivasyadav227> that int was taken as ref

07:58 <zao> (note that your highlighting with asterisks does not render at all in IRC)

07:59 <zao> An example, where?

07:59 <zao> Ah, in cppreference?

08:00 <srinivasyadav227> <zao "Ah, in cppreference?"> yes

08:00 <zao> Well, did you try the HPX algorithm, and how did it fare?

08:01 <srinivasyadav227> no..i didnt try that yet

08:01 <zao> I assumed from your initial question that there was a problem :)

08:02 <srinivasyadav227> yes, i wasnt able to use int& on lamda

08:02 <srinivasyadav227> for hpx::ranges::for_each

08:03 <zao> Note that the non-ranges implementation says "If the type of first satisfies the requirements of a mutable iterator, f may apply non-constant functions through the dereferenced iterator."

08:04 <zao> As for the documentation for the ranges implementation, they're largely nonsensical as they seem to document the wrong algorithm completely, talking still about iterators and sentinels.

08:04 <zao> Or again, apparently I might not be able to read and ranges:: contains both range and non-range algorithms?

08:04 <zao> Who the heck made this? :D

08:13 <zao> srinivasyadav227: This example seems to be using the overload with an explicit executor type and taking references just fine: https://github.com/STEllAR-GROUP/hpx/blob/master/examples/quickstart/partitioned_vector_spmd_foreach.cpp#L167-L169

08:15 <zao> Execution policy, but still.

08:15 <srinivasyadav227> oh, yea, but i don't know whats wrong with mine, yea i will try again if i could catch anything

08:16 <zao> I can't trust the documentation as it seems a bit weird...

08:16 <zao> What error do you get, and does it change if you specify a policy?

08:17 <zao> srinivasyadav227: Note also that your range may not be over mutable objects in the first place.

08:17 <srinivasyadav227> no.. its talking about tag_fallback_invoke

08:17 <srinivasyadav227> shall i paste the gist?

08:17 <zao> Not immediately familiar with boost::irange, but I'd guess that it's returning values or const refs.

08:19 <srinivasyadav227> oops, yea..its returning lvalue ref

08:19 <srinivasyadav227> sorry, thanks ;)

08:21 <zao> hah

08:23 <zao> Happy to help, even though I probably sowed a lot of confusion along the way :D

08:24 <srinivasyadav227> it was a bit silly from my side, ;) but tqsm

10:59 <jedi18[m]> I can't find tests for the segmented overloads of the minmax algorithms, will I have to add tests?

11:46 K-ballo has joined #ste||ar

12:12 Siddhant has joined #ste||ar

12:13 Siddhant has quit [Client Quit]

13:45 hkaiser has joined #ste||ar

14:27 <srinivasyadav227> hkaiser: i have edited datapar/transform_loop.hpp (#5240) which was previously giving build errors, now its building fine and i have installed locally.

14:27 <hkaiser> nice!

14:28 <srinivasyadav227> i did small performace testing against Vc and openmp

14:28 <hkaiser> it's much worse, is it?

14:28 <srinivasyadav227> https://gist.github.com/srinivasyadav18/170471b2adad7f485cac27de2e669000

14:29 <srinivasyadav227> it was better than sequential, but no where near omp simd

14:29 <hkaiser> nod

14:29 <hkaiser> is this a release build?

14:30 <srinivasyadav227> you mean hpx 1.6?

14:30 <hkaiser> your executable, did you build it in release or debug?

14:30 <hkaiser> what's your CMAKE_BUILD_TYPE?

14:30 <srinivasyadav227> i dont know about that

14:30 <srinivasyadav227> i mean i am not familiar with that

14:31 <srinivasyadav227> i will check out, should it be release or debug?

14:31 <hkaiser> what does it print if you add --hpx:info to the command line?

14:31 <hkaiser> it has to be release if you do performance measurements

14:32 <srinivasyadav227> where should i add --hpx:info ? to cmake ?

14:32 <hkaiser> no, your application

14:33 <srinivasyadav227> oh got it one min

14:35 <srinivasyadav227> i got this output : https://gist.github.com/srinivasyadav18/c70de1b7f0ac35223ee6792628cf2a4e

14:35 <hkaiser> ahh, it's --hpx:version, then - you you retry, please?

14:36 <srinivasyadav227> yea sure

14:36 <srinivasyadav227> its release mode

14:36 <hkaiser> ok

14:37 <hkaiser> so it's as I remember, the compiler can't optimize it sufficiently to get a real speedup :/

14:37 <srinivasyadav227> so, performance i low right?

14:37 <srinivasyadav227> ohh

14:37 <hkaiser> yes, it's barely faster compared to the seq case

14:38 <hkaiser> and the omp version is 4 times faster

14:38 <srinivasyadav227> yea, i have tested with various size inputs like 1 << 10, 1 << 20, 1 << 30 but mostly speedup was only 20% more

14:39 <hkaiser> right

14:39 <srinivasyadav227> we should adapt new ones i guess ;)

14:39 <hkaiser> this is where we need to simplify the overall code such that the compiler can actually optimize things properly

14:40 <hkaiser> one step at a time, though

14:40 <hkaiser> let's get things disentangled first, then we should adapt to the gcc simd types

14:41 <srinivasyadav227> ok, i will again check with direct Vc invocation

14:42 <hkaiser> srinivasyadav227: yes, might be a good test, use vc directly and compare its performance

14:43 <srinivasyadav227> ok ;)

14:46 hkaiser has quit [Read error: Connection reset by peer]

14:48 hkaiser has joined #ste||ar

15:36 <jedi18[m]> <jedi18[m] "I can't find tests for the segme"> Could someone please respond to this?

15:53 shubh has joined #ste||ar

15:53 <shubh> Hello Everyone

15:54 <srinivasyadav227> shubh: Hello!

16:07 <hkaiser> hey shubh, welcome

16:07 <shubh> Thank You hkaiser

16:08 <gnikunj[m]> hkaiser: should I look into execution space implementation now? (or would you really want me to hold my horses ;-) )

16:09 <gnikunj[m]> asking coz I'm bored and I'll either work on execution space or take a 2-3d vacation from using laptop :P

16:09 <hkaiser> gnikunj[m]: could we get the current stuff into hpx-kokkos? also, I'd like to see some perf analysis on the current implementation

16:09 <hkaiser> bored is not good ;-)

16:10 <gnikunj[m]> we should do a call with ms and see. the returning executor is a worthwhile feature that can make it back to hpx-kokkos.

16:10 <shubh> you are bored in starting phase

16:10 <hkaiser> gnikunj[m]: yes

16:10 <gnikunj[m]> Also, what sort of examples would you want me to implement for perf analysis?

16:10 <hkaiser> the same as we've used for the paper, just running on the device

16:11 <gnikunj[m]> so stencil + artificial

16:11 <hkaiser> nod

16:11 <gnikunj[m]> got it. Shouldn't be too difficult right now. Let me see what I can do ;-)

16:11 <hkaiser> this would still require some work, but would be worthwhile as it could be turned into a followon paper

16:12 <hkaiser> shubh: gnikunj[m] is working on things for a while now (2 or 3 years)

16:12 <gnikunj[m]> right. I'm thinking on how to get stencil implemented right now. As it stands, artificial one's should be simple enough to change the executor to get it working.

16:12 <gnikunj[m]> close to 3 years now :D

16:13 <gnikunj[m]> hkaiser: alright then. You have me boarded on some perf analysis. Should we report those in the next meeting though?

16:14 <srinivasyadav227> gnikunj[m]: wow!, thats really great ;)

16:14 <shubh> Okay Sorry gnikunj[m] then it make senseXD

16:14 <hkaiser> whatever we have ready by then

16:14 <gnikunj[m]> (they might increase their expectations)

16:14 <hkaiser> nah

16:14 <srinivasyadav227> hkaiser: theres no documentation for Vc?

16:14 <hkaiser> so far it's aligned with our interests, so it's ok

16:14 <gnikunj[m]> whatever you say captain ;-_

16:14 <gnikunj[m]> *;-)

16:15 <hkaiser> srinivasyadav227: https://vcdevel.github.io/Vc-1.4.1/

16:16 <srinivasyadav227> oh shit, i was struggling since one hour, going through their source code

16:28 <gonidelis[m]> jedi18 late reply sorry

16:28 <gonidelis[m]> I don't think we need segmented tests

16:30 <jedi18[m]> Oh ok thanks, then I guess the PR is almost ready, I just need to add those sentinel tests

16:33 <gonidelis[m]> ;) This are more of a custom tests of mine: that means you will probably have to figure out a test case of yours according to the algo functionality

16:33 <gonidelis[m]> but the structure is the same

16:34 <gonidelis[m]> (don't forget to add iter_sent.hpp utility)

16:35 <gonidelis[m]> jedi18: have you made the pr? i cannot find it on our list

16:35 <jedi18[m]> No no, I haven't yet

16:35 <gonidelis[m]> if not, you could open it up just to make sure CI is warm

16:35 <gonidelis[m]> you commit addiotional changes then

16:36 <jedi18[m]> I need to find an example to run the segmented overloads to see if it compiles right

16:36 <gonidelis[m]> plus we will have some more time for review

16:36 <gonidelis[m]> segmented example?

16:36 <jedi18[m]> Oh right ok, I'll create the PR soon

16:36 <gonidelis[m]> thanks!!!

16:36 <gonidelis[m]> :D

16:37 <jedi18[m]> Yeah some example of the code that I can run to test it

16:38 <jedi18[m]> Btw I notice there's a file called minmax_element_performance but I can't find the corresponding project to be able to run it

16:38 <hkaiser> jedi18[m]: we should have segmented tests for all the supported algorithms

16:38 <hkaiser> those should still work after your changes

16:38 <jedi18[m]> Yeah but none of the segmented tests use minmax

16:39 <hkaiser> ok, do those algorithms actually have segmented implementations?

16:40 <jedi18[m]> hkaiser: yes https://github.com/STEllAR-GROUP/hpx/blob/master/libs/full/segmented_algorithms/include/hpx/parallel/segmented_algorithms/minmax.hpp

16:40 <hkaiser> ahh

16:41 <hkaiser> so we were lazy ;-)

16:41 <hkaiser> feel free to create tests, then

16:41 <jedi18[m]> I've updated it to use tag_invoke, just need to test it now

16:41 <hkaiser> would creating tests be ok for you?

16:42 <jedi18[m]> Would I be able to figure it out by looking at the other tests?

16:42 <hkaiser> just based on the existing ones

16:42 <jedi18[m]> Well, wouldn't hurt to try :D

16:45 <hkaiser> good attitude ;-)

16:45 <srinivasyadav227> any reasons why "auto start = std::chrono::high_resolution_clock::now();" would be failing? its showing me template argument deduction failed

16:45 <hkaiser> why should it fail?

16:47 <hkaiser> see https://en.cppreference.com/w/cpp/chrono/high_resolution_clock/now for an example

16:47 <srinivasyadav227> /usr/include/c++/9/ostream:691:5: note: template argument deduction/substitution failed:

16:47 <hkaiser> not on that line, though - I'd assume

16:50 <srinivasyadav227> aah, ostream overload!

16:51 <shubh> Please tell me how I can proceed

16:51 <srinivasyadav227> i was able to use with hpx::chrono, but the same gave me error with std::chrono,, thanks to hpx ;)

16:52 <hkaiser> srinivasyadav227: no idea what your problem is

16:52 <hkaiser> shubh: what do you mean?

16:52 <srinivasyadav227> hkaiser: got it, any way, thanks!

16:53 <shubh> srinivasyadav227 am new here and I want to work on GSoC project

16:53 <shubh> I*

16:53 <hkaiser> shubh: sure - welcome

16:54 <hkaiser> have you read this: https://github.com/STEllAR-GROUP/hpx/wiki/Hints-for-Successful-Proposals?

16:54 <hkaiser> and this: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2021

16:54 <shubh> Okay

16:56 <shubh> So I start working on project proposal

16:57 <hkaiser> shubh: well first you need to decide what you want to work on

16:57 <jedi18[m]> hkaiser: Could you help me decide between https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2021#range-based-parallel-algorithms and https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2021#implement-shift_left-and-shift_right-parallel-algorithms ?

16:58 <jedi18[m]> Which would you recommend given the PRs I've already made and am working on?

16:58 <hkaiser> jedi18[m]: I think shubh said he was interested in the shift algorithms, but from my end, anything would be fine ;-)

16:58 <hkaiser> sorry, gotta run now

16:58 hkaiser has quit [Quit: bye]

17:00 <shubh> yup

17:02 <jedi18[m]> <shubh "yup"> Shoot, you beat me to it :D. Ok then I'll try ranges one for now, please do let me know if you decide to change your idea

17:03 <shubh> No, I have already talked to Hkaiser regarding Add shift_left and shift_right algorithms project.

17:03 <shubh> So, I am not interested in changing my project idea.

17:05 <jedi18[m]> Oh ok sure np

17:13 Ri2Raj has joined #ste||ar

17:17 shubh has left #ste||ar [#ste||ar]

17:21 <jedi18[m]> gonidelis: I've created the draft PR https://github.com/STEllAR-GROUP/hpx/pull/5241

17:22 <Ri2Raj> Hello everyone, myself Rituraj Dutta pursuing my Btech in Information Technology from Gauhati University of Science and Technology and currently I'm in my second year and I'm interested in the project mentioned in the list "To Implement Iterative Solvers". I've a decent knowledge in C++ mainly C++11/14 as I have been using it since high school. I

17:22 <Ri2Raj> also some knowledge on parallel programming with CUDA ( only the basics though). So, basically I'm new in this field of HPX. It would be of great hep from the community for little bit of help and support.

17:23 <Ri2Raj> *help

17:26 Ri2Raj has quit [Quit: Connection closed]

17:26 Ri2Raj has joined #ste||ar

17:38 <gonidelis[m]> Ri2Raj: wellcome

17:38 <gonidelis[m]> jedi18: great!!

17:38 <gonidelis[m]> welcome^^

17:40 <gonidelis[m]> don't forget the docs ;)

17:42 <Ri2Raj> Thanks gonidelis[m]

17:47 <jedi18[m]> <gonidelis[m] "don't forget the docs ;)"> Thanks, I'll add it in the next commit

17:56 Ri2Raj61 has joined #ste||ar

17:56 Ri2Raj61 has quit [Client Quit]

19:01 Ri2Raj has quit [Quit: Connection closed]

20:41 hkaiser has joined #ste||ar

21:07 jehelset has quit [Remote host closed the connection]