#ste||ar on 2020-12-30 — irc logs at irclog.cct.lsu.edu

2020-09-17 16:16 K-ballo changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/

01:53 <jaafar> I was browsing the code and noticed that there is a transform_loop_n that is not implemented in terms of loop_n - and the latter seems to have optimizations for e.g. SIMD

01:54 <jaafar> Should transform_loop_n use loop_n?

02:02 <hkaiser> jaafar: good question

02:38 K-ballo has quit [Quit: K-ballo]

02:53 hkaiser has quit [Quit: bye]

07:20 surbhi has joined #ste||ar

08:41 <gonidelis[m]> jaafar could you give a ref?

09:47 surbhi has quit [Ping timeout: 246 seconds]

10:07 surbhi has joined #ste||ar

12:14 K-ballo has joined #ste||ar

12:33 parsa has quit [Quit: Free ZNC ~ Powered by LunarBNC: https://LunarBNC.net]

12:38 parsa has joined #ste||ar

12:42 parsa has quit [Client Quit]

12:44 parsa has joined #ste||ar

12:45 parsa has quit [Client Quit]

13:10 parsa has joined #ste||ar

13:13 parsa has quit [Client Quit]

13:16 parsa has joined #ste||ar

17:28 <jaafar> gonidelis[m]: sure I was wondering why this code: https://github.com/STEllAR-GROUP/hpx/blob/5bd72df58b538c28460d94887c0cab05bd295441/libs/parallelism/algorithms/include/hpx/parallel/algorithms/exclusive_scan.hpp#L133

17:29 <jaafar> didn't use transform_loop_n when it was, in fact, a transform. So I replaced the code and it got slower

17:30 <jaafar> then I looked to see why that might be and saw things like this in loop_n: https://github.com/STEllAR-GROUP/hpx/blob/5bd72df58b538c28460d94887c0cab05bd295441/libs/parallelism/algorithms/include/hpx/parallel/util/loop.hpp#L153

17:30 <jaafar> loop unrolling, which isn't in transform_loop_n

17:31 <jaafar> it seemed like rewriting transform_loop_n to use loop_n ought in theory to not cost anything, and possibly give gains for users

17:45 <gonidelis[m]> hmmm

17:46 <gonidelis[m]> jaafar: By taking this into consideration, we should optimize `transform_loop_n` accordingly.

17:46 <jaafar> It seemed to me that just using loop_n might be enough

17:46 <gonidelis[m]> jaafar: I happen to work in the `transform` C++20 adaptation right now and I have encountered `transform_loop_n` but it was just the interface

17:52 <gonidelis[m]> jaafar: should we not check into the performance of `transform_loop_n` though? Just to see where it stinks?

17:52 surbhi has quit [Ping timeout: 260 seconds]

18:59 <jaafar> oh sure, whatever you think :)

19:00 <jaafar> I was just surprised to see that one was not implemented in terms of the other. transform_loop_n has a raw C++ loop instead.

19:06 <gonidelis[m]> jaafar cool thanks for letting know... I might come across this issue in next few weeks

19:06 <gonidelis[m]> I will update in that case

19:58 diehlpk_work has joined #ste||ar