#ste||ar on 2023-04-15 — irc logs at irclog.cct.lsu.edu

2021-08-06 22:55 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu

01:30 Yorlik_ has joined #ste||ar

01:34 Yorlik has quit [Ping timeout: 264 seconds]

01:57 hkaiser_ has quit [Quit: Bye!]

02:41 diehlpk has joined #ste||ar

02:50 diehlpk has quit [Quit: Leaving.]

05:44 K-ballo has quit [Read error: Connection reset by peer]

05:44 K-ballo has joined #ste||ar

06:41 K-ballo has quit [Ping timeout: 240 seconds]

06:41 K-ballo has joined #ste||ar

09:00 Yorlik_ is now known as Yorlik

13:38 hkaiser has joined #ste||ar

14:04 hkaiser has quit [Ping timeout: 260 seconds]

14:07 hkaiser has joined #ste||ar

16:13 K-ballo1 has joined #ste||ar

16:13 K-ballo has quit [Ping timeout: 252 seconds]

16:13 K-ballo1 is now known as K-ballo

17:45 HHN93 has joined #ste||ar

17:50 HHN93 has quit [Ping timeout: 260 seconds]

18:03 parsa[fn] has quit [Ping timeout: 260 seconds]

18:04 parsa[fn] has joined #ste||ar

18:28 HHN93 has joined #ste||ar

18:36 HHN93 has quit [Ping timeout: 260 seconds]

19:13 tufei has quit [Quit: Leaving]

19:25 tufei has joined #ste||ar

20:04 prakhar42 has joined #ste||ar

20:46 HHN93 has joined #ste||ar

20:47 <HHN93> the CI/CD tests for github commits, if one of the tests fail, does it mask other failures too?

20:47 <HHN93> or all tests guaranteed to run?

20:47 <hkaiser> it shouldn't

20:47 <hkaiser> depends on the error

20:47 <HHN93> what's wrong with our test suite though?

20:48 <hkaiser> if a test fails compiling it stops, if all tests compile, all will be run

20:48 <HHN93> it is only timeout errors from segmented algorithms?

20:48 <hkaiser> yah, those are known

20:49 <hkaiser> not your fault

20:49 <HHN93> any idea why they occur? is it some bug in our code?

20:49 <HHN93> or bug in the test suite

20:49 <HHN93> or bug in the distributed setup of rostam?

20:51 <hkaiser> HHN93: we don't know yet - nobody has investigated. I think pansysk75[m] plans to have a look

20:51 <HHN93> oh ok, I was thinking about having a look too. Wanted to know if there's some place I can start at

20:52 <HHN93> also, (https://github.com/STEllAR-GROUP/hpx/issues/6224) has there been any discussion on if we intend to implement it?

20:52 <hkaiser> not yet - I'm not sure if we should implement those

20:53 <HHN93> i guess we could have par counter parts if the user guarentees the fold op is commutative

20:53 <HHN93> using reduction

20:53 <hkaiser> the only argument I have would be to provide implementations for users that can't rely on C++20/23

20:53 <hkaiser> well, C++23 only

20:53 <hkaiser> HHN93: not only

20:53 <HHN93> associative*

20:54 <hkaiser> I think the spec implies sequential execution, but I'm not 100% sure

20:54 <HHN93> we can have an overload which accepts exPolicy parameter

20:54 <HHN93> I believe there was a talk where we claimed switching std:: to hpx:: should work, so implementing it seems to make sense

20:55 <hkaiser> ok

20:55 <hkaiser> HHN93: I'm not sure however how you could enforce commutativity

20:56 <hkaiser> I also think, we would require associativity as well

20:56 <HHN93> we need the user to guarantee that associativity

20:56 <HHN93> my idea was to do reduction

20:56 <hkaiser> we do have reduce()

20:57 <HHN93> I am arguing that fold ops should be added to HPX, just to maintain consistency with std::, I believe that was our goal

20:58 <hkaiser> nod, fair point

20:58 <HHN93> adding an overload to run fold in par is an idea I have, personally i am not very sure about it either as users might miss the fact that the op must be associative

21:00 <HHN93> hkaiser please look into (https://github.com/STEllAR-GROUP/hpx/pull/6216) and (https://github.com/STEllAR-GROUP/hpx/pull/6225) they fix some bugs in set_operations and make_heap

21:01 <HHN93> I will try to look into if there are more such bugs in other algorithms too and try to push before the next release

21:01 <hkaiser> HHN93: for 6216: didn't you say you wanted to create a test that verifies the fix?

21:02 <HHN93> I am not able to come up with a singular test to verify it, the random generator tests for all lengths from 1 to 8 so it should work

21:03 <hkaiser> why not?

21:03 prakhar42 has quit [Quit: Client closed]

21:03 <HHN93> bug depends on number of threads

21:03 <hkaiser> we can control the number of threads, can't we?

21:04 <HHN93> i wasn't aware we can control number threads for tests, but I am not sure how it matters. We would like to test if the bug occurs for any number of threads right?

21:05 <hkaiser> we would like to test that the problem is fixed at least for one case that we know failed before

21:05 <HHN93> there is also quite some uncertain behaviour with the bug, there was also out of memory accesses, ,which meant you could end up with the correct answer even if the bug exists. Testing over a large number of situations seems like the best option

21:06 <hkaiser> I don't disagree

21:06 <hkaiser> what I'm saying is that we have certain cases that fail, why not prove that those are fixed?

21:07 <HHN93> `we would like to test that the problem is fixed at least for one case that we know failed before`

21:07 <HHN93> ok I will add the known test, I agree it covers a lot of cases. If there is anything it doesn't the randomised should TCs take care of it

21:07 <hkaiser> yes

21:07 <hkaiser> I appreciate your help with this

21:07 <HHN93> np, just trying to learn

21:10 <HHN93> added test described it #6198 to PR (https://github.com/STEllAR-GROUP/hpx/pull/6216)

21:12 <hkaiser> nice, thanks! let's wait for the tests to cycle

21:13 <HHN93> btw do we have CDS, I haven't seen any HPX tutorial describing CDS

21:13 <hkaiser> what's CDS?

21:14 <HHN93> Concurrent data structures

21:14 <hkaiser> ahh

21:14 <hkaiser> we have some, stack, queue, dqueue

21:14 <HHN93> ok cool will look for them

21:16 <hkaiser> HHN93: at some point we tried to integrate HPX with libCDS (https://github.com/khizmax/libcds), but that was not finished

21:17 <HHN93> ok

21:18 <hkaiser> HHN93: there is some leftover code in HPX, even (HPX_WITH_LIBCDS=On enables that)

21:18 <HHN93> leftover code of our own CDS implementations?

21:21 <HHN93> hkaiser https://github.com/STEllAR-GROUP/hpx/tree/master/libs/core/concurrency/include/hpx/concurrency is this what you are referring to?

21:22 <hkaiser> yes, that's our data structures

21:23 <HHN93> do we intend to expand the data structures we provide to our users? or are we planning to work on libCDS integration instead?

21:28 <hkaiser> as long as the CDS relies on lock free operations, you can use any existing code with hpx - no need to integrate

21:28 <hkaiser> for instance from tbb

21:29 <hkaiser> the problem is that some CDS require interaction with the threading system (hazard pointers, for instance) - those need to be integrated into HPX's threading

22:41 HHN93 has quit [Quit: Client closed]