#ste||ar on 2022-06-11 — irc logs at irclog.cct.lsu.edu

2021-08-06 22:55 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu

01:51 hkaiser has quit [Quit: Bye!]

05:47 Yorlik has joined #ste||ar

06:43 K-ballo has quit [Ping timeout: 240 seconds]

06:43 K-ballo has joined #ste||ar

12:56 hkaiser has joined #ste||ar

13:44 K-ballo1 has joined #ste||ar

13:45 K-ballo has quit [Ping timeout: 248 seconds]

13:45 K-ballo1 is now known as K-ballo

14:50 <gonidelis[m]> https://github.com/STEllAR-GROUP/hpx/blob/8ace32a0acd0eaf8afab6b3f8f2044a68fced9dc/libs/core/tag_invoke/include/hpx/functional/detail/invoke.hpp#L60

14:50 <gonidelis[m]> K-ballo hkaiser what is `mem_obj` here? i see it takes no args

14:53 <hkaiser> that line has no mem_obj

14:53 <gonidelis[m]> https://github.com/STEllAR-GROUP/hpx/blob/8ace32a0acd0eaf8afab6b3f8f2044a68fced9dc/libs/core/tag_invoke/include/hpx/functional/detail/invoke.hpp#L55

14:54 <gonidelis[m]> `invoke_mem_obj` i mean

14:55 <hkaiser> that's a wrapper function object that -- when invoked -- returns the value of the member data item it wraps

14:55 <hkaiser> using the first argument as the this for the object to access

14:56 <hkaiser> (first and only argument)
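
The wrapper hkaiser describes can be sketched in a few lines of C++. This is a minimal illustration, not the HPX implementation: `mem_obj_wrapper` and `point` are hypothetical names, and the sketch only handles objects and references (not pointers or `std::reference_wrapper`).

```cpp
#include <cassert>
#include <utility>

// Hypothetical stand-in for the wrapper discussed above; not HPX's actual code.
template <typename T, typename C>
struct mem_obj_wrapper
{
    T C::*pm;    // pointer to the wrapped data member

    // Invoked with exactly one argument: the object whose member is accessed.
    template <typename Obj>
    constexpr decltype(auto) operator()(Obj&& obj) const noexcept
    {
        return std::forward<Obj>(obj).*pm;
    }
};

// Hypothetical example type.
struct point
{
    int x;
    int y;
};
```

Calling `mem_obj_wrapper<int, point>{&point::y}(p)` on an lvalue `p` yields a reference to `p.y`, so the single argument plays the role of `this`.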

15:06 <satacker[m]> hkaiser: gonidelis I was trying to implement sender_base, sender, and sender_to `concepts` as `tags`. For the sender requirements is it enough to query the completion signatures of the sender?

15:06 <satacker[m]> https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2022/p2300r5.html#spec-execution.senders

15:08 <satacker[m]> Or should I simply call the `is_sender_v` ?

15:13 <hkaiser> just use is_sender_v

15:13 <hkaiser> it does the right thing

15:13 <hkaiser> satacker[m]: ^^

15:14 <hkaiser> satacker[m]: why do you think the concepts should be CPOs?

15:14 <hkaiser> aren't those simply boolean compile-time conditions?

15:16 <satacker[m]> I thought of `sender_base` as a CPO, it's the base part that made me think so

15:18 <satacker[m]> Also I thought of tag dispatching

15:27 <satacker[m]> Question 2:

15:27 <satacker[m]> Can `sender_of` implementation be done as follows

15:27 <satacker[m]> -->Verify if it is a sender and then return the completion signatures using `get_completion_signatures` ?

15:29 <hkaiser> satacker[m]: we do have is_sender (https://github.com/STEllAR-GROUP/hpx/blob/master/libs/core/execution_base/include/hpx/execution_base/completion_signatures.hpp#L603-L609) and is_sender_to (https://github.com/STEllAR-GROUP/hpx/blob/master/libs/core/execution_base/include/hpx/execution_base/completion_signatures.hpp#L603-L609)

15:31 <hkaiser> sender_of<S, R> should return true/false depending on whether S is a sender and R is a receiver supporting the completion signatures of S

15:32 <hkaiser> our is_sender_to does not check the completion signatures part (yet), though

15:33 <satacker[m]> So I need to make sure that their return types match one of the completion signatures (value, error, stopped)?

15:35 <hkaiser> what's missing is the is_receiver_of<R, completion_signatures_of_t<S, env_of_t<R>>> trait

15:36 <hkaiser> once we have that, is_sender_of is trivial as it simply checks is_sender<S> && is_receiver_of<R, completion_signatures_of_t<S, env_of_t<R>>>

15:38 <hkaiser> well, we do have is_receiver_of, but it doesn't check the completion signatures part: https://github.com/STEllAR-GROUP/hpx/blob/master/libs/core/execution_base/include/hpx/execution_base/receiver.hpp#L168-L172

15:39 <hkaiser> it currently checks whether set_value would succeed

15:39 <satacker[m]> I didn't get why `is_receiver_of` is called to check `is_sender_of`

15:39 <hkaiser> satacker[m]: so I would suggest you look into fixing is_receiver_of first, then change is_sender_of

15:40 <hkaiser> is_sender_of returns true if S is a sender and R is a receiver that can handle the completion signatures of S

15:41 * satacker[m] sent a code block: https://libera.ems.host/_matrix/media/r0/download/libera.chat/679ff9b771f3938a469aaeec436233529f093b9f

15:41 <hkaiser> i.e. is_sender_of<S, R> == is_sender<S> && is_receiver_of<R, completion_signatures_of_t<S, env_of_t<R>>>
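
The composition hkaiser spells out can be illustrated with toy traits. The sketch below is an assumption-laden model, not the HPX or P2300 code: `signature_list`, `handles`, the `complete` member function, and the example sender and receiver types are all hypothetical, and the environment (`env_of_t<R>`) is left out entirely. It requires C++17.

```cpp
#include <cassert>
#include <type_traits>
#include <utility>

// Toy completion tags and signature list (hypothetical stand-ins).
struct set_value_t {};
struct set_stopped_t {};
template <typename... Sigs>
struct signature_list {};

// Toy is_sender: S advertises a nested completion_signatures type.
template <typename S, typename = void>
struct is_sender : std::false_type {};
template <typename S>
struct is_sender<S, std::void_t<typename S::completion_signatures>>
  : std::true_type {};

// Can R handle one signature Tag(Args...)? Here that means
// r.complete(Tag{}, args...) is well-formed.
template <typename R, typename Sig, typename = void>
struct handles : std::false_type {};
template <typename R, typename Tag, typename... Args>
struct handles<R, Tag(Args...),
    std::void_t<decltype(std::declval<R&>().complete(
        Tag{}, std::declval<Args>()...))>> : std::true_type {};

// is_receiver_of: R must handle every signature in the list.
template <typename R, typename Sigs>
struct is_receiver_of : std::false_type {};
template <typename R, typename... Sigs>
struct is_receiver_of<R, signature_list<Sigs...>>
  : std::conjunction<handles<R, Sigs>...> {};

// is_sender_of<S, R> == is_sender<S> && is_receiver_of<R, signatures of S>.
// The bool parameter avoids naming S::completion_signatures for non-senders.
template <typename S, typename R, bool = is_sender<S>::value>
struct is_sender_of : std::false_type {};
template <typename S, typename R>
struct is_sender_of<S, R, true>
  : is_receiver_of<R, typename S::completion_signatures> {};

// Hypothetical example types.
struct int_sender
{
    using completion_signatures =
        signature_list<set_value_t(int), set_stopped_t()>;
};
struct full_receiver
{
    void complete(set_value_t, int) {}
    void complete(set_stopped_t) {}
};
struct value_only_receiver
{
    void complete(set_value_t, int) {}
};
```

With these definitions `is_sender_of<int_sender, full_receiver>` holds, while `value_only_receiver` is rejected because it cannot handle the stopped signature. That per-signature check is the part the chat says is still missing from the existing trait.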

15:43 <satacker[m]> Thanks, that and the definitions clear it

18:44 <john98zakaria[m]> I have a very general design question.... (full message at https://libera.ems.host/_matrix/media/r0/download/libera.chat/2ab3b04b42ce1e3f43092198a8f3589102e2194a)

18:44 <john98zakaria[m]> I guess the question boils down to "should you use a future as a mutex?"

18:44 <john98zakaria[m]> * a mutex/barrier ?

18:47 <hkaiser> john98zakaria[m]: first - channels support moving data

18:47 <hkaiser> whether to use mutexes, futures, barriers or something else really depends on the use case

18:47 <john98zakaria[m]> hkaiser: If I do std::move I lose my buffer object

18:47 <hkaiser> use a barrier if you need to synchronize between several threads

18:48 <john98zakaria[m]> And then I need to reallocate it

18:48 <hkaiser> john98zakaria[m]: true

18:48 <hkaiser> if everything is on one locality only you can pass a reference to your data

18:48 <john98zakaria[m]> That's why I'll rebuild my own one message zero copy channel

18:49 <john98zakaria[m]> hkaiser: It's distributed

18:49 <hkaiser> we also have serialize_buffer, which passes references locally and copies remotely

18:50 <john98zakaria[m]> hkaiser: But then the receiver needs to allocate new memory each time

18:50 <hkaiser> essentially an array wrapper that manages ownership properly depending on whether it is sent locally or remotely

18:50 <hkaiser> it needs to allocate only in the remote case
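
The ownership behaviour described here (share locally, copy remotely) can be modelled with a toy type. This is only a sketch of the idea in the chat, not the serialize_buffer API: `toy_buffer`, `send_local`, and `send_remote` are hypothetical names, and the remote path stands in for serialization over the network.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical toy type; it only models the ownership behaviour described above.
struct toy_buffer
{
    std::shared_ptr<std::vector<double>> storage;
};

// Same locality: hand over a reference to the same storage, no allocation.
inline toy_buffer send_local(toy_buffer const& b)
{
    return b;
}

// Other locality: the data ends up in newly allocated storage on the receiver.
// The deep copy here stands in for serialize + transmit + deserialize.
inline toy_buffer send_remote(toy_buffer const& b)
{
    return toy_buffer{std::make_shared<std::vector<double>>(*b.storage)};
}
```

The local path leaves sender and receiver looking at the same memory, while the remote path allocates on every send, which is the cost john98zakaria wants to avoid.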

18:50 <hkaiser> but I see what you're saying, you want to put the data into an existing receive buffer

18:51 <john98zakaria[m]> hkaiser: Which is exactly what I am doing

18:51 <john98zakaria[m]> Exactly

18:52 <hkaiser> that will work only if the sender has some kind of handle that identifies the receive buffer

18:53 <hkaiser> John Biddiscombe (not here right now) was working on integrating RDMA with the network layer such that it places the received data directly where it should go

18:53 <john98zakaria[m]> My plan was to make 2 zero copy servers, each on a different locality.

18:53 <john98zakaria[m]> Each locality can write in its own server.

18:53 <john98zakaria[m]> The other uses its serialize buffer to grab the data from the remote locality

18:54 <hkaiser> and he wanted to use the channel API to hide all of this from the user

18:54 <john98zakaria[m]> That's exactly what I am looking for 😍

18:55 <hkaiser> if you really want to achieve zero-copy on the receiving end you'll need to drill a hole into the parcelport API or talk directly to the network

18:56 <john98zakaria[m]> I don't think I am that capable in c++ yet 😅

18:56 <hkaiser> now that I think about it, we might actually be able to do true zero copy without too many changes, just by using special types that have custom serialization support

18:57 <hkaiser> well, maybe not - the receiver still would need to allocate the buffer where the network puts the data

18:57 <john98zakaria[m]> If we have a way to "inject" the allocator, then it would work.

18:58 <hkaiser> serialize_buffer supports that, you're right

18:59 <hkaiser> and yes, the zerocopy_rdma example demonstrates that - I had completely forgotten about it

19:00 <john98zakaria[m]> hkaiser: I imagine the following

19:00 <john98zakaria[m]> Each locality has to allocate memory at channel registration where it wants its data to be put.

19:00 <john98zakaria[m]> If locality a sends a message larger than the buffer of b, a resize is triggered, otherwise the reception will happen copy free.

19:02 <john98zakaria[m]> That's how I would build my thing.

19:02 <john98zakaria[m]> I will unfortunately have to do 2 round trips for that, but it's still better than allocating hundreds of MBs

20:15 K-ballo1 has joined #ste||ar

20:16 K-ballo has quit [Ping timeout: 248 seconds]

20:16 K-ballo1 is now known as K-ballo

20:19 <hkaiser> john98zakaria[m]: if you do two roundtrips, then you can use the first to request an allocator that can be used with the second roundtrip to identify the receive buffer
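
The two-round-trip scheme discussed above can be sketched as an in-process model. Everything here is hypothetical (`receive_side`, `register_buffer`, `prepare`, `deliver`); it uses no HPX or network calls, and the `memcpy` merely stands in for the network layer writing into the identified buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical in-process model of the receiving locality.
class receive_side
{
public:
    using handle = std::size_t;

    // Channel registration: allocate the receive buffer once, return its handle.
    handle register_buffer(std::size_t capacity)
    {
        buffers_.emplace_back(capacity);
        return buffers_.size() - 1;
    }

    // Round trip 1: the sender announces the message size. The buffer is
    // resized only if the message does not fit. Returns true if it grew.
    bool prepare(handle h, std::size_t bytes)
    {
        auto& buf = buffers_.at(h);
        if (bytes <= buf.size())
            return false;
        buf.resize(bytes);
        return true;
    }

    // Round trip 2: the payload lands in the buffer named by the handle.
    void deliver(handle h, void const* data, std::size_t bytes)
    {
        auto& buf = buffers_.at(h);
        assert(bytes <= buf.size());
        std::memcpy(buf.data(), data, bytes);
    }

    unsigned char const* data(handle h) const
    {
        return buffers_.at(h).data();
    }

    std::size_t capacity(handle h) const
    {
        return buffers_.at(h).size();
    }

private:
    std::vector<std::vector<unsigned char>> buffers_;
};
```

As long as messages fit the registered capacity, `prepare` never reallocates and the receiver keeps reusing the same storage. Only an oversized message triggers the resize mentioned in the chat.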

20:31 Yorlik has quit [Ping timeout: 272 seconds]