hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu
K-ballo has quit [Quit: K-ballo]
hkaiser has quit [Quit: Bye!]
Yorlik has joined #ste||ar
john98zakaria[m] has joined #ste||ar
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<john98zakaria[m]>
I am trying to setup the summa algorithm, however my communicators are failing and I don't understand what the error message means
<john98zakaria[m]>
index is out of range for this base_and_gate: HPX(bad_parameter)
<hkaiser>
I'll try to run it later today (once I've had my coffee)
<hkaiser>
how do you run your executable, just one locality?
<john98zakaria[m]>
hkaiser: Thank you <3
<john98zakaria[m]>
I am running on a single node using mpirun -n 4
<hkaiser>
ok, so four localities - got it.
<hkaiser>
but your code initializes the communicators with 2 sites
<hkaiser>
all_gather by default will use the current locality index, so if your communicator expects only two connecting site this will lead to the error you see
<john98zakaria[m]>
Does a site mean a node or a process?
<hkaiser>
john98zakaria[m]: you might want to pass the corrcet this_site argument to all_gather to constrain things to the correct indicies
<hkaiser>
a site in your case means a process
<hkaiser>
by 'a site' we mean a unique endpoint participating in the collective operation
<hkaiser>
if you create a communicator with two sites, then the corresponding collective operation shouldn't use anything by zero or one as their site indicies
<hkaiser>
if you don't specify a this_site argument, then the seuqnce number of the locality (process) will be used - i.e. the MPI rank