hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
<heller_>
i thought you mentioned openmpi version 4 the other day (which seems to work fine for me)
mdiers_1 has joined #ste||ar
mdiers_ has quit [Remote host closed the connection]
mdiers_1 is now known as mdiers_
K-ballo has quit [Quit: K-ballo]
hkaiser has quit [Quit: bye]
nikunj has joined #ste||ar
david_pfander has joined #ste||ar
<Amy1>
which benchmark do you use to test memory peak bandwidth?
<jbjnr__>
there should be a tests/performance/local/stream benchmark, but it might not be maintained and need some tweaks
<jbjnr__>
we made changes o the way threads are assigned etc and the test might not have been updated
<Amy1>
someelse???
<jbjnr__>
not sure I understand the question
nikunj has quit [Quit: Leaving]
<Amy1>
which benchmark do you use to test memory peak bandwidth?
<Amy1>
I want to know a common used?
<jbjnr__>
tests/performance/local/stream.cpp
<Amy1>
could you give me a link?
<Amy1>
github link
<jbjnr__>
just click on the subdirs and you'll find it
<hkaiser>
simbergm: an assert before setting the state in thread_data may help
aserio has joined #ste||ar
<simbergm>
yeah, sounds good
eschnett_ has joined #ste||ar
<zao>
I took a look at the debian patches for OpenMPI, nothing that's intended to affect x86_64.
<zao>
Just arm64, HPPA, Hurd, and some build infra.
<zao>
debian/rules has a fair bit of build flag overriding, tho.
bita has quit [Read error: Connection reset by peer]
<zao>
I found this which seemed very close in description, but it doesn't change anything for me if I run with other transports (self,tcp or self,sm) or the suggested disable of CMA for the vader btl. - https://github.com/open-mpi/ompi/issues/4948
<zao>
I guess that it _could_ be container related still.
<zao>
heller_: Did you try your own OpenMPI inside or outside a container?
<zao>
Bah, that theory smashed, my EasyBuild-sourced OpenMPI works in the container.
<zao>
heller_: Even more interesting, if I just add that self-built OpenMPI to LD_LIBRARY_PATH for the container-built test, it passes.
<zao>
So there's something inherently hecked up with the distro's libopenmpi
<zao>
Time to get some actual work work done before the end of the day, this has been a fun rabbit hole :D
<Yorlik>
hkaiser: I shared a google doc folder with you for the persistent id_type requirements. You should have mail.
<hkaiser>
Yorlik: gotcha
<Yorlik>
:)
<Yorlik>
Can you see the google chat ?
<Yorlik>
Saw you typing already.
david_pfander has quit [Ping timeout: 268 seconds]
nikunj has quit [Quit: Leaving]
<zao>
I’m going to try OpenMPI on an actual machine tomorrow if I get time.
<zao>
Via distort
<zao>
*distro
jaafar has joined #ste||ar
eschnett_ has quit [Quit: eschnett_]
eschnett_ has joined #ste||ar
aserio has quit [Ping timeout: 264 seconds]
<heller_>
zao: outside a container
<heller_>
hkaiser: I am available now
<hkaiser>
heller_: sec
aserio has joined #ste||ar
Abhishek09 has joined #ste||ar
<Abhishek09>
hello guys
Abhishek09 has quit [Quit: Page closed]
<Yorlik>
hkaiser ?
<Yorlik>
Relocating an object in a store/load cycle as we thought of could actually be done while the system is running if - and only if - the application/developer knows 100% sure there are no remaining references that would need updating, or if there were some sort of reference bookkeeping in the application.
<Yorlik>
Just fantasizing about restructuring a long running application while it's running.
<zao>
Yorlik: I'm patiently waiting for you to find out that saving/loading Lua state is a royal pain in the back :D
<Yorlik>
Actually we will not have any mutrable state in the lua states
<Yorlik>
We will make extensive use of lua working on exposed userdata objects
aserio has quit [Ping timeout: 264 seconds]
bibek has quit [Quit: Konversation terminated!]
eschnett_ has quit [Quit: eschnett_]
aserio has joined #ste||ar
aserio has quit [Quit: aserio]
<zao>
Good news! Boost seems to have gotten GSoC this year.
<K-ballo>
"seems to" indeed
<heller_>
yay
<K-ballo>
how many years have we participated in GSoC? 5? 7?
hkaiser has quit [Read error: Connection reset by peer]
hkaiser has joined #ste||ar
<hkaiser>
Yorlik: I think this is limitation too strong
<hkaiser>
it is sufficient for an object to be 'migratable to storage' as long as there is no active thread scheduled or running
<hkaiser>
if other objects decide to schedule work on an object that is currently in storage this can be 'brought back to life' transparently as long as the application is running
<Yorlik>
hkaiser: My line of thoughts was just around optimizing a running simulation with lots of object migrations. It's a special case ofc.
<hkaiser>
Yorlik: I see 'migration to storage' (checkpointing) as a special case of object migration
<Yorlik>
How do you think about the remapping then which places AGAS responsibility and Object locality together again?
<Yorlik>
Do you think it could be done live efficiently?