hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | | HPX: A cure for performance impaired parallel applications | | Buildbot: | Log: | GSoC:
Amy1 has joined #ste||ar
akheir has quit [Quit: Leaving]
nan11 has quit [Remote host closed the connection]
nikunj has quit [Read error: Connection reset by peer]
nikunj has joined #ste||ar
bita has joined #ste||ar
Amy1 has quit [Quit: WeeChat 2.2]
Amy1 has joined #ste||ar
bita has quit [Read error: Connection reset by peer]
Yorlik has quit [Read error: Connection reset by peer]
Yorlik has joined #ste||ar
nikunj has quit [Ping timeout: 240 seconds]
nikunj has joined #ste||ar
nikunj has quit [Ping timeout: 258 seconds]
nikunj has joined #ste||ar
hkaiser has quit [Quit: bye]
nikunj has quit [Ping timeout: 256 seconds]
nikunj has joined #ste||ar
nikunj has quit [Ping timeout: 240 seconds]
nikunj has joined #ste||ar
weilewei has quit [Remote host closed the connection]
nikunj has quit [Ping timeout: 246 seconds]
nikunj has joined #ste||ar
<Yorlik> I'm getting exceptions in hpx with jemalloc in debug mode. Is there a known issue with the debug version of jemalloc? Should I not use it?
Vir has quit [Ping timeout: 264 seconds]
Vir has joined #ste||ar
Vir has quit [Changing host]
Vir has joined #ste||ar
<heller1> not that I am aware of
<heller1> what kind of exceptions do you get?
<Yorlik> read access violation in <vector> _Orphan_range(...) coming from some initialization code in hpx. None of my code is in the call stack. I'll retry and link against the release version of jemalloc, because that was what I changed.
<heller1> ok
<Yorlik> Weird. Same error with release build
<Yorlik> I'll make a full rebuild of HPX debug
hkaiser has joined #ste||ar
<Yorlik> The error persists. When I start it directly I get a Windows error (0x0000142). In the debugger it's the mentioned exception.
<heller1> aha
<heller1> what does the backtrace look like?
<Yorlik> No issues in release
<heller1> it's just debug?
<Yorlik> Yes
<heller1> can you show the full call stack please?
<Yorlik> Using jemalloc release both times. I wonder if it is unrelated to jemalloc actually.
<Yorlik> I am using jemalloc master
<heller1> I have no idea
<heller1> since when does this exception occur? What did you change?
<Yorlik> I changed just the automation of my jemalloc integration
<Yorlik> The variables are set correctly, as the output shows
<Yorlik> But I also pulled the last stable hpx
<heller1> did that change in automation mean you got a new version of jemalloc?
<Yorlik> So - the error could be elsewhere.
<Yorlik> Could be, yes
<Yorlik> their master is supposed to be stable
<Yorlik> Are you pinning jemalloc to some version?
<Yorlik> like their 3/4 branch?
<Yorlik> Or a specific tag?
<heller1> I usually pick a release
<heller1> then let it sit
<heller1> and update from time to time
<Yorlik> I'll try again and pin it to 5.2.1
hkaiser has quit [Ping timeout: 240 seconds]
K-ballo has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<hkaiser> Yorlik: could be a global initialization sequencing problem
<hkaiser> since we're changing things quite rapidly this could happen easily
<hkaiser> would be good to find out what variable is causing this
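The global initialization sequencing problem hkaiser mentions can be illustrated with a minimal sketch. This is not HPX code: `registry`, `register_name`, `first_slot`, `second_slot` and `check_registry` are hypothetical names chosen for illustration. The sketch assumes the failure mode is a namespace-scope object being used by another global's initializer before its own constructor has run, and shows the usual construct-on-first-use workaround.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical registry, not an HPX type. Construct-on-first-use: the
// function-local static is initialized the first time registry() is called,
// so it is guaranteed to exist before any caller touches it.
inline std::vector<std::string>& registry()
{
    static std::vector<std::string> instance;
    return instance;
}

// Hypothetical helper called from the initializers of other globals.
// Returns the number of registered names after the insertion.
inline std::size_t register_name(std::string name)
{
    registry().push_back(std::move(name));
    return registry().size();
}

// Globals whose dynamic initialization depends on the registry. If the
// registry were a plain namespace-scope vector defined in a different
// translation unit, its constructor might not have run yet at this point,
// and push_back would operate on an unconstructed object.
static const std::size_t first_slot = register_name("scheduler");
static const std::size_t second_slot = register_name("parcelport");

// Sanity check a caller could run from main().
inline void check_registry()
{
    assert(first_slot == 1 && second_slot == 2);
    assert(registry().size() == 2);
}
```

Within one translation unit the two globals are initialized in definition order; the accessor function only matters once the dependent globals live in different translation units, where the relative order is unspecified.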
hkaiser has quit [Ping timeout: 252 seconds]
nikunj has quit [Ping timeout: 265 seconds]
nikunj has joined #ste||ar
K-ballo has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
hkaiser has joined #ste||ar
<hkaiser> simbergm: please feel free to use the zoom meeting link I sent for the PMC meeting for the Kokkos/HPX meeting as well
<hkaiser> everybody else: here is the link for the PMC meeting at 9am CDT:
<hkaiser> (in case you'd like to join)
K-ballo has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
<simbergm> hkaiser: great, thanks! are you planning to join this time? I think it might be a short one...
<simbergm> freifrau_von_bleifrei: gdaiss I don't have any updates so it's all you for the kokkos meeting :)
<simbergm> is 15:45 enough?
<gdaiss[m]> ms: I think 15.45 is fine for us :)
<gdaiss[m]> freifrau_von_bleifrei: Or do you need more time?
<simbergm> we can do 15:30 as well if you feel like talking :P but I thought 15 minutes might be enough
<gdaiss[m]> Let's do 15.45! I think we don't have too many exciting updates either - it'll probably just be about what we are currently working on as well
<freifrau_von_ble> 👍️
<Yorlik> Is there a way to go backwards in the line of stable tagged releases? I want to find out if it was a recent stable which broke my build or if it was jemalloc.
<Yorlik> Would every commit not marked with a red X be suitable?
<heller1> Yorlik: I guess so
<Yorlik> Commit ca04ade7bf0883d5ca0343d983b0927dd3ae1d5a works for me
<Yorlik> heller1, hkaiser ^^
weilewei has joined #ste||ar
bita has joined #ste||ar
nikunj97 has joined #ste||ar
nan11 has joined #ste||ar
rtohid has joined #ste||ar
rtohid has quit [Remote host closed the connection]
K-ballo has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
aalekhnigam has joined #ste||ar
nikunj has quit [Read error: Connection reset by peer]
nikunj has joined #ste||ar
Nikunj__ has joined #ste||ar
aalekhnigam has quit [Remote host closed the connection]
nikunj97 has quit [Ping timeout: 256 seconds]
nikunj has quit [Ping timeout: 256 seconds]
nikunj has joined #ste||ar
aalekhnigam has joined #ste||ar
Amy1 has quit [Ping timeout: 256 seconds]
Amy1 has joined #ste||ar
nikunj has quit [Ping timeout: 250 seconds]
nikunj has joined #ste||ar
aalekhnigam has quit [Remote host closed the connection]
aalekhnigam has joined #ste||ar
aalekhnigam has quit [Remote host closed the connection]
nikunj has quit [Ping timeout: 256 seconds]
aalekhnigam has joined #ste||ar
nikunj has joined #ste||ar
aalekhnigam has quit [Ping timeout: 260 seconds]
nikunj has quit [Ping timeout: 265 seconds]
nikunj has joined #ste||ar
aalekhnigam has joined #ste||ar
aalekhnigam has quit [Ping timeout: 265 seconds]
aalekhnigam has joined #ste||ar
<simbergm> Yorlik: and the one after doesn't?
Nikunj__ has quit [Ping timeout: 240 seconds]
rtohid has joined #ste||ar
wate123_Jun has joined #ste||ar
aalekhnigam has quit [Remote host closed the connection]
rtohid has quit [Ping timeout: 240 seconds]
rtohid has joined #ste||ar
aalekhnigam has joined #ste||ar
<bita> hkaiser, my retile issue is resolved. It was a copy and paste error (forgot to change row to col, so one of the conditions was missing)
<hkaiser> bita: nod, as expected ;-)
<hkaiser> I'm glad you solved it!
<bita> :)
<diehlpk_work> hkaiser, Orsola's talk
aalekhnigam has quit [Remote host closed the connection]
aalekhnigam has joined #ste||ar
aalekhnigam has quit [Ping timeout: 250 seconds]
<weilewei> hkaiser I think I managed to get the indices correct and have smaller CUDA array allocation, yea! I will run more tests to verify my implementation.
<hkaiser> weilewei: \o/
<weilewei> :)
<zao> I've got a need to persist some data for visualisation. Is HDF5 still the least horrible library and format for that?
rtohid has left #ste||ar [#ste||ar]