aserio changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
<K-ballo> heller: maybe we should consider -fsyntax-only, caveats and all, but only for circle ci
Vir has quit [Ping timeout: 240 seconds]
EverYoung has joined #ste||ar
Vir has joined #ste||ar
EverYoung has quit [Ping timeout: 255 seconds]
<github> [hpx] hkaiser created enforce_function_initialization (+1 new commit): https://git.io/vdiP6
<github> hpx/enforce_function_initialization 9a01810 Hartmut Kaiser: Make sure the function vtables are initialized before use
<github> [hpx] hkaiser opened pull request #2945: Make sure the function vtables are initialized before use (master...enforce_function_initialization) https://git.io/vdiPi
<github> [hpx] hkaiser opened pull request #2946: Adding explicit feature test for thread_local (master...fixing_2137) https://git.io/vdiPX
hkaiser has quit [Quit: bye]
K-ballo has quit [Quit: K-ballo]
pagrubel has joined #ste||ar
pagrubel has quit [Ping timeout: 255 seconds]
<github> [hpx] sithhell force-pushed circle_2 from 39080e1 to a0cdcfc: https://git.io/vdKfd
<github> hpx/circle_2 a0cdcfc Thomas Heller: Switching to CircleCI 2.0...
<github> [hpx] sithhell closed pull request #2944: Attempting to solve affinity problems on CircleCI (master...fixing_affinity_issue) https://git.io/vdifI
<github> [hpx] sithhell pushed 1 new commit to fix_this_thread_executor: https://git.io/vdiQm
<github> hpx/fix_this_thread_executor a71173b Thomas Heller: Merge branch 'master' into fix_this_thread_executor
EverYoung has joined #ste||ar
EverYoung has quit [Ping timeout: 255 seconds]
david_pfander has joined #ste||ar
<jbjnr> heller: yt? I am getting an error at runtime that is "RTTI symbol not found for class 'hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<void (*(int&&, bool&&, char const*&&))(unsigned long, bool, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)>, hpx::lcos::detail::task_base<void> >'"
<jbjnr> any idea who might be looking for the RTTI and not finding it?
<heller> hmmm
<heller> not off the top of my head
<heller> what does the debugger say?
<jbjnr> it seems to be coming from somewhere deep inside async, but I didn't find the exact place where it happens. No actual error, just a message emitted by the runtime
<jbjnr> (or by libstdc++, more likely)
<jbjnr> ooh. maybe not. I found something. I'll be back in a mo
<heller> ok
<jbjnr> it's coming from the dereferencing of a boost::intrusive_ptr
<jbjnr> according to google, it's caused by gdb not finding a symbol ...
<heller> so it is inside of gdb after all?
<heller> and not in our code?
<jbjnr> correct
<zao> 16 cores "install components all", 6 cores "examples tests" - real 126m36.310s
<zao> 16 cores "install components all", 16 cores "examples tests" - real 90m12.573s
<zao> Seems like I gain time, even if I'm swapping :)
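For reference, the two wall-clock figures above can be compared with a few lines of Python. This is only an arithmetic sketch over the two `real` values quoted in the log; the helper name `parse_real_time` is made up for illustration and is not part of any build tooling.

```python
import re

def parse_real_time(text):
    """Convert a shell `time` value such as '126m36.310s' to seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", text)
    if match is None:
        raise ValueError(f"unrecognised time format: {text!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)

slow = parse_real_time("126m36.310s")  # 6 cores for "examples tests"
fast = parse_real_time("90m12.573s")   # 16 cores for "examples tests"

saved = slow - fast      # seconds gained by the second configuration
speedup = slow / fast    # ratio of the two wall-clock times

print(f"saved {saved / 60:.1f} min, speedup {speedup:.2f}x")
```

That works out to roughly 36 minutes saved, or about a 1.4x shorter run, for the 16-core "examples tests" configuration.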
EverYoung has joined #ste||ar
<heller> zao: yay!~
<heller> jbjnr: ahh. you can ignore that, gdb sometimes does this, I don't know where this comes from
EverYoung has quit [Ping timeout: 246 seconds]
<zao> heller: Installed an additional 16G of memory yesterday after running AMD's tests.
<heller> zao: wwee!
<zao> Now 8c/16t w/ 32G of mem.
<msimberg> what do i need for setting the number of threads from an hpx.ini file?
<msimberg> --hpx:ini=hpx.os_threads=n works for me but if i do hpx:config=hpx.ini it doesn't
<msimberg> and hpx.ini contains
<msimberg> [hpx]
<msimberg> os_threads = 1
<heller> hmmm
<heller> should work
<heller> what does the application print when adding a --hpx:dump-config?
pree has joined #ste||ar
<zao> I forget if I asked this before, but is there any way to build all the test binaries without running the test suite at the end?
<jbjnr> make tests.unit tests.regression
<jbjnr> tests.performance
<jbjnr> make help | grep tests. top three
<zao> Ah, great!
pree has quit [Read error: Connection reset by peer]
david_pfander has quit [Remote host closed the connection]
david_pfander has joined #ste||ar
pree has joined #ste||ar
<msimberg> @heller dump-config output is consistent, i.e. it still shows os_threads = 4 when i use the ini file, and with the other methods i can change it
<heller> msimberg: looks like a bug
<heller> I assume your machine has 4 cores?
<msimberg> if i put a made up config value in the ini file it shows up in dump-config so it's reading the file
<msimberg> yeah, 4 hyperthreads
<msimberg> also tried just using hpx.os_threads directly in the ini file, but it makes no difference
<msimberg> unless it's something one of you can fix very easily i could have a go at fixing it, would be a good exercise
<jbjnr> msimberg: Are you using a simple (hello world) example, or does the example use the resource partitioner? Looking at the code, the ini file won't be parsed at the right time if the RP is being used.
<jbjnr> the changes we made to use RP might have broken this - the threads setup is done first, before the rest of init
<msimberg> this was with 1d_stencil_4_parallel
<jbjnr> ok
<msimberg> which is not using the rp
<jbjnr> I'm testing hello_world right now
<jbjnr> it is broken
<jbjnr> file an issue
<jbjnr> we probably broke it with the changes to setup when RP was introduced
<msimberg> ok
<msimberg> thanks for testing
<jbjnr> need to move the reading of the ini file to before the thread binding stuff. what a pain.
pree has quit [Read error: Connection reset by peer]
<msimberg> @jbjnr i also tried the simple_resource_partitioner example
<msimberg> if i run it with simple_resource_partitioner --use-pools --pool-threads=3 it segfaults
<jbjnr> the RP example would definitely fail, because we only parse the command line for threads before calling init
<msimberg> i.e. there's only one thread left for the main pool
<jbjnr> it shouldn't segfault.
<msimberg> if i do --pool-threads=4 it correctly tells me that the main pool is empty
<jbjnr> any idea what the stacktrace is?
pree has joined #ste||ar
<msimberg> no stacktrace, can try in debug
<jbjnr> don't worry. I'm testing that stuff here anyway, once my build is working again, I'll try it
<msimberg> ok, thanks
<heller> everything is using the RP
<heller> so my guess is: the ini file gets parsed and the values set. After that, the RP runs and overwrites that value
<heller> something along those lines
<heller> would have to dig as well
<jbjnr> no. The RP only checks the basic command line options (threads/bind etc) in a quick parse, then it sets up stuff and calls the old init. The old init reads the ini file
<jbjnr> it's all happening too late.
<jbjnr> we'll need to add a parse of the ini file to the pre-RP init phase
<jbjnr> it'll only take hartmut 5 mins to fix it.
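The ordering problem described above can be illustrated with a small sketch. This is not HPX code: the function names, the `--threads=N` option and the rule that the command line wins over the ini file are all assumptions made for illustration. It only shows why a value read after the thread setup has no effect, and what moving the ini parse earlier would change.

```python
import configparser

INI_TEXT = "[hpx]\nos_threads = 1\n"  # same content as the hpx.ini quoted in the log

def parse_cmdline_threads(argv):
    """Quick pre-parse: look only for a hypothetical --threads=N option."""
    for arg in argv:
        if arg.startswith("--threads="):
            return int(arg.split("=", 1)[1])
    return None

def parse_ini_threads(ini_text):
    """Return os_threads from the [hpx] section, or None if it is absent."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    if parser.has_option("hpx", "os_threads"):
        return parser.getint("hpx", "os_threads")
    return None

def threads_ini_too_late(argv, ini_text, cores):
    """Thread count is fixed first; the ini file is read afterwards and ignored."""
    from_cmdline = parse_cmdline_threads(argv)
    threads = from_cmdline if from_cmdline is not None else cores
    parse_ini_threads(ini_text)  # parsed, but the thread count is already set
    return threads

def threads_ini_first(argv, ini_text, cores):
    """Ini file is consulted before the thread count is fixed."""
    from_cmdline = parse_cmdline_threads(argv)
    if from_cmdline is not None:
        return from_cmdline      # assumed precedence: command line wins
    from_ini = parse_ini_threads(ini_text)
    return from_ini if from_ini is not None else cores

print(threads_ini_too_late([], INI_TEXT, cores=4))
print(threads_ini_first([], INI_TEXT, cores=4))
```

With a 4-core default, the first function keeps 4 threads even though the ini file asks for 1, which matches the symptom reported with --hpx:dump-config; the second honours the ini value.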
K-ballo has joined #ste||ar
<heller> aha
<heller> alrighty. I have cmake and python scripts now that match build options with available software
<heller> and assigns workers etc.
hkaiser has joined #ste||ar
zbyerly_ has joined #ste||ar
<jbjnr> ". We are delighted to inform you that your submission entitled HPX -- A open source lib for Parallelism and Concurrency has been accepted! We have attached the reviews below. "
<zbyerly_> jbjnr, lol i was about to say the same thing!
<zbyerly_> wooop woop
<jbjnr> they have low standards indeed!
<zbyerly_> jbjnr, that's cool with me
<jbjnr> zbyerly will you be presenting it?
<zbyerly_> that's the plan, afaik
<zbyerly_> heller asked me to
<jbjnr> you'll be the only one there probably!
<jbjnr> I think the reviewers' expectations were much higher. for an open source workshop I thought the material in the paper was about right. Not too much detail, just an overview of the project.
<zbyerly_> we got 0, 0, 2
<heller> 2 borderline paper, ouch
<zbyerly_> "If selected for a talk, I would like to suggest the authors consider offering a modest demonstration of their software library."
<zbyerly_> Hello world from locality 0!
<heller> lol
<zao> We need to show that HPX works? Oh gods.
<zbyerly_> review 1: "The writeup to me looked like a summary of HPX."
<zbyerly_> uh.. yup
<heller> that kind of was the intention ;)
<zbyerly_> they dinged us about our sloppy paper for sure
<jbjnr> well it's a shit workshop so they got what they deserved
<zbyerly_> jbjnr, you are so negative!
<jbjnr> absolutely.
<jbjnr> "Singing the Blues isn't about making yourself feel better, it's about making everyone else feel as bad as you"
<zbyerly_> jbjnr, yeah life sucks and then you die
<jbjnr> I didn't realise the quote came from there.
<zbyerly_> we did something stupid that made it look like we had already submitted the paper to another place
<zbyerly_> and two of the reviewers dinged us for that
<zbyerly_> if we hadn't done that we would have gotten accepted even harder
<jbjnr> I shan't be losing any sleep over it
<zbyerly_> some of the criticism of the paper was "I would like to hear a lot more about every section"
<zbyerly_> one reviewer said "this section could be an entire paper. next section: this section could also be an entire paper"
<hkaiser> doesn't matter - it got accepted - congrats everybody
<zbyerly_> lol the camera ready deadline is today??
<jbjnr> you'd better pull your finger out then and get off irc...
<hkaiser> lol
<zbyerly_> it must be a week later, since the author notification was a week late
<hkaiser> zbyerly_: pls talk to Adrian about travel to SC, also the early bird deadline for SC registrations is this Sunday
<zbyerly_> hkaiser, ok, will do.
eschnett has joined #ste||ar
<heller> Do my messages to hpx-users not get through?
<jbjnr> I see one from you today (early this morning) and another from yesterday ...
<jbjnr> you don't get your own messages though
<heller> Just wondering
harsh_ has joined #ste||ar
<heller> hkaiser: yeah, #2938 is finally green...
<hkaiser> heller: nice!
<hkaiser> there was no other option anymore...
<hkaiser> couldn't have been anything else
<heller> Yeah, thanks for the fix
<hkaiser> K-ballo: could you review #2945, pls?
<hkaiser> shouldn't take long
<K-ballo> hkaiser: looks good
<hkaiser> I was not sure if your intention was to make everything constexpr, which this patch would not allow
<K-ballo> it can't be, because of the function registration
<K-ballo> the non serializable vtables are constexpr, so that's a regression on them, but I don't know how to split the cases easily
<hkaiser> right
<hkaiser> thanks
hkaiser has quit [Quit: bye]
zbyerly_ has quit [Ping timeout: 246 seconds]
pree has quit [Read error: Connection reset by peer]
aserio has joined #ste||ar
eschnett has quit [Quit: eschnett]
EverYoung has joined #ste||ar
pree has joined #ste||ar
pree has quit [Read error: Connection reset by peer]
EverYoung has quit [Ping timeout: 258 seconds]
eschnett has joined #ste||ar
heller has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
heller has joined #ste||ar
david_pfander has quit [Ping timeout: 240 seconds]
hkaiser has joined #ste||ar
harsh_ has quit [Quit: Leaving]
aserio has quit [Ping timeout: 248 seconds]
<heller> How much green will we get now?
<K-ballo> no matter how many times you say it, I keep interpreting that in a different way
<heller> I mean green as in all tests pass. Not the orange green
<zbyerly> jbjnr, heller the camera ready deadline has been pushed out by one week.
<K-ballo> heller: hah! it actually makes me think about money
aserio has joined #ste||ar
<heller> K-ballo: lol
<heller> zbyerly: yay
<heller> Lots to do
EverYoung has joined #ste||ar
aserio has quit [Read error: Connection reset by peer]
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
aserio has joined #ste||ar
pagrubel has joined #ste||ar
aserio1 has joined #ste||ar
aserio has quit [Ping timeout: 240 seconds]
aserio1 is now known as aserio
rod_t has left #ste||ar [#ste||ar]
pree has joined #ste||ar
diehlpk has joined #ste||ar
<heller> Stupid throttle test
<heller> hkaiser: fine with disabling the throttle test for now?
<diehlpk> src/tcmalloc.cc:284] Attempt to free invalid pointer 0x297f840
<diehlpk> Aborted (core dumped)
<diehlpk> Does anyone know where this error comes from?
pree has quit [Ping timeout: 264 seconds]
<heller> diehlpk: could be a double free or so
<heller> gdb will tell you
<heller> The error itself is pretty useless on its own
<hkaiser> heller: what does disabling the test give us?
pree has joined #ste||ar
<diehlpk> heller, gdb locates this error inside of hpx
<diehlpk> hkaiser, Could it be related to blaze with hpx support?
rod_t has joined #ste||ar
<hkaiser> diehlpk: shrug, no idea - everything is possible
<diehlpk> Ok, will investigate
<hkaiser> : nod
<diehlpk> blaze solver is two times faster than libeigen
<hkaiser> cool!
<hkaiser> very nice
<diehlpk> So the solving step is much shorter
pree has quit [Ping timeout: 240 seconds]
<diehlpk> Yes, Prashant started to use HPX too
<diehlpk> He is contributing heavily to the code
<hkaiser> :D
hkaiser has quit [Quit: bye]
<heller> hkaiser: green status
<heller> diehlpk: well, what does the stacktrace say?
<heller> diehlpk: also, which code leads to the error and can it be reduced to a small, self-contained testcase?
<heller> aserio: we should make a blog post about HPX+Blaze
<heller> aserio: a really nice step forward
<diehlpk> heller, I replaced in my code libeigen with blaze and enabled hpx support
<heller> diehlpk: that still doesn't tell me anything ;)
<diehlpk> I can invite you to the private github repo
<heller> a reduced testcase would be even better
<diehlpk> Yes, for the reduced testcase, I have to locate the error
<heller> diehlpk: aha, that's odd. the error is happening on startup
<heller> or shutdown even
<diehlpk> Shutdown
<heller> alright
<heller> one thing that might lead to the error is that you are mixing debug and release builds (your application is debug, HPX is release)
<diehlpk> I will check
<heller> it's a strange error, I haven't seen that at all
aserio has quit [Read error: Connection reset by peer]
pree has joined #ste||ar
pree has quit [Read error: Connection reset by peer]
pree has joined #ste||ar
<pree> heller: Does HPX.compute have any documentation?
<pree> Thanks
<diehlpk> heller, Ok, rebuilt both with release and now this
<heller> diehlpk: the first thing i'd check is if it happens with an empty hpx_main. And then slowly enable piece by piece until i get the failure again, you can then pretty easily isolate the problem
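The isolation procedure suggested above (start from an empty hpx_main, then re-enable the program piece by piece until the failure returns) can be sketched generically. Everything here is hypothetical: the step names and the `fails` predicate stand in for rebuilding and rerunning the real application with only some of its pieces enabled.

```python
def isolate_failure(steps, fails):
    """Enable steps one at a time and report the first one that triggers a failure.

    steps -- ordered list of pieces of the program (hypothetical names)
    fails -- callable taking the list of enabled steps, returning True on failure
    """
    enabled = []
    if fails(enabled):
        # Even the empty main fails: the problem lies in the setup, not the steps.
        return "empty main"
    for step in steps:
        enabled.append(step)
        if fails(enabled):
            return step
    return None  # the failure never reproduced

# Hypothetical example: the crash only shows up once the solver is enabled.
steps = ["read_input", "assemble_matrix", "solve", "write_output"]
culprit = isolate_failure(steps, lambda enabled: "solve" in enabled)
print(culprit)
```

In this toy run the reported culprit is "solve"; in practice each call to `fails` corresponds to one rebuild and run of the application.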
<heller> alright
<heller> there you go
<heller> what's the content of Main.cpp?
pree has quit [Quit: AaBbCc]
<diehlpk> This is my content of main
<heller> Show me everything of that file please
zbyerly_ has joined #ste||ar
<diehlpk> There you go
aserio has joined #ste||ar
<aserio> heller: I will look into it!
<heller> aserio: great! thanks
<heller> aserio: maybe a small performance comparison with OpenMP and Boost.Thread would be nice
<heller> parallel mkl also
zbyerly_ has quit [Quit: Leaving]
<heller> how do you compile this file?
<heller> diehlpk: ^^
aserio has quit [Ping timeout: 255 seconds]
<diehlpk> add_hpx_executable(PeridynamicHPX SOURCES src/Main.cpp DEPENDENCIES Util IO Material Problem ${BLAS_LIBRARIES})
<diehlpk> heller, It was compiling and running before
<heller> ok
<heller> one sec
<diehlpk> This thing appeared when I started to use blaze with hpx support
<heller> diehlpk: can you do a "grep -rl hpx_main *" in your source directory please?
<diehlpk> heller, see pm
<heller> something is odd with your setup
<diehlpk> Ok, I can recompile hpx and my code again
<diehlpk> heller, deleted the build folders of both and rebuilt both of them.
<diehlpk> Error is still there
<heller> the linker error?
<diehlpk> Yes
<diehlpk> Ok, if i specify Release in the Cmake of the project it is not working.
<diehlpk> Specifying it empty and it compiles
<diehlpk> With empty build type it compiles at least
<heller> empty build type is usually debug
diehlpk has quit [Ping timeout: 255 seconds]
aserio has joined #ste||ar
pagrubel has quit [Ping timeout: 240 seconds]
hkaiser has joined #ste||ar
<github> [hpx] hkaiser pushed 13 new commits to master: https://git.io/vdXlJ
<github> hpx/master d52b477 Denis Blank: Implement an API for asynchronous pack traversal...
<github> hpx/master 0139b9d Denis Blank: Prepare traverse_pack_async for using it inside when_all
<github> hpx/master e9e8878 Denis Blank: Inherit the visitor rather than embedding it as a class member...
<hkaiser> K-ballo: ^^
<hkaiser> async pack has been merged
<K-ballo> alright.. time to dive into the dataflow overload set
<K-ballo> fun.
jaafar has joined #ste||ar
eschnett has quit [Quit: eschnett]
<aserio> hkaiser: what does nil{} evaluate to?
<hkaiser> nil{}
<hkaiser> nil, nada, nothing
<hkaiser> phylanx::execution_tree::valid(nil{}) will be false
<aserio> I am trying to pass something to test_generate_tree
<aserio> I tried to create an instance of nil to compare it to the result but it did not like that
<hkaiser> use phylanx::execution_tree::valid instead
<zao> `nil`? Sounds like someone doesn't plan on working under an Obj-C++ compiler :)
<hkaiser> lol
<github> [hpx] biddisco force-pushed alloc_membind from 489d456 to 6524d7c: https://git.io/vdnl9
<github> hpx/alloc_membind 86cb4cd John Biddiscombe: Add hwloc alloc_membind support and nodeset bitmap type...
<github> hpx/alloc_membind d6807b7 John Biddiscombe: Move hwloc_bitmap_ptr to threads namespace
<github> hpx/alloc_membind e7a7f1d John Biddiscombe: Add ostream operator for hwloc bitmap printout
<jbjnr> does circleci use the new workflows stuff now for PRs, or is it still using the old one
<hkaiser> still old, I think
<hkaiser> heller has not created a PR for this yet
aserio has quit [Quit: aserio]
rod_t has left #ste||ar [#ste||ar]
jaafar has quit [Ping timeout: 248 seconds]
Aalice has joined #ste||ar
<Aalice> I was wanting to build a NERSC module for 1.0.0. Before I downloaded 0.9.99.tar.gz with wget. Is there a new way to do that now? I don't see hpx_1.0.0.tar.gz under stellar.cct.lsu.edu/files
<Aalice> In particular, I'm trying to update the directions we have under:
<Aalice> and replace the old builds in any module files by doing a new build first. Thanks for help.
<zao> The github releases may be the way to go - https://github.com/STEllAR-GROUP/hpx/releases/tag/1.0.0
<zao> http://stellar.cct.lsu.edu/downloads/ seems to link to http://stellar.cct.lsu.edu/files/hpx_1.0.0.tar.gz f.ex. too, but the directory itself doesn't have listings.
<Aalice> thanks everyone.
<Aalice> I'm wondering if I still need to swap out for older modules of gcc, etc. , but I'll see what happens. And there is no longer a specific knl branch is there?
<hkaiser> Aalice: no special knl branch, just a special cmake toolchain file