hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/ | GSoC: https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2020
<hkaiser> Yorlik: I'm using jemalloc, that's a big difference
<Yorlik> So - worth it?
<hkaiser> absolutely
<Yorlik> Lua does a lot of small allocations
<Yorlik> That might actually be a gamechanger then
<Yorlik> I'll give it a try. Is there anything special i should have in mind when integrating it?
<hkaiser> jemalloc is not replacing the system allocator, though, but must be used explicitly (that's what we do in hpx)
<hkaiser> at least not on windows
<Yorlik> So Lua won't use it automagically?
<Yorlik> We might patch Lua if it's really worth it.
<hkaiser> right, except if you can tell lua to use it
<Yorlik> does it replace malloc or is it a special function?
<hkaiser> we have a special c++ allocator we use everywhere
<hkaiser> on windows it does not replace malloc
<Yorlik> I'll read the jemalloc docs
<Yorlik> tcmalloc doesn't work on windows, does it?
<hkaiser> Yorlik: it does, but I have never used it
<Yorlik> OK
<Yorlik> Might try both
<hkaiser> hpx might even support it on windows, it used to a while back, not sure whether it still works now
<Yorlik> OK.
<Yorlik> Thanks for the info!
<Yorlik> So you're wrapping jemalloc into a C++ allocator?
<hkaiser> yes
<hkaiser> on windows
<hkaiser> on linux, both jemalloc and tcmalloc just replace the system allocator
<Yorlik> IC.
<hkaiser> Yorlik: there is also mimalloc which is supposed to be even faster, we support it but I have not actually tried it
<Yorlik> So three malloc replacements to try.
<hkaiser> I think on windows mimalloc replaces the system allocator, so it might be the easiest to use for you
<Yorlik> Makes sense, though in the long run we'll use Linux
<Yorlik> I just like working with visual studio
<hkaiser> sure, same here
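A minimal sketch of what wrapping jemalloc into a C++ allocator can look like; this is a hypothetical type for illustration, not HPX's actual allocator, and it assumes a jemalloc build that exports the je_-prefixed API:

    #include <cstddef>
    #include <new>
    #include <jemalloc/jemalloc.h>   // assumes jemalloc's installed header

    // Minimal std-compatible allocator that routes all requests to jemalloc.
    template <typename T>
    struct je_allocator
    {
        using value_type = T;

        je_allocator() = default;
        template <typename U>
        je_allocator(je_allocator<U> const&) noexcept {}

        T* allocate(std::size_t n)
        {
            if (void* p = je_malloc(n * sizeof(T)))
                return static_cast<T*>(p);
            throw std::bad_alloc();
        }

        void deallocate(T* p, std::size_t) noexcept
        {
            je_free(p);
        }
    };

    template <typename T, typename U>
    bool operator==(je_allocator<T> const&, je_allocator<U> const&) noexcept { return true; }
    template <typename T, typename U>
    bool operator!=(je_allocator<T> const&, je_allocator<U> const&) noexcept { return false; }

    // Usage: std::vector<int, je_allocator<int>> v;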
<Yorlik> Since HPX is a dll - will it use it if I link my main program against mimalloc?
<hkaiser> Yorlik: you might want to enable it on hpx, then it should be automatically used by your executable as well
<Yorlik> OK - I'll look up the switches.
<hkaiser> -DHPX_WITH_ALLOCATOR=mimalloc
<Yorlik> Thanks a ton !
<hkaiser> Yorlik: sorry, it's HPX_WITH_MALLOC=mimalloc
<Yorlik> OK
<hkaiser> pls let me know how it works, never used it
<Yorlik> How do I point to the library / header
<Yorlik> Or do I just place the dll in the path?
<hkaiser> Yorlik: I think they have cmake support
<Yorlik> Still working on it - I'll figure it out
<Yorlik> Something interfered.
<Yorlik> Need to continue a bit later
<hkaiser> Yorlik: we just do a find_package(mimalloc), so you can probably use the standard variables, like MIMALLOC_DIR to point to the cmake config files
<Yorlik> Yes.
<Yorlik> It's already built - but I can't go further right now.
wate123 has joined #ste||ar
wate123 is now known as wate123_Jun
<Yorlik> hkaiser: Finished the first compile with mimalloc, doing the others now
<Yorlik> I had to install it with cmake to get everything right
wate123_Jun has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Remote host closed the connection]
wate123__ has joined #ste||ar
bita has quit [Quit: Leaving]
<zao> Yorlik: Symbols in executables or dynamic libraries only satisfy lookups explicitly made against those modules or indirectly by redirection to those modules.
<zao> There's none of the symbol soup that you get on Linux and other libdl-like systems where symbols can be overridden from heaven knows where.
<Yorlik> IC. So it'll hopefully work - recompiling the server - all three builds worked.
<zao> There might be hooks in the CRT or suchlike that can be leveraged to impact DLLs using the same CRT, but it's likely that you need to consider it on a per-module basis.
akheir1_ has quit [Read error: Connection reset by peer]
akheir1_ has joined #ste||ar
<hkaiser> zao: they runtime patch the standard allocator
<hkaiser> so having it linked to one module affects all
<hkaiser> kinda like weak symbols on linux
<zao> Allocator as in the C++ one from the CRT?
<hkaiser> yes
<zao> And not HeapAlloc and friends?
<hkaiser> and malloc/free
<Yorlik> The CMake test compile it always does breaks
<Yorlik> @SET "PATH=%PATH%;
<Yorlik> Woops
<Yorlik> 1> [CMake] LINK : error LNK2001: unresolved external symbol mi_version
<hkaiser> it misses the library then
<Yorlik> mimalloc gets found and everything and is in the path
<hkaiser> as I said, I never tried it...
<Yorlik> I'll figure it out
<hkaiser> is the library on the command line?
<hkaiser> Yorlik: I can try tomorrow, too tired now
<Yorlik> no
<Yorlik> NP
<Yorlik> I might have overlooked something. But the mimalloc_DIR is set and the path too.
<Yorlik> I think I need to add it as library to my app and forgot that
<Yorlik> It's only in HPX
<hkaiser> that's fine
<hkaiser> does hpx build?
<Yorlik> Yes it did
<Yorlik> I had to install mimalloc with cmake directly
<hkaiser> then your app does not need to do anything
<Yorlik> It didn't work from MSVC
<Yorlik> Somehow it wants mimalloc
<hkaiser> does it add mi_version to the linker explicitly?
<hkaiser> then you'll have to link with the library
<hkaiser> probably doesn't hurt to link it to the app as well
<Yorlik> It breaks when generating the cache
<hkaiser> hmm
<hkaiser> hpx re-exports mimalloc to its apps if you use HPX::hpx as a dependency
<Yorlik> I am using the latest mimalloc - maybe they changed the symbol
<hkaiser> nah
<hkaiser> mi_version is imported by hpx and it finds it
<Yorlik> Do I need to add mimalloc in the Component_dependencies?
<hkaiser> you may want to add it to your app as a dependency
<Yorlik> in the link libraries or in the hpx setup?
<hkaiser> your app
<Yorlik> You mean the header?
<hkaiser> no your cmake
<Yorlik> Or just link only?
<hkaiser> where you build your application add_executable or similar
shahrzad has joined #ste||ar
<Yorlik> I have it in my TARGET_LINK_LIBRARIES
<hkaiser> no idea
<hkaiser> anyways, I'm off - will try tomorrow
<Yorlik> OK
<hkaiser> find out what module fails linking and add it as a dependency to that
<Yorlik> G'Night!
<Yorlik> OK
<Yorlik> It's the CMake test compile - lol
<hkaiser> whatever
<hkaiser> if you use a symbol you need to have it as a target_link_library dependency
<Yorlik> It's not my program. It's the cmake test compile done every time that fails
<Yorlik> Tomorrow ...
<hkaiser> Yorlik: well cmake uses your setting to test compile things
<hkaiser> doesn't it?
<Yorlik> It should.
<hkaiser> ok
<Yorlik> Find package doesn't fail
<hkaiser> cheers for now
<Yorlik> Good Night !
hkaiser has quit [Quit: bye]
nikunj has quit [Read error: Connection reset by peer]
nikunj has joined #ste||ar
wate123__ has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
wate123__ has joined #ste||ar
weilewei has quit [Remote host closed the connection]
wate123_Jun has quit [Ping timeout: 256 seconds]
wate123__ has quit [Ping timeout: 240 seconds]
akheir1_ has quit [Quit: Leaving]
wate123_Jun has joined #ste||ar
diehlpk_work has quit [Remote host closed the connection]
shahrzad has quit [Quit: Leaving]
wate123_Jun has quit [Ping timeout: 256 seconds]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
<simbergm> Yorlik: can you comment on 4462 again with the details of the tests that fail for you? I'll have a look at it later
<simbergm> all the error messages etc, does it fail to link, build, run, and so on
ibalampanis has joined #ste||ar
ibalampanis has quit [Remote host closed the connection]
<simbergm> Yorlik, actually, I think I know what's wrong
<simbergm> I meant to update one of the tests to test that, will check later if it's that
<simbergm> hpx_main.hpp is likely not installed anymore
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
<simbergm> Yorlik: should be fixed now
vip3r has joined #ste||ar
vip3r is now known as kale
kale has quit [Client Quit]
nikunj97 has joined #ste||ar
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
ibalampanis has joined #ste||ar
<ibalampanis> Have a good day, everyone!
<rori> ibalampanis: thanks, you too
<rori> zao: your pull request fetching command is amazing :D
<rori> Yorlik: I added a cmake script for mimalloc support:
<rori> let me know if it is working for you
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
ibalampanis has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
ibalampanis has joined #ste||ar
nikunj97 has quit [Read error: Connection reset by peer]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 240 seconds]
<Yorlik> rori: Very cool! I'll check it out, though I'll probably use jemalloc instead, since we're going for Linux anyway in production.
<Yorlik> Yesterday I found out when creating a Lua State you can give Lua a custom alloc function, which basically is a thin wrapper around realloc.
<Yorlik> I might just be able to give Lua jemalloc without modifying it. :)
hkaiser has joined #ste||ar
<Yorlik> Heyo jkaiser!
<Yorlik> Woops hkaiser ^^
<hkaiser> hey Yorlik
<hkaiser> g'morning
<Yorlik> So yesterday:
<Yorlik> Yeah - morning ! lol :)
<Yorlik> I compiled jemalloc and used it - it worked.
<Yorlik> And:
<Yorlik> I found out I can use Lua with a customizable alloc function by nature
<hkaiser> thought so
<Yorlik> The function Lua expects is just a thin wrapper around realloc
<Yorlik> So I can basically create my Lua States on a per state basis with whatever allocator I want.
<hkaiser> perfect
<hkaiser> jemalloc it is, then
<Yorlik> jemalloc worked like a charm, but I've no results yet - need to fix and clean up some other things.
<Yorlik> But that's my task for today: Custom allocation.
<hkaiser> shouldn't be too hard, should it?
<Yorlik> So many reasons to love Lua - now there's just another one :)
<Yorlik> It'll work, I'm pretty sure
<Yorlik> hkaiser: this is the function one wants to mimic: https://github.com/lua/lua/blob/master/lauxlib.c#L986
<Yorlik> It's probably as trivial as it looks.
<hkaiser> sure, jemalloc has both, je_free and je_realloc
<Yorlik> Yup.
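A minimal sketch of that idea, assuming the je_-prefixed jemalloc API just mentioned and Lua's standard lua_Alloc hook; the function mirrors l_alloc from lauxlib.c:

    #include <cstddef>
    #include <jemalloc/jemalloc.h>

    extern "C" {
    #include <lua.h>
    }

    // Same contract as Lua's default allocator: free when nsize == 0, realloc otherwise.
    static void* lua_jemalloc(void* /*ud*/, void* ptr, size_t /*osize*/, size_t nsize)
    {
        if (nsize == 0)
        {
            je_free(ptr);
            return nullptr;
        }
        return je_realloc(ptr, nsize);
    }

    // Create each Lua state with the custom allocator instead of luaL_newstate().
    lua_State* make_lua_state()
    {
        return lua_newstate(lua_jemalloc, nullptr);
    }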
Abhishek09 has joined #ste||ar
<K-ballo> getting build failures due to missing hwloc includes again
<Yorlik> K-ballo: Do you know of that fix yesterday?
<K-ballo> Yorlik: which yesterday fix?
<Yorlik> The hwloc issue
<Yorlik> It's a missing line in the CMakeLists for hpx_init
<Yorlik> Lemme dig it up
<Yorlik> It's in 4462, but you need only 1 line
<ibalampanis> hkaiser I have submitted my proposal as a draft on the official GSoC website
<hkaiser> ok, cool
<ibalampanis> Also, I have informed Bita
<Yorlik> K-ballo: Try adding a `target_link_libraries(hpx_init PUBLIC hpx)` around here? https://github.com/STEllAR-GROUP/hpx/blob/master/src/CMakeLists.txt#L386
<hkaiser> just use #4462
<hkaiser> I'll merge it later today anyways
<Yorlik> K-ballo ^^
<K-ballo> i'll wait
<Abhishek09> hkaiser: How many slots generally ste||ar allocated for GSoC?
<ibalampanis> hkaiser by the end of the weekend I will update the proposal to include the link to the GitHub repo of the MM with HPX
<hkaiser> 5-7
<ibalampanis> Now I'm working on this, hkaiser
<hkaiser> sure, good luck!
<ibalampanis> Thank you for all your support!
<Abhishek09> hkaiser: Does selection depend only on the quality of the proposal in Ste||ar?
<hkaiser> yes
<ibalampanis> hkaiser Could I make a question to you?
<hkaiser> quality of proposal, general activity of the student, what is the quality of the example code we ask to write, etc.
<hkaiser> ibalampanis: you already did ;-)
<hkaiser> sure go ahead
<ibalampanis> Why don't you use Slack for chat? Many organizations use it as their medium.
<Abhishek09> example code means? hkaiser
<hkaiser> ibalampanis: historic reasons - is there a big difference?
<ibalampanis> No, no, just a question!
<ibalampanis> Thanks!
<hkaiser> Abhishek09: we ask our students to implement a small example matrix multiplication using HPX to get an idea of what you know
<hkaiser> ibalampanis: ;-)
<Abhishek09> hkaiser: Why don't you use Gitter rather than IRC? Many orgs have started using it
<hkaiser> ibalampanis: one of the main reasons is that slack does not provide us with a full history of the conversations
<Yorlik> hkaiser: Can HPX executors be combined, like proposed in the isocpp proposal? I'd like to try out jbjnr's limiting_executor. Alternatively I'd add its functionality to your hooking thing.
<Abhishek09> gitter is far better than slack
<hkaiser> Abhishek09: there are many options, we have not been able to agree on something else than irc so far - this channel here exists for more than 10 years, people got used to it
<heller1> Also, slack is acting under us regulations and bans certain foreign nationals
<hkaiser> Yorlik: our executors are not conforming to p0443 at this point, they represent an older version of it from 3-4 years ago
<heller1> Simply put: it's a for profit organization where you are the product, not the customer
<Yorlik> We just need to rewrite Discord and use HPX under the hood ;)
<hkaiser> Yorlik: however, I showed you in the example how you can create wrappers that can rely on other executors and add/remove things
wate123_Jun has joined #ste||ar
<Yorlik> Yes - I'll look into it.
<Yorlik> Gotta study the code better now.
<ibalampanis> Yorlik Are you interested in gsoc?
<Yorlik> I'm not a student
<ibalampanis> Ok! :D
<ibalampanis> :) *
<Yorlik> We're a group of hobbyists writing a gameserver with HPX.
<ibalampanis> Wow! After gsoc I would like to contribute on it
<Yorlik> Sure - contact me. But we're not open source.
<Abhishek09> zao nikunj Hi
<Yorlik> But we use Lua !!
<ibalampanis> @y
<ibalampanis> Yorlik How come you are not?
<hkaiser> Yorlik: I still hope to get a free license for your game, though ;-)
<Yorlik> There are several reasons. It's a long winded discussion I'd avoid for today.
<ibalampanis> Ok! Understood!
wate123_Jun has quit [Ping timeout: 256 seconds]
<Yorlik> We do not really have commercial ambitions, we might monetize just to cover the costs. E.g. an option could be to make a dual license later, but giving out server code gives a great advantage to hackers, so we'd do that rather late in the process if ever.
<Yorlik> At the moment it's a small puny exercise made mostly by me anyways.
<Yorlik> With some grains of awesomeness ;)
<ibalampanis> Sure, I hadn't thought about the hacking aspect
<ibalampanis> Hah, you're great!
<Yorlik> MMO are always under massive attack from hackers.
<Yorlik> LOL - No - just a little crazy :)
<ibalampanis> Crazy is more suitable word than great. I admit it
<ibalampanis> ;)
<Yorlik> You need to be a little crazy to do what we do.
<Yorlik> When a hobbyist starts saying "We want to write an MMO", the reaction is almost always negative.
<ibalampanis> Could you give me your email or a social media account in order not to spam here?
<Yorlik> For reasons: It's quite a task. But with libraries like HPX and Lua it's much more feasible than it used to be.
<Yorlik> Sure: mckillroy lives at gmail with the dot of com.
<ibalampanis> hah thanks
<Yorlik> :)
<ibalampanis> I just sent you an ack message Yorlik
<Yorlik> It arrived
<ibalampanis> It's ok!
Hashmi has joined #ste||ar
<ibalampanis> hkaiser Are you in the USA? I'm asking to keep the local time difference in mind
<hkaiser> yes, I'm in the US central time zone
<ibalampanis> What about Bita? If you know.
<hkaiser> same
<ibalampanis> So, your local time is 9:04 (24hr)
<Yorlik> hkaiser: Is the default executor essentially the interface description when writing an executor? Or is there a more overview writeup on the concept somewhere?
<Yorlik> I see there is a bit in examples though.
wate123_Jun has joined #ste||ar
<ibalampanis> hkaiser Is it a problem if my repo is a JetBrains CLion project? This IDE has CMake as an integrated tool and the code can be executed via a button.
nikunj97 has joined #ste||ar
<Yorlik> ibalampanis: Last time I looked at CLion, their CMake only supported make generation and not ninja or MSBuild. Not sure if it's still like that.
<ibalampanis> I don't know what you say. What do you mean generation?
<Yorlik> CMake is not a build system, but a generator for various build systems, like make, MSBuild or Ninja.
<Yorlik> IIRC CLion does not support all of them, just makefile generation.
<Yorlik> That might be outdated info. But if it still is true and you want something else from CMake than makefiles, you might run into issues with CLion.
<ibalampanis> Yeah, I understand now. I don't know if now supports something else than make
<Yorlik> I just thought I'd tell you for consideration.
Pranavug has joined #ste||ar
<ibalampanis> Do you believe that for this project (https://github.com/STEllAR-GROUP/hpx/wiki/Google-Summer-of-Code-%28GSoC%29-2020#test-framework-for-phylanx-algorithms) I will face issues on CLion?
<Yorlik> I'm not qualified to answer this.
<ibalampanis> Hah, it's ok
* Yorlik is just a hobbyist with some half-baked semi-knowledge which ~just works enough :)
Pranavug has quit [Client Quit]
<ibalampanis> Sorry if I asked a bad question
<Yorlik> I think it's a good question. You wanna use the tools that work for your project.
<ibalampanis> It's true
<ibalampanis> Yorlik: Which tools/IDEs do you suggest?
<Yorlik> I can only say what I use - but what's best for you might be totally different. I work mostly on Windows and am using MSVC Community edition. On Linux I use VSCode
<ibalampanis> Ok, VSCode is a safe pick. Works with everything!
<Yorlik> It's a good editor. And it works nicely with CMake and testing frameworks.
<Yorlik> You can also use intellisense with it
<ibalampanis> Yeah, have it in my mind
<ibalampanis> Thanks!
<Yorlik> VSCode also allows you to work remotely via SSH
<ibalampanis> Yeah I knew it! Thanks for your time! Time to go out for duties! Cheers!
<Yorlik> It has a remote server - so that's a pretty slick solution, though I heard you shouldn't use it over the internet because of security issues. But locally it's a nice thing.
ibalampanis has quit [Remote host closed the connection]
shahrzad has joined #ste||ar
<zao> Yorlik: vscode’s ssh remote now only listens on localhost on the remote, so it’s safe on singleuser machines or where you trust the other users
<Yorlik> Ah OK - so they fixed it. Thanks for the heads-up.
<zao> I have yet to get any response on whether it has any token auth or not, idiots seem to be reluctant to just say.
<Yorlik> lol
<zao> Btw, saw this talk on server instrumentation yesterday - https://youtu.be/r6Ex29gzqgc
<zao> Might be of interest to you
<Yorlik> Instrumentation will soon be a thing for us, when we have Milestone 1 and go into a polishing phase.
<Yorlik> jbjnr: I just stole three lines of code from your limiting executor and put them into the start hook of the executor I'm using - I already had task counting implemented. And it seems to work nicely.
<Yorlik> Essentially I needed just this:
<Yorlik> if ( ( ++task_counter<I> ) > upper_threshold_ ) {
<Yorlik> hpx::util::yield_while( [&]( ) { return ( task_counter<I> > lower_threshold_ ); } );
<Yorlik> }
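A self-contained sketch of the counting idea in those three lines; the hook names, counter, and thresholds are assumptions for illustration, and only hpx::util::yield_while is taken from HPX (reached here through the umbrella header):

    #include <atomic>
    #include <cstdint>
    #include <hpx/hpx.hpp>   // umbrella header, assumed to provide hpx::util::yield_while

    std::atomic<std::int64_t> task_counter{0};

    constexpr std::int64_t upper_threshold = 512;   // assumed tuning values
    constexpr std::int64_t lower_threshold = 256;

    // Start hook: if too many tasks are in flight, yield cooperatively until the
    // count has drained below the lower threshold.
    void on_task_start()
    {
        if (++task_counter > upper_threshold)
        {
            hpx::util::yield_while([] { return task_counter > lower_threshold; });
        }
    }

    // Stop hook: a task finished, release one slot.
    void on_task_stop()
    {
        --task_counter;
    }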
<nikunj97> hkaiser, the only way to calculate grain size that I know of is putting up a high resolution clock and measuring the task execution time. Is there any other way? I hear APEX is used for performance measurement in HPX. Would that help?
<hkaiser> nikunj97: yes, that's what the auto-chunker does
<Yorlik> hkaiser: I just put the above three-liner in the start() lambda of your hooked executor and it seems to work nicely
<hkaiser> right
<hkaiser> as expected
<Yorlik> Working on jemalloc now :) Can't wait to see the effects.
<hkaiser> shahrzad: you need to use Boost.Context on the Pi
<hkaiser> shahrzad: -DHPX_WITH_GENERIC_CONTEXT_COROUTINES=On
<hkaiser> Yorlik: you will see an effect for sure
<nikunj97> hkaiser, is there any documentation on auto-chunker?
<shahrzad> @hkaiser OK,thanks!
<hkaiser> nikunj97: what is 'documentation'? ;-)
<nikunj97> lol. We do really need a blog after all
<nikunj97> I'll have a look at it, thanks@
<hkaiser> nikunj97: I can give you access to the blogs
<nikunj97> which ones?
<nikunj97> ohh wait, we have blogs?
<nikunj97> do you mean the one hosted at cct?
<hkaiser> nikunj97: I did a whole blog post series back in 2015 I believe, highlighting all the features
<Yorlik> A quick and dirty excerpt from my use of the auto chunker
<hkaiser> nikunj97: there is also a new website (just created a while back): hpx.stellar-group.org
<hkaiser> we will use it for all things HPX in the future
<nikunj97> Yorlik, thanks!
<nikunj97> hkaiser, aah the one you were talking about the other day
<nikunj97> I'll go through the blogs. Thanks
<Yorlik> nikunj: Made a cleanup of the gist - it's shorter now and clearer.
<hkaiser> nikunj97: I'd be more than happy to give you access if you'd like to write blog posts
<nikunj97> hkaiser, I'm making myself familiar with the functionalities that I've rarely/never used
<nikunj97> I am sure to write one post on it. HPX needs more publicity and a bigger user base
<hkaiser> perfect opportunity to document things as you go
<hkaiser> ;-)
<nikunj97> hkaiser, yes, I'm already writing smallest code snippet for the things I'm learning
<hkaiser> blog posts can be short, no reason to write a novel
<nikunj97> I believe it's always better to start with the easiest code you can write. And then you can show its usage in a real application
<hkaiser> simple snippets are enough
<nikunj97> I'll be writing these blogs over the summer
<nikunj97> coz I don't think my internship is happening lol
<hkaiser> nikunj97: yah, there is that
wate123_Jun has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 256 seconds]
Abhishek09 has quit [Remote host closed the connection]
shahrzad has quit [Ping timeout: 240 seconds]
wate123_Jun has joined #ste||ar
<Yorlik> hkaiser: since HPX is using jemalloc - would I have to link my app again against jemalloc, or can I just use the symbols and only include the header where needed?
wate123_Jun has quit [Ping timeout: 240 seconds]
Hashmi has quit [Quit: Connection closed for inactivity]
nikunj97 has quit [Read error: Connection reset by peer]
nikunj97 has joined #ste||ar
wate123_Jun has joined #ste||ar
shahrzad has joined #ste||ar
ibalampanis has joined #ste||ar
<hkaiser> Yorlik: if you reference the symbols from your code you need to link against it, HPX might however re-export the library so that this might not require any actions on your side
<hkaiser> I think the allocator is target_link_library'd to hpx publicly
<hkaiser> so you should be fine
<Yorlik> It's just working - just started a test :)
<hkaiser> ok
<Yorlik> Because of the task limiter I now simply let the lua states explode as needed, but I delete above a threshold on return to the pool
<Yorlik> It looks like my test runs ~40-60% faster - I'll let it run a bit
<Yorlik> :D
<Yorlik> Big Win
<Yorlik> From ~40000 messages /sec to ~60000
<Yorlik> With 10000 calls into Lua
<Yorlik> 100 messages per call
<hkaiser> good
<Yorlik> That's much more than we ever had with the old crappy project.
<hkaiser> :D
<hkaiser> HPX for the win!
shahrzad has quit [Ping timeout: 240 seconds]
<Yorlik> I think I can now focus on a demo game and finalizing the default events and stuff.
<hkaiser> that's on a 4 core machine?
<Yorlik> Yes
<Yorlik> 4790k
<Yorlik> So - on a decent modern server this would surely be much better
<Yorlik> OFC, the Lua load will be more.
<Yorlik> Still I'm happy with this result for now.
<hkaiser> Yorlik: we should give you access to our test cluster, there you could do better benchmarks
<Yorlik> That would be cool
<Yorlik> But I need to go finish the milestone first
<Yorlik> I want a basic fox-rabbit-grass population dynamics testcase
<hkaiser> Yorlik: talk to akheir here (once he's back), he manages the cluster
<Yorlik> And we are not yet distributed
<Yorlik> I need to get the location and load balancing system done for that
<Yorlik> The spatial partitioning
<hkaiser> no idea what a fox-rabbit-grass population is ;-)
<zao> A single node of like 14-28 cores still tends to give some insights, particularly around NUMA junk.
<Yorlik> Simple
<Yorlik> Grass Grows
<Yorlik> Rabbits eat grass
<Yorlik> Foxes eat rabbits
<Yorlik> Chaotic numbers in the subpopulations
<hkaiser> ok
<Yorlik> It's an easy way to create many objects.
<Yorlik> I'll make a very simplistic AI for that
<ibalampanis> @hk
<ibalampanis> hkaiser: Do you know if Bita is taking the day off today?
<hkaiser> Yorlik: interesting
<hkaiser> ibalampanis: it's weekend
<ibalampanis> Such a good note! Thanks!
<ibalampanis> hkaiser: Why aren't you taking the day off, since it's the weekend?
<hkaiser> ibalampanis: I'm just lurking here ;-)
<ibalampanis> Hahah it's ok!
<ibalampanis> In order to double check, is your local time 12:35 ? (24hr)
<hkaiser> Yorlik: I merged #4462 just now
<hkaiser> ibalampanis: yes
<Yorlik> OK. stable it is then :)
<ibalampanis> Thanks!
<zao> ibalampanis: Local time in New Orleans should be accurate.
<Yorlik> I'll wait until the tag is there.
<hkaiser> Yorlik: you need to wait for the CI to cycle for stable to be updated to this
<Yorlik> Yup
<Yorlik> I have a working build after all.
<ibalampanis> zao: Thank you
<Yorlik> I think I'll use commit hashes in the future, so I have pinned stables
<hkaiser> Yorlik: I'd prefer you using the stable tag - nice way for us to discover problems ;-)
* Yorlik just joined the CI union
<nikunj97> zao, is a swap storage really important?
<zao> nikunj97: How elaborate of an explanation do you want? :)
<nikunj97> I've got 16gb ram
<nikunj97> and I'm always on 12-13gb
<nikunj97> it's when I'm not running any compilation/linking stuff
<nikunj97> it's with chrome, vs code, hexchat, spotify and a few other applications open
<nikunj97> like discord or slack
<zao> So your OS has a virtual memory system, yeah? Memory is split up into pages (4 KiB typically) and is backed either by physical RAM, physical storage, or just requested but not committed yet, having no backing storage.
<zao> Memory for things like loaded libraries or memory mapped files are backed by files on your disk, and can as long as it's not modified be dropped from memory and re-read from the files when needed.
K-ballo has quit [Remote host closed the connection]
K-ballo has joined #ste||ar
<zao> Memory for data allocated from a process or modified pages cannot be spilled to disk if there's no backing storage, it's stuck in RAM.
<zao> That is, unless you have a swap file/partition. It serves as an off-load area for less used pages from RAM, which the OS attempts to have somewhat pre-populated in case there's a burst in memory usage and it needs to evict something.
<zao> It helps free up and compact the contents of physical RAM that may have been spuriously loaded or not used at all.
<nikunj97> I see
<nikunj97> I should add one. Should help relieve the stress on memory
shahrzad has joined #ste||ar
<hkaiser> Yorlik: I approve
<Yorlik> hkaiser: Erm .. what?
<hkaiser> Yorlik: you joining the CI union ;-)
<Yorlik> :) lol
<Yorlik> They told me to take a nap to keep my overwhelming beauty - which I will do now - BBL :)
<nikunj97> heller1, why are my stream results looking way too different than the other day executing the same binary?
<nikunj97> I'm getting about 70-80GB/s bandwidth on x86 node all of a sudden
<hkaiser> nikunj97: different phase of moon ;-)
Abhishek09 has joined #ste||ar
<nikunj97> lol, it can make such a difference?
<hkaiser> sure!
<hkaiser> didn't you know?
<nikunj97> it doubled!
<nikunj97> not like 10% or 20%
<hkaiser> so you changed something
<nikunj97> all my calculations are no good atm with these new results
<nikunj97> I didn't change anything. I executed an already compiled file that I used previously to record exactly the same thing
<hkaiser> yaya
<hkaiser> something else was going on on your machine when you tried before?
<nikunj97> it was executed on a node allocated by slurm
<hkaiser> rostam?
<nikunj97> no the cluster is at jsc
<hkaiser> exactly the same node?
<nikunj97> not sure about that
<hkaiser> see
<nikunj97> but they're all same x86
<hkaiser> you had that effect before when you were with us, remember?
<hkaiser> *sure*
wate123__ has joined #ste||ar
<nikunj97> yes, I remember the effect
<hkaiser> even on rostam equivalent the nodes are different
<nikunj97> so I choose a single node to record and benchmark everything
<nikunj97> btw which one will have higher memory bandwidth, float or double?
<nikunj97> peak memory bandwidth i.e.
wate123_Jun has quit [Ping timeout: 240 seconds]
<heller1> The data type is irrelevant
<nikunj97> heller1, 60GB/s with float and 80GB/s
<nikunj97> with double
<nikunj97> that's why I was wondering what gives
<nikunj97> on arm
<heller1> Well, you might need to increase the array size
<heller1> Float is half the size, might mess up the measurements
<nikunj97> aah that makes sense
<heller1> Also, x86 != x86
<nikunj97> I meant they're all Xeon E5 2660 v3
<nikunj97> same frequency as well
<nikunj97> and now arm is only giving 20GB/s for some reason
<nikunj97> it's exactly the same node but giving way less memory bandwidth. What am I doing wrong?
ibalampanis has quit [Remote host closed the connection]
<nikunj97> heller1, just read the array size rule on stream. It's definitely array size issue. Arm has 64MB cache while the array size is 10M
<nikunj97> L3 cache ^^
<heller1> See
<nikunj97> I should read things more carefully :/
<nikunj97> I changed the array size to 128M so I should see consistent rates both on hisilicon and e5
<nikunj97> heller1, yup, very consistent now. I don't see difference in float and double as well
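Rough arithmetic behind that fix, using the numbers from this exchange (a 64 MB L3, 10M vs. 128M elements) together with STREAM's usual guidance that each array be several times larger than the last-level cache; a standalone sketch, not part of the benchmark itself:

    #include <cstddef>
    #include <cstdio>

    int main()
    {
        constexpr double mib = 1024.0 * 1024.0;
        constexpr std::size_t llc_bytes = 64ull * 1024 * 1024;   // ~64 MB L3 on the ARM node
        constexpr std::size_t old_elems = 10000000;              // original STREAM array size
        constexpr std::size_t new_elems = 128000000;             // increased array size

        std::printf("LLC:        %6.1f MiB\n", llc_bytes / mib);
        std::printf("old arrays: %6.1f MiB per double array (floats fit mostly in cache)\n",
            old_elems * sizeof(double) / mib);
        std::printf("new arrays: %6.1f MiB per double array (well past 4x the LLC)\n",
            new_elems * sizeof(double) / mib);
    }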
nk__ has joined #ste||ar
nikunj97 has quit [Ping timeout: 240 seconds]
weilewei has joined #ste||ar
shahrzad has quit [Ping timeout: 256 seconds]
<weilewei> May I ask, what is the status of concurrent data structure support in GSoC project? I am interested in participating as a mentor (and learning as well)
<weilewei> Will we aim at implementing concurrent_unordered_set?
wate123__ has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
wate123_Jun has quit [Ping timeout: 256 seconds]
nk__ has quit [Read error: Connection reset by peer]
nikunj97 has joined #ste||ar
<nikunj97> Abhishek09, hey
<nikunj97> just saw your text from afternoon
<nikunj97> what is it that you want to talk about?
<Abhishek09> nikunj97: Does the installation of phylanx require the library files to work properly, or is it fine with just the devel and header files?
<zao> Abhishek09: Hi there, did you want anything particular this morning? I saw you highlighted me but didn't say what it was about.
<zao> (the above, I guess)
<Abhishek09> zao: i have lost that thing
<nikunj97> Abhishek09, I didn't get you
<zao> If you're talking about which one of `hpx` and `hpx-devel` you need from the OS, an answer is that `hpx-devel` depends on `hpx`.
<zao> So you're going to either have `hpx` or `hpx` + `hpx-devel` installed if you're working with OS packages.
<zao> This is customary.
<zao> A development package contains an additional set of files on top of the base set of files in the regular package.
<zao> Both packages are needed to form a development environment.
<zao> Is this what you wondered about? :D
<Abhishek09> zao: Yes, that means hpx and hpx-devel are both mandatory for phylanx
<Abhishek09> installation
<Abhishek09> ?
<Abhishek09> hpx devel depends on hpx
<zao> You could reason it out from what you know that the Phylanx build needs too.
<zao> It builds a Python package with a native extension.
<zao> The native extension when built needs the headers to compile and the library to link.
<Abhishek09> As I have built phylanx by installing hpx from source, not dnf
<zao> The only case you can get away with not having libraries installed is when you have a dependency that doesn't _have_ libraries, for example header-only dependencies like Eigen and Blaze
<zao> Great.
<Abhishek09> That means I have to ensure that all deps work fine (library + built files + headers) before building phylanx zao
<zao> I've found in the past that as long as the install step has worked, the dependencies are usable.
<nikunj97> hkaiser, so auto_chunk_size essentially makes sure that parallel_for_loop creates tasks such that their grain size is the time specified to auto_chunk_size?
<nikunj97> "This executor parameters type makes sure that as many loop iterations are combined as necessary to run for the amount of time specified." - Just making sure if I got this right
<Abhishek09> zao: Soon I will draft a proposal. I will build everything (lib + headers + built files) from source. Any tips you want to give me?
<Abhishek09> nikunj97 ^
<nikunj97> nope, looks good
<nikunj97> you'll have to build most of things from source
<zao> Abhishek09: I think I've said this in the past, but make a habit of installing things into a non-system directory, so that it's easy to remove and control.
<zao> Also take good notes on what commands you run so it's reproducible.
wate123_Jun has joined #ste||ar
<nikunj97> Yorlik, ^^
<Yorlik> Ya?
<nikunj97> about the auto_chunk_size thing
<Abhishek09> zao: Are you also participating in GSoC as a student this year?
<nikunj97> I said before, is it correct?
<zao> Abhishek09: Nope, I've never done GSoC. I'm a professional sysadmin and application expert at a HPC site.
<zao> My primary job is to build and install software for researchers :D
<Yorlik> nikunj: The auto chunker does measurements. I think you pay like 10% for these IIRC what hkaiser said. He knows better.
<nikunj97> aah so I invoke get_chunk_size to know what the grain size was?
<Yorlik> I never used that function, but probably yes
<nikunj97> wait if auto chunker does measurements, there should be a way to report it back right?
<nikunj97> I'm asking that
<nikunj97> so you use executor.with( auto_chunk_size( 2000us ) ), what does it actually do?
<Yorlik> The auto chunker attempts to size your chunks such that they run 2000us
<Yorlik> So if one iteration takes 1us it would do 2000 iterations per chunk
<nikunj97> but with 10% overheads
<Yorlik> IIRC yes. hkaiser should tell exactly.
<nikunj97> ok no, this isn't what I wanted. I want to time the grain size
<Yorlik> You can use an executor, which has a start and a stop function
<nikunj97> not make the runtime change iterations to make it that grain size
<Yorlik> So you can put your measurement hooks there
<nikunj97> you mean performance counter?
<Yorlik> Yes
<Yorlik> However you want to measure
<nikunj97> yea, that's what I think I'll do. Thanks!
<Yorlik> hkaiser recently made this example I posted. I'm actually using it
<nikunj97> aah! so you were asking about the other day?
<Yorlik> Yes
<Yorlik> I needed it for a different purpose
<Yorlik> Lemme make a snippet for you
<nikunj97> this doesn't look like performance counter to me
<Yorlik> It's the executor which has the start and stop hook
<nikunj97> yeah, basically add your stuff on start and stop
<Yorlik> This is a function from my codebase using it: https://gist.github.com/McKillroy/f81277abc7685832f785c73decfeeb20
<Yorlik> You can put whatever you want into the start and stop lambdas
<Yorlik> You put the executor in the parloop and give it the start and stop lambdas where you can place your measurement or perfcounters.
<Yorlik> I use it to limit task creation to a max
<Yorlik> It gives you control over a chunk.
<Yorlik> And in the loop you can use the auto chunker or a static chunker
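A minimal sketch of how the two chunkers plug into a parallel loop, assuming the HPX 1.4-era names hpx::parallel::execution::auto_chunk_size / static_chunk_size and the umbrella header; the loop body is only a placeholder:

    #include <chrono>
    #include <cstddef>
    #include <vector>
    #include <hpx/hpx.hpp>

    void run(std::vector<double>& data)
    {
        using namespace hpx::parallel::execution;

        // auto_chunk_size: measures a fraction of the iterations and combines as many
        // iterations per chunk as needed so each chunk runs for roughly 2000us.
        hpx::parallel::for_loop(
            par.with(auto_chunk_size(std::chrono::microseconds(2000))),
            std::size_t(0), data.size(),
            [&](std::size_t i) { data[i] *= 2.0; });

        // static_chunk_size: a fixed number of iterations per chunk, no measurement overhead.
        hpx::parallel::for_loop(
            par.with(static_chunk_size(1000)),
            std::size_t(0), data.size(),
            [&](std::size_t i) { data[i] *= 2.0; });
    }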
<nikunj97> got it
<nikunj97> it's again not what I was looking for btw. But it's a nice idea
<Yorlik> Maybe I misunderstood - what exactly do you need?
<nikunj97> so basically I want to measure the time of my execution of a task. I previously used to use high resolution timers to measure the grain size
<Yorlik> Do you want to measure your chunks or do you want to size them?
<nikunj97> like start the timer on invoking the function and calculate the time elapsed in the end
<nikunj97> I want a better way to get the time
<nikunj97> I neither want to measure them nor change the size
<Yorlik> Just measure a chunk and divide by the number?
<nikunj97> but that won't give me a right measure coz they have 10% overheads
<Yorlik> Not the static chunker
<Yorlik> Only the auto chunker
<Yorlik> The static chunker has zero overhead
<Yorlik> (Or almost zero)
<nikunj97> static chunker simply measures the grain size?
<Yorlik> No
<nikunj97> or does it try to do something similar to auto chunker?
<Yorlik> You set the size of a grain
<Yorlik> Its fixed
<nikunj97> aah got it, your snippet
<nikunj97> simply tell the number of iterations you want
<Yorlik> Do you want auto chunking or not?
<Yorlik> Yes
<nikunj97> nope, I do not want auto chunker
<nikunj97> static chunker looks better
<Yorlik> Then use the static one. It's cheap
<nikunj97> I do not want the runtime to optimize things, I'm optimizing according to an underlying hardware ;-)
<Yorlik> So you make measurements and then you know your grain size?
<Yorlik> But you do it only once?
<Yorlik> You could simply do a calibration phase then, before the real heavy duty job starts.
<Yorlik> As long as your item load is constant that should work nicely.
<nikunj97> my item load is constant
<nikunj97> I just want to keep the load at optimal
<Yorlik> So basically you just want to measure your hardware.
<nikunj97> that's why I wanted to measure the grain size
<Yorlik> And adjust to it
<nikunj97> Yorlik, precisely
<Yorlik> static chunker then
<nikunj97> got it!
<Yorlik> Or just run a shorter loop
<nikunj97> what do you mean?
<Yorlik> Like a short calibration loop and then adjust the chunks
<Yorlik> But maybe it's better to have the chunking in - then you have measured all overheads.
<nikunj97> yea
<Yorlik> But that's a detail question.
<Yorlik> Good luck!
<nikunj97> I think I got it, let me try
<nikunj97> thanks for the help!
<Yorlik> Cheers! :)
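One possible shape for the calibration phase suggested above, under the same assumptions as the previous sketch: run the loop once with a fixed chunk size, time it with std::chrono, and divide by the iteration count.

    #include <chrono>
    #include <cstddef>
    #include <cstdio>
    #include <hpx/hpx.hpp>

    // Returns the measured time per iteration (seconds); the kernel is a placeholder.
    double calibrate(std::size_t n, std::size_t chunk)
    {
        using namespace hpx::parallel::execution;

        auto const t0 = std::chrono::steady_clock::now();
        hpx::parallel::for_loop(par.with(static_chunk_size(chunk)),
            std::size_t(0), n, [](std::size_t) { /* stencil kernel goes here */ });
        double const elapsed =
            std::chrono::duration<double>(std::chrono::steady_clock::now() - t0).count();

        std::printf("time per iteration: %g s, per chunk of %zu: %g s\n",
            elapsed / n, chunk, elapsed / n * chunk);
        return elapsed / n;
    }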
<nikunj97> Yorlik, one more thing
<Yorlik> Ya?
<hkaiser> nikunj97: auto-chunker is your friend
<nikunj97> so if a parallel_for loop goes from: 0 100, and you set static_chunk_size=10, you'll have 10 tasks in total, right?
<Yorlik> hkaiser: how large is its overhead again?
<nikunj97> hkaiser, aren't the overheads high though?
<hkaiser> you can do both: measure the best chunksize and then set it (possibly using the static chunker)
<hkaiser> you can also create your own chunker that measures things once and uses the settings for all subsequent uses or somesuch
<hkaiser> Yorlik: overheads of what?
<Yorlik> hkaiser: What's the cost of the autochunker again?
<hkaiser> you tell it
<hkaiser> by default it runs 1% of the iterations to measure
<hkaiser> but you can change that
<Yorlik> Oh I C
<nikunj97> 1% ain't bad
<Yorlik> I wasn't clear about that, just told nikunj97 about it.
<nikunj97> I can use auto chunker then
<hkaiser> nikunj: could be too much
<Yorlik> 1% is 6 minutes in 10 hours :)
<hkaiser> Yorlik: you want to measure it once every now and then and just reuse afterwards
<hkaiser> writing a chunker is trivial, just look at the existing ones
<Yorlik> I will need constant monitoring, since the workloads always change
<Yorlik> I'm happy to use the autochunker
<hkaiser> right, that's what I said
<hkaiser> ok
<Yorlik> nikunj has a different situation. He just wants to measure his hardware to determine the grain size for a constant job
<Yorlik> Mine is horribly dynamic.
<nikunj97> Yorlik, setting auto chunker to find the best grain-size will also do the trick for me
<nikunj97> once I find the best, I'll get rid of the auto chunker to get another 1% speedup
<Yorlik> With 1% it's much smaller than I wrongly remembered
<nikunj97> It's just that I wanted to know a good way where I don't have to time loops and write scripts to know better
<nikunj97> this way I can just run the thing on the chunk size I want
<Yorlik> What kind of job are you running?
<nikunj97> it's a stencil benchmark
<Yorlik> OK
<nikunj97> I want to optimize it
<Yorlik> IC
<nikunj97> to the best I can. If I can show that the cost of HPX's functionality hides in the noise and I get near optimal performance, I can write a paper on it
<nikunj97> which will increase HPX's publicity and help me with my academic journey :D
<Yorlik> Cool :) HPX is a really good piece of tech. You'll have fun :)
<nikunj97> hkaiser, btw Sanjay informally offered my a PhD during the interview
<nikunj97> he was telling me all the PhD deadlines and wanted to know what I wanted to pursue
<nikunj97> *offered me a PhD
<hkaiser> nikunj97: nice
wate123_Jun has quit [Ping timeout: 240 seconds]
karame_ has quit [Remote host closed the connection]
wate123_Jun has joined #ste||ar
<nikunj97> hkaiser, HPX's functionality really does hide in the noise. https://gist.github.com/NK-Nikunj/135c84c72d4ef44a991e200473e777f4
<nikunj97> one of my recent runs
<hkaiser> nice
<nikunj97> I think I can get even closer to the serial versions
<Yorlik> nikunj: You wanna keep the thermal environment constant to really compare ;)
<nikunj97> Yorlik, idk how they handle their clusters. But I believe they're doing their best ;)
* Yorlik is trolling just a wee bit ...
<Yorlik> I'm a bit hyper because jemalloc gave us such a nice boost and everything seems to work today :)
<nikunj97> Must be a really good day!
<Yorlik> Yup.
* Yorlik is coding with Lee "Scratch" Perry music.
weilewei has quit [Ping timeout: 240 seconds]
shahrzad has joined #ste||ar
bita has joined #ste||ar
wate123_Jun has quit [Remote host closed the connection]
shahrzad has quit [Ping timeout: 256 seconds]
bita has quit [Ping timeout: 240 seconds]
wate123_Jun has joined #ste||ar