#ste||ar on 2019-06-16 — irc logs at irclog.cct.lsu.edu

2018-08-26 23:03 hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/

02:07 K-ballo has quit [Quit: K-ballo]

02:51 hkaiser has quit [Quit: bye]

03:45 <lsl88> Hi! Sorry for the time of the day. Am I on the right track if I finish reading the manual, try to see things that could be improved - ie tables that are difficult to follow, more information that should be added to fully understand one chapter and so on- and afterwards read the examples?

05:33 <simbergm> lsl88: yep, that sounds great

05:33 <simbergm> remember that we're not expecting you to fix the whole manual, but of course the more you read the better your overview of the current state will be, so what you're doing is much appreciated

07:43 Amy1 has quit [Ping timeout: 258 seconds]

07:43 Amy1 has joined #ste||ar

10:00 nikunj has joined #ste||ar

10:09 lsl88 has quit [Ping timeout: 245 seconds]

10:42 Coldblackice has quit [Ping timeout: 268 seconds]

11:58 <Yorlik> I just started to eliminate C style casts from my allocators and replace them with C++ style casts. I am ending up with some really ugly code though, like:

11:58 <Yorlik> return reinterpret_cast<void*>( reinterpret_cast<uint64_t>(topPagePtr_) + reinterpret_cast<uint64_t>(pagealloc::COMMIT_ERROR) );

11:59 <Yorlik> Any hints for getting the best of both worlds, like pointer arithmetic and low level void* spammage but also as much type safety as possibnle?

12:00 <Yorlik> I tried to find some good sources for learning and ended up with the C++ guidelines - I wonder if there are any good tutorial resources for this area of work.

12:31 hkaiser has joined #ste||ar

12:38 <Yorlik> Heyo hkaiser! If you read up a couple of lines .. would you pls answer some newbie issue with pointer arithmetic coding style? (C vs C++ casts)?

12:38 <hkaiser> uhh

12:38 <hkaiser> I don't know anything about pointers ;-)

12:38 <Yorlik> You'r joking, right? Or do you imply you never ever use pointer again?

12:39 <hkaiser> you want to have type safety but at the same moment you use reinterpret_cast

12:39 <hkaiser> that does not go together

12:39 <hkaiser> but seriously

12:40 <Yorlik> I wonder if I should create new types

12:40 <hkaiser> you don't need to cast a pointer first to uint64 and then to void*

12:40 <hkaiser> a simple static_cast<void*>(ptr) will do that

12:40 <Yorlik> I want to add an error code to it

12:40 <hkaiser> casting back requires a reinterpret_cast, however

12:40 <Yorlik> The pointer has 12 free bits at the bottom

12:41 <hkaiser> urgs

12:41 <Yorlik> So i want to use it as a tagged pointer to a memory page

12:41 <hkaiser> stick with whatever you had with C-style casts, the C++ version will not give you anything more

12:42 <Yorlik> It might just raise an alarm flag if I see C style casts in my code and avoid them elsewhere

12:42 <hkaiser> you can also create a special type that acts as the tagged pointer

12:43 <Yorlik> I wonder if a special class with static functions for tagging could do the trick, so i would encapsulate away the unsafe parts of the code

12:44 <Yorlik> That shouldn't cause any serious runtime issues in terms of performance / memory usage

12:44 <hkaiser> here is an example of such a tagged_ptr: https://github.com/STEllAR-GROUP/hpx/pull/3406/files#diff-4215f58a490be17f635ae008dc68de89R62

12:44 <Yorlik> I mean - the code works already - I just got a ton of squiggles and I try to avoid pragma warning away everything

12:45 <Yorlik> cpmressed_ptr_type - lol

12:45 <Yorlik> I like the name already and the underlying type

12:48 <zao> Yorlik: standards-wise you should only cast between regular pointer types and integer types that are of the same size, and not rely on the integer value except that it can roundtrip ptr to int to ptr.

12:48 <zao> uintptr_t is such a suitably sized type.

12:49 <Yorlik> Yeah - my code kinda assumes 64 bit machines

12:49 <Yorlik> But what once we get 128 bit machines ... :D

12:49 <Yorlik> Or we try to run HPX on a Z80 ...

12:50 <Yorlik> But seriously: i really wann write safe code and avoid making it ugly

12:51 <zao> There are 64-bit machines with trap representations for example. All I’m saying is that you’re skirting toward UB territory.

12:52 <Yorlik> What's a trap representation?

12:53 <hkaiser> Yorlik: use size_t instead of uitn64_t, that is guaranteed to be large enough to store a pointer

12:53 <hkaiser> also, the compressed_ptr is probably all you need

12:53 <hkaiser> hides all ugliness

12:53 <Yorlik> I'll study it

12:53 <zao> size_t is technically supposed to only be able to hold the size of an array, which on _most_ architectures is the right size :P

12:54 <hkaiser> zao: you sure?

12:54 <zao> They may have changed the wording around that since 03.

12:55 <zao> Yorlik: You know how floating point numbers can be NaN, and in some CPU modes they cause hardware exceptions?

12:55 <Yorlik> Not really

12:55 <Yorlik> I always kinda assumed NaN would be a special reserved value

12:56 <zao> Special bit patterns that are Not-a-Number, which propagate through numeric operations.

12:56 <Yorlik> But I'm blank on that

12:56 <zao> On some architectures, notably Itanium, there's Not-a-Thing, which holds for regular integers.

12:56 <Yorlik> They are hardware defined?

12:57 <Yorlik> I think I might just make a special MemoryPagePtr class form my purposes as an excercise and taking inspiration from your compressed Ptr

12:57 <Yorlik> And avoid any dynamic / virtual stuff in it

12:58 <Yorlik> Yesterday I ended up teaching catch2 to test against segfaults ... :D

12:59 <zao> I looked them up, seems like NaT:s are not directly part of the value representation, so they're not necessarily a problem here :D

12:59 <zao> https://devblogs.microsoft.com/oldnewthing/?p=41003

12:59 <zao> I'm sure your code is "fine", it's just the coercion to a type that's not necessarily of pointer size that's iffy.

13:00 <zao> hkaiser: cppreference seems to imply that it only needs to be able to store the maximum size of an object, which could technically be smaller than a pointer.

13:01 <Yorlik> The problem is, that strategically other developers might use my code in the future, write new allocators or poke around in existing code and all this allocation stuff is so deep in the system I really want to make it as rock solid as i can.

13:01 <zao> It's the kind of reading that ##c++ would love to have :P

13:03 <zao> Personally I'm mostly scared of overly clever compilers optimizing my code according to spec.

13:35 <Yorlik> When doing arithmetic on memory cells - is there a c++ type that is guaranteed to always be able represent a single cell? Should I use byte or char?

13:35 <Yorlik> It's more a theoreical question, but after all void* has no type and I need to do arithmetic on memory cells, on pages too etc.

13:36 <Yorlik> So I believe I should represent them propely instead of doinf wild void* casts all over the place

13:36 <Yorlik> Sry - my typing is horrible again - need more coffee ... :)

13:39 nikunj has quit [Remote host closed the connection]

13:42 Coldblackice has joined #ste||ar

13:44 <zao> Yorlik: You might want to read https://en.cppreference.com/w/cpp/language/object

13:46 <zao> `unsigned char` has been a good bet in the past as it has unsurprising signedness, haven't done C++ since std::byte appeared.

13:46 <Yorlik> IC - thanks for the link! Seems unsigned char and byte are equivalent. I might just use byte for the lowlevel thing because its more precise imo.

13:47 <Yorlik> With "char" you always think of something that prints.

13:47 <Yorlik> But a byte I can print as char, int or hex

13:47 <zao> Doesn't matter much when you're operating on the pointer type, but char has implementation-defined signedness, which is always fun.

13:48 <Yorlik> BTW - there is another interesting detail in that article: "For example, multiple floating-point bit patterns represent the same special value NaN. "

13:48 <Yorlik> So - not all NaNs are equal internally - didn't know that

13:49 <zao> There's like a million of them.

13:49 <Yorlik> wow

13:49 <Yorlik> I think I'll se them as neglected orphans ...

13:49 * Yorlik feels some wird pity for the NaNs all at a sudden

13:51 <zao> f32 has a 8-bit exponent, where an exponent of 0xFF is all NaNs, 0x00 is all denormals.

13:51 <zao> (well, not all)

13:51 <zao> https://en.wikipedia.org/wiki/Single-precision_floating-point_format#Exponent_encoding

13:53 <Yorlik> I'm totally undereducated in this IEEE stuff - first stumbled over it when working with Lua - seems there is some learning to do.

13:58 <Yorlik> I start wondering why would we ever need void*? Any pointer to a memory cell could be byte* - couldn't/shouldn't it?

13:58 <hkaiser> byte* implies a type, void* does not

13:59 <zao> There's some value in a vocabulary type that has some semantic distinctions.

14:00 <Yorlik> You mean because byte* would imply the traget has 1 byte length and void* is totally open to any interpretation?

14:01 <hkaiser> yes

14:01 <zao> byte* also has some alignment assumptions, and sometimes it's great not being able to dereference or do arithmetic.

14:02 <Yorlik> It somehow makes sense to not have pointer arihmetic for void*, still it is a bit awkward to use casts to byte or char to actually do arithmetic with void*. otoh - maybe the problem is the ease of misusing void*.

14:03 <hkaiser> Yorlik: doing pointer arithmetic with (u)char* implies an object size of one (byte)

14:03 <Yorlik> Yes - a single memory cell.

14:03 <hkaiser> instead of foo* p = ... ; ((char*)p) + sizeof(foo) you could write ++p

14:05 <Yorlik> The problem appears at the interface to the OS, which always requires void* to reserve an address space and to commit a page. I think I'll just put the conversion closely there and above use pages or my object types.

14:05 <hkaiser> any pointer is default convertible to void*

14:05 <hkaiser> so you only need to cast when you go from a void* to foo*

14:06 <hkaiser> (well any non-member function pointer, but those are not really 'pointers' anyways)

14:07 <Yorlik> My low level API is like this(different implementations per OS under the hood):

14:07 <Yorlik> / get system memory pagesize

14:07 <Yorlik> static size_t get_mempage_size( );

14:07 <Yorlik> const size_t pagesize = get_mempage_size( ); // 0x1000; // 4096 bytes

14:07 <Yorlik> / try to reserve the given amount of virtual memory, page aligned and grainsized to pages

14:07 <Yorlik> static void* vreserve( size_t sz );

14:07 <Yorlik> / Give up reservation of the region pointed into (usually you never do that)

14:07 <Yorlik> static bool vunreserve( void* ptr );

14:07 <Yorlik> / commit pages with given address and size and back it up with real memory rounded up to full pages

14:07 <Yorlik> static void* vcommit( void* ptr, size_t sz );

14:07 <Yorlik> / give up physical backup of pages WHICH ARE FULLY COVERED by the given range

14:07 <Yorlik> static bool vuncommit( void* ptr, size_t sz );

14:07 <Yorlik> So - here I can't really avoid the void*

14:08 <hkaiser> I didn't say you should

14:08 <Yorlik> But I think I'll make my allocators all strictly typed.

14:08 <Yorlik> I'm just trying to find out where to do what and how to minimize risk of abuse.

14:10 <hkaiser> implement typed allocators, what those do under the hood is their business

14:11 <Yorlik> I have one allocator that is typed to pages and is ued by the object allocators

14:12 <Yorlik> like this: template <size_t PAGESIZE> struct LinearPageAllocator;

14:12 <Yorlik> So pagesize is 4096 for now

14:13 <Yorlik> And then on top my typed one:

14:13 <Yorlik> template <typename T, FreelistType FL>class ContiguousPoolAllocator ;

14:13 <Yorlik> FreeList can be a set or a vector

14:13 <Yorlik> so i can control the policy what to reuse foirst: lowest or last freed

17:38 lsl88 has joined #ste||ar

17:53 <hkaiser> lsl88: you're on the roll!

17:59 <lsl88> ok, thanks :)

18:11 K-ballo has joined #ste||ar

19:06 <lsl88> I haven't finished reading the whole manual, but I have some ideas, can I share them here?

19:17 <hkaiser> lsl88: absolutely!

19:18 <lsl88> Something that I find is that tables somethimes are broken, specially when converting the manual to the PDF version.

19:18 <hkaiser> yes, pdf generation is always finicky

19:18 <hkaiser> we just implemented it

19:19 <lsl88> it happens in almost all the chapters, specially when a field is big

19:19 <hkaiser> nod

19:19 <hkaiser> I think this shouldn't be the main priority, html is the main target anyways

19:21 <lsl88> oh, I see

19:21 <lsl88> In the optimization chapter tables break too, when they are wider than the page, see https://stellar-group.github.io/hpx/docs/sphinx/branches/master/html/manual/optimizing_hpx_applications.html#existing-hpx-performance-counters

19:22 <lsl88> maybe a solution could be rearranging the info there

19:22 <hkaiser> I have no idea what we could do about this... you?

19:22 <hkaiser> ok

19:25 <lsl88> I am thinking of maybe instead of having tables, have it in text with bullets.

19:25 <hkaiser> hmmm

19:25 <hkaiser> it's several pieces of information for each of the counters

19:26 <hkaiser> using a table sounds logical

19:26 <hkaiser> we might reduce some of the repitition in text

19:27 <hkaiser> or somehow fused table entries - not sure

19:27 <lsl88> yes, I was going to say that some info is repeated

19:28 <hkaiser> or somehow have a separate page for each of the counters we could link from a smaller table

19:30 <hkaiser> lsl88: does spinx allow to specify table field widths, that might help as well

19:30 <hkaiser> sphinx*

19:31 <lsl88> I am taking a look

19:32 <lsl88> yes, apparently we can specify the width of each field

19:33 <hkaiser> might work - at least its worth a try

19:34 <lsl88> yes, I am also thinking of changing the description field to the second place

19:34 <lsl88> WDYT?

19:35 <hkaiser> sounds good to me

19:40 <lsl88> and some of the info is duplicated or exchanged, that's why I was thinking of having a list instead of a table. For instance: in AGAS Performance Counters (https://stellar-group.github.io/hpx/docs/sphinx/branches/master/html/manual/optimizing_hpx_applications.html#id9) I believe the Description field is number 4 instead of Parameters

19:42 <lsl88> and in Parcel layer performance counters (https://stellar-group.github.io/hpx/docs/sphinx/branches/master/html/manual/optimizing_hpx_applications.html#id10) if you read the Description field, after the first paragraph you have the same info duplicated

19:43 <lsl88> (in the meantime I am fixing small typos)

19:43 <hkaiser> right

19:44 <hkaiser> lsl88: feel free to combine typo fixes into bigger pull requests

19:44 <lsl88> (hope you don't mind having pull requests)

19:44 <hkaiser> not at all

19:44 <lsl88> yes, I was going to say that I end up doing small pull requests but just because I am fixing them as I see them

19:48 nikunj has joined #ste||ar

19:50 <nikunj> hkaiser: yt?

19:50 <hkaiser> here

19:50 <nikunj> did you get time to check the PR?

19:51 <hkaiser> have not looked yet, sorry

19:51 <nikunj> I was about to make changes in the benchmarks to make sure there are no cache inconsistencies

19:51 <hkaiser> k

19:51 <nikunj> ohh, then I'll not make changes for replicate one's

19:51 <hkaiser> no, pls go ahead

19:51 <nikunj> alright

19:52 <nikunj> the PR is pretty simple and the implementation is pretty straightforward

19:52 <nikunj> so I don't think there's any problems in the implementation

19:52 <nikunj> I'll go ahead with all benchmarks then

19:52 <hkaiser> k

20:16 <Yorlik> I start liking test driven development. All the red tests act as a todo list for me, right in my IDE.

20:17 <Yorlik> If I need to remiond myself of sth I just make a quick failing test.

20:18 <Yorlik> TEST_CASE( "Household::GetCoffee" ) {

20:18 <Yorlik> CHECK(false);

20:18 <Yorlik> }

20:18 <Yorlik> :D

20:35 <nikunj> is rostam still down?

21:13 <hkaiser> nikunj: yes, will be up tomorrow only - cooling was down

21:13 <nikunj> ok

21:13 <nikunj> I have written the benchmarks and scripts

21:13 <nikunj> I'll run them tomorrow morning

21:13 <hkaiser> ok

21:49 <Yorlik> Oh man ... "Don't cast away const" and "cannot initialize T* from const T* " - this is nuts.

21:49 <Yorlik> If I have a const base pointer and i want to derive a non const pointer pointing to an item in the buffer ...

21:50 <Yorlik> I get a love hate relationship with the squiggles - lol

23:03 <hkaiser> Yorlik: casting away constness is a sign of bad/flawed/incomplete design

23:14 <Yorlik> How would you solve the usage of a buffer as array wuith a const base pointer?

23:15 <Yorlik> I mean - I'm totally open to learn

23:15 <Yorlik> But at this low level there can be no compromises conserning performance

23:16 <Yorlik> And really: I don't understand how copying a const value in a formuala could be bad design, when it's used to compute a derived value

23:16 <Yorlik> like nonconst a = const b + nonconst c

23:23 <Yorlik> I'm pondering to install GSL and just use spans for my buffers

23:24 <Yorlik> spans look cool to me in this area of wild void* and T* pointers

23:35 <hkaiser> Yorlik: don't make the base-ptr const in the first place ;-)

23:36 <Yorlik> But it never ever changes and should never ever be changed

23:36 <Yorlik> If anything in this part of the code should be const its that pointer

23:37 <Yorlik> I mean practically - yes It probably wouldn't hurt to make it non const - however - I'm trying to internalize the meaning of constness

23:37 <Yorlik> If not I am missing something conceptiually

23:37 <Yorlik> pracxtically it all works nicely

23:37 <Yorlik> I'm just trying to understand the underlying C++ consepts

23:51 <Yorlik> I think the base problem lies here:

23:51 <Yorlik> const testdata* td_array_base = cpa_td.allocate (itemcount);

23:51 <Yorlik> giving the base pointer to my allocated are a the type of the stored items is wrong in the forst place

23:51 <Yorlik> It should be void

23:53 <Yorlik> If I want to create an array like structure from this pointer I should either use a typed span or a typed non const pointer

23:53 <Yorlik> This nonconst typed pointer would then be used in all array arithmetic

23:53 <zao> (note the distinction between a pointer-to-const an da const-pointer-to-mutable)

23:54 <Yorlik> Its about the pointer itself - the data is all mutable

23:54 <hkaiser> well, if you use the base ptr to access a non-const member, then the ptr shouldn't be const

23:55 <Yorlik> I think the issue it, that const void* myBase and T* myArrayBase are different in meaning

23:55 <hkaiser> Yorlik: well, if the pointer itself is const then you shouldn't have problems accessing a non-const element

23:55 <Yorlik> The problem comes when initializing a typed non const pointer from the const pointer to the base

23:55 <hkaiser> a const pointer is a 'foo* const' while a ptr to a const is a 'foo const*'

23:56 <Yorlik> const testdata* td_array_base = cpa_td.allocate (itemcount); testdata* ptr = td_array_base; does not work

23:56 <hkaiser> sure it doesn't

23:56 <Yorlik> The data is not const right?

23:57 <hkaiser> it is

23:57 <Yorlik> not by this definition

23:57 <Yorlik> Really?

23:57 <hkaiser> if you need a ptr that is const write foo* const = ...

23:57 <zao> Note that `const T*` is `T const*`.

23:57 <zao> If you want a const pointer, `T* const`.

23:57 <hkaiser> one more reason not to use east-const

23:57 <Yorlik> Dang - I want the pointer itself to be const, not the data

23:57 <zao> Yorlik is using west-const, which is horrible ;)

23:58 <hkaiser> that's what I meant

23:58 <Yorlik> Thats a matter of taste

23:58 <hkaiser> it's a matter of consistency

23:58 <Yorlik> How would you write the above code correctly?

23:58 <hkaiser> foo* const = allocate(...);

23:59 <Yorlik> testdata * const td_array_base = cpa_td.allocate (itemcount); testdata* ptr = td_array_base; ?

23:59 <hkaiser> you essentially wrote: foo const* = allocate(...)

23:59 <hkaiser> yes

23:59 <Yorlik> FFS - lol

23:59 <zao> The only way to get the const on the left side there would be `using FooPtr = foo*; const FooPtr = ..`

23:59 <hkaiser> urgs

23:59 <Yorlik> rofl