hkaiser changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar-group.org | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | This channel is logged: irclog.cct.lsu.edu
K-ballo has quit [Ping timeout: 240 seconds]
K-ballo has joined #ste||ar
Yorlik_ has joined #ste||ar
Yorlik has quit [Ping timeout: 265 seconds]
hkaiser has quit [Quit: Bye!]
<K-ballo> rand() is a bad option
gdaiss[m] has quit [Remote host closed the connection]
HHN93 has joined #ste||ar
<HHN93> How do I find return type of a function?
<HHN93> I am aware of result_of_t, invoke_result_t but does HPX have a macro for it?
HHN93 has quit [Quit: Client closed]
hkaiser has joined #ste||ar
HHN93 has joined #ste||ar
HHN93 has quit [Ping timeout: 245 seconds]
Arnav-Negi has joined #ste||ar
akheir1 has joined #ste||ar
akheir has quit [Ping timeout: 256 seconds]
akheir has joined #ste||ar
akheir1 has quit [Ping timeout: 264 seconds]
Arnav-Negi has quit [Quit: Konversation terminated!]
Arnav-Negi has joined #ste||ar
Arnav-Negi has quit [Client Quit]
<K-ballo> no macro, there are traits
<gnikunj[m]> hkaiser: found the atoi link: https://alexott.blogspot.com/2010/01/boostspirit2-vs-atoi.html
<gnikunj[m]> Also a Google thread discussion where Joel said boost spirit v2 is anywhere from 2.5x to 7x faster than Antlr. I'm sure v3 will bring me some joy now ;)
<gnikunj[m]> hkaiser K-ballo : are there any spirit v3 programs that parse a relatively complex grammar (not json or single node grammars in examples) that I can look into? Need to understand how I can connect an already built infrastructure (Instructions/Functions etc. classes) to Spirit V3 structs/classes (Boost Fusion only gets me partly there as some types are classes/structs/enums that need to be mapped as well).
<satacker[m]> (I rely on spirit qi btw )
<gnikunj[m]> I've mostly written code in spirit qi as well. hkaiser asked me to try v3 so...
<hkaiser> gnikunj[m]: the biggest parser in Spirit V3 I have is parsing the performance counter names in HPX
<gnikunj[m]> hkaiser: not sure how complex that would be. Could you point me to the code for it?
<hkaiser> however, converting Spirit V2 to V3 is trivial (almost a no-brainer), so looking at any V2 code might be an option - and there are plenty of complex V2 parsers available
<gnikunj[m]> Aah, didn't think of that. I can look into Phylanx then. That has been my goto.
<hkaiser> well, the PhySL parser is not too difficult either
<K-ballo> it's a good idea to keep the parser not too difficult anyways
<K-ballo> don't go spirit crazy, don't play into attribute compatibility magic, don't play into semantic actions, don't play into validation, that's all post-parse
<gnikunj[m]> K-ballo: need to write a parser for a language ;) The corresponding Antlr4 grammar for it is about ~150 lines of grammar. Given I need to create AST myself in spirit v3, I would've liked some inspiration to begin the work.
<K-ballo> I don't see how a language changes things
<hkaiser> the spirit parser should be constructing the AST directly, with the constraints K-ballo mentioned, just keep the types the parser produces as-is and let it compose the hierarchical AST
<K-ballo> for example, don't have your language parser deal with precedence and associativity, just parse a simple dumb expression and handle those later
<hkaiser> it's more of a parse tree actually, not even quite an AST
<K-ballo> you _can_ have the parser deal with precedence and associativity, but it's simply not a good idea... best case scenario you end up with considerably more complex code that takes way longer to compile to do just the same
<K-ballo> and with a spirit parser that also means you can't debug it, so you better get it right the first time
<gnikunj[m]> Right, but I do need to parse the tree and set values (semantic actions/boost fusion).
<K-ballo> don't use semantic actions
<K-ballo> have your ast match the grammar's natural attributes, no semantic actions, no compatibility magic
<K-ballo> you can have two ASTs: one that's the result of the parser, and one that's the result of an actual "compilation" phase (with type and semantic rules baked into it)
<hkaiser> the trick to understand attribute handling in Spirit is to understand this: https://www.boost.org/doc/libs/1_80_0/libs/spirit/doc/html/spirit/qi/quick_reference/compound_attribute_rules.html
<gnikunj[m]> That's a little hard to do as of now, the way the code is already written. I will need some plug points. But I get your point and will try for a KISS approach.
<K-ballo> so you are working with an existing x3 parser?
<K-ballo> do NOT try to parse into an existing ast, use an intermediate
<gnikunj[m]> No. It's not as straightforward and I'm bound by NDA :/
<K-ballo> alright, in that case I guess I don't need to help you
HHN93 has joined #ste||ar
<HHN93> T *arr[4] creates an array of 4 elements, populated with garbage
<HHN93> is there something similar I can do for a vector?
<hkaiser> no
<hkaiser> vector calls the constructor for all elements
<HHN93> I wanted to try vectorising generate_n by generating 4 elements first, storing them in a vector
<HHN93> and then loading them into the corresponding position (this is more likely to be vectorised)
<HHN93> so any suggestion on how I can do it? Should I use a C-style array?
<hkaiser> why can't you use a vector?
<HHN93> if the default constructor is deleted?
<HHN93> it would throw an error, right?
<hkaiser> std::vector<int> v(4); creates a vector with 4 default initialized integers
<HHN93> std::vector<T> v(4) where T has no default constructor
<hkaiser> does T have another constructor?
<HHN93> let's say, yes
<hkaiser> use std::vector<int> v(4, 0); if it can be constructed from an int, for instance
<HHN93> I want to add vectorisation to generate_n(v.begin(), v.end(), f)
<HHN93> v is vector<T>
<HHN93> I am generating the object using f and moving it into the corresponding iterator
<HHN93> let me share the code instead
<hkaiser> ok?
<HHN93> this code will fail to compile if f_RetTy has no default constructor, right?
<HHN93> so can I do anything about it?
<hkaiser> yes, indeed
<HHN93> what do I do?
<hkaiser> vectorization makes sense for types only that can be used in vector registers: int, double, float, etc.
<hkaiser> all of those are default constructibe
<HHN93> ok, but I don't think users would be happy when they move from std::generate_n to hpx::generate_n and see that their compilation fails
<HHN93> can I do something or do I just drop the idea?
<hkaiser> HHN93: well, for non-vectorizable code this wouldn't be used
<hkaiser> non-vectorizable types*
<HHN93> We do try to stick to std, right? and also just a note, my method doesn't vectorise the loads even for int, tried checking the vectorisation report and the assembly
<HHN93> so do we break our alignment with std?
<HHN93> `my method doesn't vectorise the loads even for int, tried checking the vectorisation report and the assembly`
<HHN93> at least with clang 14
<hkaiser> the std wouldn't attempt to vectorize non-vectorizable types, even for unseq
<HHN93> yeah, but in our case the compilation itself fails. Which does seem less desirable
<hkaiser> it fails only if you unconditionally try to vectorize
<hkaiser> vectorizing code should be selected only for vectorizable types
<HHN93> oh ok
<HHN93> fine,
<HHN93> also can I trust compiler vectorisation reports?
<hkaiser> probably, looking at the assembly is more informative, however
<HHN93> I am running a profiler instead of screening through the whole assembly
<HHN93> and looking at hotspots on the profiler
<gnikunj[m]> HHN93: compiler vectorization reports are very accurate. The reports are generated as the compiler tries to vectorize loops.
<gnikunj[m]> IIRC the compiler fails to vectorize in the event of: 1. conditional branching within the loop; 2. anti-dependency of a loop variable found outside the basic block; 3. non-trivial data layout making it difficult for the compiler to interpret and implement
<HHN93> yes, that's why I am breaking generate into 2 loops, a generating part and then loading into the vector
<HHN93> loading should ideally be vectorised, but doesn't seem to be
<HHN93> -Rpass-analysis=loop-vectorize
<HHN93> tried this compile option, and there is no mention of the loop
<gnikunj[m]> Point me to the code. Could be something trivial.
<gnikunj[m]> and why would L19 be vectorized again?
<HHN93> looking at assembly, there are no vector registers being used (no xmm, ymm) so I assume there is no vectorization happening
<HHN93> isn't it an element being moved into the vector?
<HHN93> it should correspond to a load instruction right?
<gnikunj[m]> sure - 4 loads and not a single simd load
<HHN93> ok how do you suggest I make it a simd load?
<HHN93> I thought that's what HPX_VECTORISE did
<HHN93> ok I think I had a wrong understanding of how vector registers worked
<HHN93> let me look into it again
<gnikunj[m]> It comes under 3. of what I said above - the compiler doesn't know if it can write all 4 directly using first. Why not use copy_from from std::simd or srinivasyadav18's implementation?
<HHN93> also is there any documentation for understanding std::simd?
<HHN93> apart from cppreference
<hkaiser> HHN93: wg21.link/n4808, section 9
<HHN93> thank you
HHN93 has quit [Quit: Client closed]
HHN93 has joined #ste||ar
HHN93 has quit [Client Quit]
K-ballo has quit [Ping timeout: 240 seconds]
K-ballo1 has joined #ste||ar
K-ballo1 is now known as K-ballo