aserio changed the topic of #ste||ar to: STE||AR: Systems Technology, Emergent Parallelism, and Algorithm Research | stellar.cct.lsu.edu | HPX: A cure for performance impaired parallel applications | github.com/STEllAR-GROUP/hpx | Buildbot: http://rostam.cct.lsu.edu/ | Log: http://irclog.cct.lsu.edu/
mcopik has quit [Ping timeout: 240 seconds]
Matombo has quit [Remote host closed the connection]
StefanLSU has joined #ste||ar
StefanLSU has quit [Quit: StefanLSU]
parsa has joined #ste||ar
hkaiser has quit [Quit: bye]
Aalice has joined #ste||ar
K-ballo has quit [Quit: K-ballo]
parsa has quit [Quit: Zzzzzzzzzzzz]
david_pfander has joined #ste||ar
david_pfander has quit [Ping timeout: 255 seconds]
patg has quit [Quit: This computer has gone to sleep]
Aalice has quit [Quit: Leaving.]
jaafar has joined #ste||ar
<github>
[hpx] sithhell pushed 1 new commit to throttle_cores: https://git.io/v5PDQ
<github>
hpx/throttle_cores de6c7d7 Thomas Heller: Making inspect happy...
jaafar has quit [Ping timeout: 246 seconds]
bikineev has quit [Remote host closed the connection]
david_pfander has joined #ste||ar
david_pfander has quit [Client Quit]
Matombo has joined #ste||ar
bikineev has joined #ste||ar
bikineev has quit [Remote host closed the connection]
Matombo has quit [Remote host closed the connection]
Matombo has joined #ste||ar
<jbjnr>
I added 2 lines - balanced_numa type, added a string to the parse description (same two links you showed above), but it never gets to the actual decode distribution where the work is done cos the qi stuff throws that the parameter is bad. Seems like it does not use the "| partlit("balanced-numa") >> qi::attr(balanced_numa)" part
<jbjnr>
but I'm not familiar with boost spirit so not sure how to debug inside it
bikineev has quit [Ping timeout: 248 seconds]
<jbjnr>
discovery
<jbjnr>
"balanced-numa" doesn't work because it is similar to "balanced" and the parser is choking
<jbjnr>
if I use "numabalance" it works
<heller>
I was about to say...
<jbjnr>
bloody spirit. shite!
<heller>
it's not spirit
<heller>
if you put balanced-numa at the top, it should work
<jbjnr>
what's to blame then?
<jbjnr>
ok
<heller>
it's the partlit parser, written by hartmut :P
<jbjnr>
pfff
Matombo has quit [Remote host closed the connection]
Matombo has joined #ste||ar
Matombo has quit [Remote host closed the connection]
<github>
[hpx] mcopik opened pull request #2894: Fix incorrect handling of compile definition with value 0 (master...cmake_fix) https://git.io/v5XfZ
bikineev has joined #ste||ar
<heller>
jbjnr: what(): partitioner::add_resource: Creation of 5 threads requested by the resource partitioner, but only 4 provided on the command-line.
<heller>
jbjnr: ever saw this error?
<jbjnr>
if you say hpx::threads=N but then in int main say add_pu 5 times, or add core etc
<jbjnr>
N=4
<jbjnr>
if you want to bind more threads than there are available you have to --alow-oversubscription
<jbjnr>
(might not have implemented that yet)
<heller>
ok
<heller>
it seems to only occur on a specific system
<heller>
doesn't seem to have any effect for release builds
<hkaiser>
heller: looks like a problem in the command line handling code
<heller>
yes
<heller>
it's been there since a while now
<hkaiser>
was not aware of that :/
<heller>
I only ever saw it on buildbot
<heller>
looks like some sort of strange race condition, since not all test runs are affected
<heller>
apart from some spuriosly failing tests and this assert, it doesn't look too bad at the moment
<hkaiser>
heller: just don't remove the assert, pls
<heller>
that was not my plan, just said it is annoying ;)
<diehlpk_work>
mcopik, Most people who were interested in HPXCL asked for CUDA
<diehlpk_work>
mcopik, If you think we still need opencl, I can keep it and disbale it for circle-ci
<heller>
hkaiser: ha! figured it out :D
<hkaiser>
heller: what is it?
<heller>
hkaiser: we look for SLURM_NODELIST, which includes the complete nodelist we asked for during salloc, however, when we do an srun -n1 -N1, we might get a different nodelist, which is in the environment variable SLURM_STEP_NODELIST
<diehlpk_work>
Should we put this paper into the OpenSuCo paper too? Frank Löffler, Zhoujian Cao, Steven R. Brandt, Zhihui Du. “A new parallelization scheme for adaptive mesh refinement.” Journal of Computational Science, 16 (2016) 79–88.
bikineev has quit [Ping timeout: 246 seconds]
hkaiser has joined #ste||ar
<diehlpk_work>
hkaiser, Frank Löffler, Zhoujian Cao, Steven R. Brandt, Zhihui Du. “A new parallelization scheme for adaptive mesh refinement.” Journal of Computational Science, 16 (2016) 79–88. Should we mention this paper here too?
<hkaiser>
diehlpk_work: that has no relation to hpx
<diehlpk_work>
Ok
pree has quit [Ping timeout: 260 seconds]
pree has joined #ste||ar
<diehlpk_work>
hkaiser, Can you read the introduction of the paper?
<diehlpk_work>
I shortened heller's introduction from the thesis
david_pfander has quit [Ping timeout: 248 seconds]
<hkaiser>
diehlpk_work: will do
<diehlpk_work>
Ok, I will finish the conclusion soon.
pree has quit [Ping timeout: 260 seconds]
pree has joined #ste||ar
pree has quit [Ping timeout: 246 seconds]
bikineev has joined #ste||ar
rod_t has joined #ste||ar
pree has joined #ste||ar
rod_t has quit [Client Quit]
rod_t has joined #ste||ar
pree has quit [Ping timeout: 260 seconds]
<diehlpk_work>
hkaiser, Can we use this one here as a reference for HPX.Compute Copik, M., and Kaiser, H. Using sycl as an implementation framework for hpx.compute. In Proceedings of the 5th
<diehlpk_work>
International Workshop on OpenCL (New York, NY, USA, 2017), IWOCL 2017, ACM, pp. 30:1–30:7.
<hkaiser>
diehlpk_work: absolutely
<hkaiser>
I missed that one, thanks
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
jaafar has joined #ste||ar
aserio has quit [Ping timeout: 246 seconds]
<heller>
hkaiser: only checking for the STEP variables is the right thing according to the docs
<diehlpk_work>
hkaiser, How should we cite HPXCL? Chapter of my thesis or just link to github repo?
<heller>
There might be situations where you only have one job step (for example when doing a srun in solitude, that is without salloc or sbatch beforehand)
pree has joined #ste||ar
pree has quit [Read error: Connection reset by peer]
jaafar has quit [Ping timeout: 252 seconds]
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
rod_t has joined #ste||ar
pree has joined #ste||ar
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
pree has quit [Read error: Connection reset by peer]
rod_t has joined #ste||ar
EverYoung has quit [Ping timeout: 246 seconds]
jkleinh has joined #ste||ar
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
jbjnr has quit [Read error: Connection reset by peer]
jaafar has joined #ste||ar
jbjnr has joined #ste||ar
rod_t has joined #ste||ar
pree has joined #ste||ar
jaafar has quit [Ping timeout: 252 seconds]
EverYoung has joined #ste||ar
EverYoung has quit [Remote host closed the connection]
EverYoung has joined #ste||ar
aserio has joined #ste||ar
Matombo has quit [Ping timeout: 252 seconds]
bikineev has quit [Ping timeout: 240 seconds]
Matombo has joined #ste||ar
StefanLSU has joined #ste||ar
aserio has quit [Ping timeout: 240 seconds]
bikineev has joined #ste||ar
StefanLSU has quit [Quit: StefanLSU]
StefanLSU has joined #ste||ar
StefanLSU has quit [Quit: StefanLSU]
jbjnr_ has joined #ste||ar
aserio has joined #ste||ar
jbjnr has quit [Ping timeout: 255 seconds]
jbjnr_ is now known as jbjnr
jfbastien has quit [Ping timeout: 255 seconds]
Matombo has quit [Read error: Connection reset by peer]
bikineev has quit [Remote host closed the connection]
aserio1 has joined #ste||ar
bikineev has joined #ste||ar
aserio has quit [Ping timeout: 246 seconds]
aserio1 is now known as aserio
bikineev has quit [Remote host closed the connection]
Matombo has joined #ste||ar
bikineev has joined #ste||ar
<github>
[hpx] aserio created new_people from master (+0 new commits): https://git.io/v51fZ
jkleinh has quit [Quit: Page closed]
bikineev has quit [Remote host closed the connection]
bikineev has joined #ste||ar
hkaiser has quit [Quit: bye]
aserio has quit [Ping timeout: 246 seconds]
jkleinh has joined #ste||ar
aserio has joined #ste||ar
pree has quit [Quit: AaBbCc]
bikineev has quit [Remote host closed the connection]
aserio1 has joined #ste||ar
bikineev has joined #ste||ar
aserio has quit [*.net *.split]
zbyerly has quit [*.net *.split]
ABresting has quit [*.net *.split]
aserio1 has quit [Ping timeout: 264 seconds]
denis_blank has quit [Ping timeout: 240 seconds]
denis_blank has joined #ste||ar
zbyerly has joined #ste||ar
jaafar has joined #ste||ar
aserio has joined #ste||ar
aserio has quit [Client Quit]
bikineev has quit [Remote host closed the connection]
bikineev has joined #ste||ar
wash is now known as washcuda
bibek_desktop has quit [Quit: Leaving]
jaafar has quit [Ping timeout: 260 seconds]
hkaiser has joined #ste||ar
Matombo has quit [Quit: Leaving]
ABresting has joined #ste||ar
mcopik has quit [Ping timeout: 248 seconds]
jkleinh has quit [Quit: Page closed]
<github>
[hpx] hkaiser created fixing_2896 (+1 new commit): https://git.io/v51VI
<github>
hpx/fixing_2896 26be28a Hartmut Kaiser: Removing dependency on Boost.ICL
<hkaiser>
zao: this should take care of the odd gcc issue ^^
<zao>
Yay!
<github>
[hpx] hkaiser opened pull request #2897: Removing dependency on Boost.ICL (master...fixing_2896) https://git.io/v51Vm
bikineev has quit [Remote host closed the connection]
rod_t has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
bikineev has joined #ste||ar
bikineev has quit [Ping timeout: 240 seconds]
<github>
[hpx] hkaiser pushed 1 new commit to partitioned_vector: https://git.io/v51Vy