[Paper] A Distributed Shared Memory Model and C++ Templated

Announcements of academic papers and technical reports based on Parallella or the Epiphany architecture.

[Paper] A Distributed Shared Memory Model and C++ Templated

Postby jar » Fri Apr 28, 2017 12:59 am

https://arxiv.org/abs/1704.08343
Title: A Distributed Shared Memory Model and C++ Templated Meta-Programming Interface for the Epiphany RISC Array Processor
Abstract: The Adapteva Epiphany many-core architecture comprises a scalable 2D mesh Network-on-Chip (NoC) of low-power RISC cores with minimal uncore functionality. Whereas such a processor offers high computational energy efficiency and parallel scalability, developing effective programming models that address the unique architecture features has presented many challenges. We present here a distributed shared memory (DSM) model supported in software transparently using C++ templated metaprogramming techniques. The approach offers an extremely simple parallel programming model well suited for the architecture. Initial results are presented that demonstrate the approach and provide insight into the efficiency of the programming model and also the ability of the NoC to support a DSM without explicit control over data movement and localization.

Comments and discussion are appreciated.
User avatar
jar
 
Posts: 222
Joined: Mon Dec 17, 2012 3:27 am

Re: [Paper] A Distributed Shared Memory Model and C++ Templ

Postby dobkeratops » Thu May 04, 2017 11:35 pm

Just reading through it... I see 'parallel_for' taking a lambda... looks very interesting. I need to read it again more closely.

If I've understood correctly, this is exactly the sort of thing I was after in earlier brainstorming posts?

I notice details re: how you map the calculations onto the grid... I see a mention of offsets in array indexing.

The application of the CLETE-2 package requires a compiler that correctly implements the C++17 standard specification and also correctly optimizes C++ template partial specializations to produce efficient code. In this work we utilize the GCC 5.4 compiler for targeting the Epiphany processor. We additionally rely on the COPRTHR-2 SDK which provides run-time support for the Epiphany processor including support for fast SPMD direct co-processor execution, without requiring offload semantics or co-design with the ARM CPU on the Parallella platform. As a result, the compilation and run-time environment used in this work resembles that of an ordinary Linux platform with a multi-core processor.


So basically that's the holy grail as I see it: the ability to write portable code that can run on the e-cores, but also on other parallel processors, so long as they're not too dissimilar (write once; run on e-cores, clusters, GPUs...).

What I had in mind was building more elaborate 'higher-order functions' (various combinations of map/gather/filter, etc.) which could express the dataflow, to give the Epiphany implementation more opportunity to leverage the scratchpads/DMA; if I've understood correctly, perhaps those could be built directly (as helper code) on top of what you demonstrate here.

But perhaps this technique is doing all that already through templated types for the indices, with a lot of TMP magic to compile to something efficient.


Is this proprietary (I see 'U.S. Army Research Laboratory')... or can it appear in the SDK? Are you able to put any of this on GitHub?


I still don't have a Parallella myself... I continue to mess with regular GPUs and OpenCL. Knowing the 1024-core chip exists does dramatically increase the motivation to write suitable code for it.
dobkeratops
 
Posts: 159
Joined: Fri Jun 05, 2015 6:42 pm
Location: uk

Re: [Paper] A Distributed Shared Memory Model and C++ Templ

Postby jar » Fri May 05, 2017 5:15 am

I thought you'd like this. Yes, this is similar to what you were brainstorming, but you were ahead of your time. GCC wasn't ready (at least version 4.8 with the older Linux image), nor was some of our software.

And it's not ready for prime time yet. This was an early experiment on Epiphany as a side project from the main effort. There is a lot left to improve, but the intention is to place this on GitHub at some point, and it won't just be for Epiphany. We would like to delay that as long as possible after witnessing what happened with Kokkos -- they released an unfinished product on the DOE in a panic to have some semblance of code portability between their next Xeon Phi and Power/GPU supercomputers. The end result was that many things are completely missing or unrefined, and properly fixing them would break existing codes.

I don't think we want to begin implementing 'higher-order functions', but rather enable expressions to be written that compile to efficient code. But I'll keep it in mind. It's not a library, though it might be considered a header-only library. It actually can't be pre-compiled and shipped as a proprietary package, so it must be open source if anyone is going to use it.

The memory layout accessors vary between architectures and platforms. It's a single line of code appearing in an application header that defines the memory layout, and it has a certain complexity to it. Each platform will have defaults, but it's a memory-layout-first approach to parallel computing. The parallel kernel code remains the same and the expression templates handle the rest. Each platform may have specific optimizations baked into the layout description.

Re: [Paper] A Distributed Shared Memory Model and C++ Templ

Postby dobkeratops » Fri May 05, 2017 2:34 pm

but rather enable expressions to be written that compile to efficient code.


If the underlying template 'magic' does exactly the same job, then great.
I could simply implement my own idea as helper code on top. It sounds like this library is actually more ambitious/general already.

I'm sure it would just take a few examples to make it clear how it works.
