Unfortunately, I have not found any documented on parallella yet.
I have worked with advanced multi-core systems that provide hardware queues for inter core communications (IPC) and they allow for scalability closer to the 1:1 levels. With the traditional shared memory IPC the non-parallelizable portion of the overall application is increased and generally one is lucky if they can achieve 0.65:1 scalability.
Does the parallella provide any solution for this problem or will we only get ~60% value of each additional core running?
Thanks