notzed,
Great blog post! Really enjoyed reading it.
*It's possible to run gdb on code running on a core by first launching the 'e-server' and the launching e-gdb with your ecore (epu if you like) elf. It gets harder when you have a cooperative host/slave program.
*Not sure if you are using byte/short data formats at all, but there is a pretty big performance hit for loading anything non- 32 bit from local memory.
*The A9 ARM core is significantly more advanced than the Epiphany cores, so I suppose we shouldn't be too surprised. Still disappointed though.
On straight up floating point "filter type code", the results should be closer.
*Is the Parallella time including data transfer time to and from the Epiphany?
Thanks,
Andreas