I defined a 48,48,34 network and it worked well, the predict process will cost ~0.15s(forka executing time), but in ARM a same topology network based on opencv only need ~0.02s. I changed the cl code to make the "forwardPass" run 2 to 100 times in "k_forward", the forka time only increased ~0.005s*n. Is that means the forward predict process only cost ~0.005s in Epiphany, and the data movement cost most of the other time? If that is true, do you think there is any way to reduce the time cost?Statistics: Posted by leonfg — Fri Mar 25, 2016 5:24 am
]]>