- Code: Select all
`for (n = 0; n < TIMES; n++){`

//Clear Sum

(*(c))=0x0;

//Sum of product calculation

for (i = 0; i < N/CORES; i++){

(*(c)) += a[i] * b[i];

}

}

Then I did the same on the ARM:

- Code: Select all
`for (n = 0; n < TIMES; n++){`

// printf("j= %d\n", j);

//Clear Sum

sop = 0;

//Sum of product calculation

for (i = 0; i < N; i++){

sop += a[i] * b[i];

}

}

For TIMES=100,000 and N=4096, the eCore takes 11 seconds and the ARM takes 19 seconds.

Can anyone explain why the eCore is faster at this benchmark?