Data doesn't fit into the core

I am trying to make face detection program to run using epiphany. To detect faces it uses cascade file. When loaded, it takes 173 kilobytes space (technically it can be reduced to around 99kb). How can I deal with that?
The only solution I could come up with is to store picture in shared memory (512x512 grayscale picture should fit, right?) and then partition cascade into smaller bits, so that they fit cores and each core would be responsible for it's own part. This approach will mean program will do quite a bit of extra work, but I can't come up with anything else.
So my question is: is there a way to access 99kb of raw data in each of the cores relatively quickly and compare that data against picture data.
Second question: What is the best way for cores to access a 512x512 grayscale picture (or partitioned portions of it)?
EDIT:
volatile char *process_done = static_cast<char*>((void *)0x2000);
this line declares that process_done variable is to be stored inside core's local memory at specific address, right? How do I access shared memory? is this the correct way:
volatile float *pic = static_cast<float>((void*)0x1e010000);
The only solution I could come up with is to store picture in shared memory (512x512 grayscale picture should fit, right?) and then partition cascade into smaller bits, so that they fit cores and each core would be responsible for it's own part. This approach will mean program will do quite a bit of extra work, but I can't come up with anything else.
So my question is: is there a way to access 99kb of raw data in each of the cores relatively quickly and compare that data against picture data.
Second question: What is the best way for cores to access a 512x512 grayscale picture (or partitioned portions of it)?
EDIT:
volatile char *process_done = static_cast<char*>((void *)0x2000);
this line declares that process_done variable is to be stored inside core's local memory at specific address, right? How do I access shared memory? is this the correct way:
volatile float *pic = static_cast<float>((void*)0x1e010000);