Parallella Community

by **eleitl** » Mon May 26, 2014 12:38 pm

I'm thinking about writing a neural spike code that would distribute neurons/synapses spatially over the processor nodes.

I understand this interconnect is both high-throughput and low-latency. Is there any point in collecting remote writes to a specific node and processing them in a batch? Or are these just atomic memory writes with little overhead, so that one could live with implementing this naively?

by **aolofsson** » Mon May 26, 2014 2:00 pm

For on-chip write communication, the best approach will depend on the application. If you coalesce writes, then you can use the DMA more effectively. Still, I would always start with native writes before optimizing. For off-chip DRAM access, always use DMA for performance-critical transfers.
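
To make the two options concrete, here is a minimal host-side sketch in plain C. It is an illustration only, not code from this thread or from the Epiphany SDK. The names `spike_t`, `inbox_t`, `outbox_t`, `send_naive`, `send_coalesced`, and `flush_node`, and the sizes `BATCH_SIZE` and `INBOX_SIZE`, are all hypothetical. The `inboxes` array stands in for each core's local memory, and the `memcpy` in `flush_node` stands in for a single DMA transfer (on the real chip that would presumably be a call from the SDK's e-lib, such as `e_dma_copy`, to a global address).

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define NUM_NODES  16  /* assumption: 4x4 mesh of cores */
#define BATCH_SIZE 8   /* hypothetical: spikes buffered per destination */
#define INBOX_SIZE 64  /* hypothetical: inbox capacity per node */

typedef struct {
    uint16_t neuron_id;
    uint16_t timestamp;
} spike_t;

/* Stand-in for the receive buffer in each node's local memory. */
typedef struct {
    spike_t slots[INBOX_SIZE];
    size_t  count;
} inbox_t;

static inbox_t inboxes[NUM_NODES];

/* Sender-side staging area: one small batch per destination node. */
typedef struct {
    spike_t pending[NUM_NODES][BATCH_SIZE];
    size_t  fill[NUM_NODES];
} outbox_t;

/* Naive approach: every spike is written to the destination immediately. */
static int send_naive(unsigned dst, spike_t s)
{
    if (dst >= NUM_NODES || inboxes[dst].count >= INBOX_SIZE)
        return -1;
    inboxes[dst].slots[inboxes[dst].count++] = s;
    return 0;
}

/* Push the whole pending batch for one destination in a single block copy.
 * The memcpy is a placeholder for one DMA transfer. */
static int flush_node(outbox_t *out, unsigned dst)
{
    if (dst >= NUM_NODES)
        return -1;
    size_t n = out->fill[dst];
    inbox_t *in = &inboxes[dst];
    if (n == 0)
        return 0;
    if (in->count + n > INBOX_SIZE)
        return -1;  /* not enough room; batch stays pending */
    memcpy(&in->slots[in->count], out->pending[dst], n * sizeof(spike_t));
    in->count += n;
    out->fill[dst] = 0;
    return 0;
}

/* Coalesced approach: buffer locally, flush only when the batch is full. */
static int send_coalesced(outbox_t *out, unsigned dst, spike_t s)
{
    if (dst >= NUM_NODES)
        return -1;
    if (out->fill[dst] == BATCH_SIZE && flush_node(out, dst) != 0)
        return -1;  /* batch full and inbox full: caller must retry */
    out->pending[dst][out->fill[dst]++] = s;
    return 0;
}
```

A sender would call `send_coalesced` for each spike and then `flush_node` for every destination at the end of a simulation time step, so partially filled batches are not left behind. The sketch is single-threaded; on real hardware, several cores appending to the same inbox would need separate per-sender regions or some other form of synchronization, which is part of why starting with the naive version and measuring first is the simpler path.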

by **timpart** » Mon May 26, 2014 2:00 pm

coalescing remote writes
