Page 1 of 1

Generation of the Single Precision BLAS

PostPosted: Fri Aug 19, 2016 1:27 am
by MiguelTasende
In case someone is interested...
Subject: a first implementation of BLAS for Parallella (with Epiphany acceleration), and Linpack benchmark. (Both with many improvements to be done)

Title: Generation of the Single Precision BLAS library for the Parallella platform, with Epiphany co-processor acceleration, using the BLIS framework

Link:

http://arxiv.org/abs/1608.05265

Re: Generation of the Single Precision BLAS

PostPosted: Sat Aug 20, 2016 2:56 am
by aolofsson
Hi Miguel,
Excellent work! If you are up for traveling, consider going to the upcoming BLIS workshop in Austin. PM me if you want an intro to Robert van de Geijn.
Thanks,
Andreas

Re: Generation of the Single Precision BLAS

PostPosted: Wed Aug 24, 2016 4:31 pm
by MiguelTasende
Good news.
The code is released under Mozilla 2.0 license.
It is not perfectly "polished" (there may be some unused files included, and you'll find many comments and variable names in Spanish, among other things), but it works (at least here... hope to hear about other people testing it).

The link to GitHub is here:

https://github.com/mtasende/BLAS_for_Parallella

IMPORTANT NOTE: Most of the use cases require running 2 processes on the Linux host. That is explained in the README.txt, but it is something different from a regular code, and easy to forget.