On 8/29/2013 2:43 PM, Langou, Julien wrote:
It's a dot product, but not exact. Usually, making this code better seems to involve unrolling the loop, or somehow doing the arithmetic on a GPU (graphics processing unit). There's another stream of papers on trying to make it more accurate.

Here are some hacker's results on timing some random chips (I am not endorsing this particularly, but I found it via Google):
http://blog.theincredibleholk.org/blog/2012/12/10/optimizing-dot-product/

There's much more discussion, including the role of a limited amount of high-precision (e.g. quad) arithmetic, starting on page number 134 (in the PDF, it is page 144) of the Basic Linear Algebra Subprograms Technical Forum Standard, International Journal of High Performance Applications and Supercomputing, Volume 16, Number 1, Spring 2002. See the first item here:
http://netlib.org/blas/blast-forum/

So it is not that BLAS people ignored extended precision. I think they consider using

The document on Interval BLAS, which I had not seen, starts on page number 219 (add 10 in the PDF) and is 80 pages of fairly detailed descriptions, but a search reveals
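
To make the "not exact" point concrete, here is a minimal sketch in Python (my choice of language, not the code under discussion). It compares a plain dot product with a compensated one that carries the rounding errors along in a second double, which is one technique from the accuracy literature. The function names are made up for illustration, and this is not claimed to be what the BLAS Technical Forum document specifies. It assumes IEEE double precision and no overflow or underflow in the intermediate products.

```python
def naive_dot(x, y):
    """Plain left-to-right dot product in double precision."""
    total = 0.0
    for a, b in zip(x, y):
        total += a * b
    return total


def two_sum(a, b):
    """Return (s, e) with s = fl(a + b) and s + e == a + b exactly."""
    s = a + b
    bb = s - a
    e = (a - (s - bb)) + (b - bb)
    return s, e


def split(a):
    """Veltkamp split of a double into high and low halves."""
    c = 134217729.0 * a  # 2**27 + 1
    hi = c - (c - a)
    lo = a - hi
    return hi, lo


def two_prod(a, b):
    """Return (p, e) with p = fl(a * b) and p + e == a * b exactly."""
    p = a * b
    a_hi, a_lo = split(a)
    b_hi, b_lo = split(b)
    e = a_lo * b_lo - (((p - a_hi * b_hi) - a_lo * b_hi) - a_hi * b_lo)
    return p, e


def compensated_dot(x, y):
    """Dot product that accumulates the rounding errors separately."""
    p, s = 0.0, 0.0
    for a, b in zip(x, y):
        h, r = two_prod(a, b)   # product and its rounding error
        p, q = two_sum(p, h)    # running sum and its rounding error
        s += q + r              # collect both error terms
    return p + s


if __name__ == "__main__":
    x = [1e16, 1.0, -1e16]
    y = [1.0, 1.0, 1.0]
    print(naive_dot(x, y))        # the 1.0 is lost to cancellation
    print(compensated_dot(x, y))  # the 1.0 is recovered
```

The exact answer for that input is 1.0. The plain loop loses it because 1e16 + 1.0 rounds back to 1e16, while the compensated version keeps the lost 1.0 in the error accumulator and adds it back at the end.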