Thread Links Date Links
Thread Prev Thread Next Thread Index Date Prev Date Next Date Index

Re: IEEEP1788



Am 01.05.2015 um 10:48 schrieb David Lester:
On 30 Apr 2015, at 21:42, Vincent Lefevre<vincent@xxxxxxxxxx>  wrote:

On 2015-04-30 17:49:51 +0200, Ulrich Kulisch wrote:
It computes the exact dot product totally on chip without any memory
involvement.
You need memory on chip for the long accumulator. At least one for
each core.
Just so.

My suggestion to Ulrich is the following approach (which we use in SpiNNaker/Human Brain Project):

Use a minimal processor, attach a small amount of instruction memory (16-32K), and data memory (32-64K) in a Harvard configuration (separate instruction/data paths). That’s your processing node.


Yes, that is coming very close to what I claimed. With a small amount of register memory you can compute all dot products exactly and at extreme speed just by avoiding the memory bottle-neck for intermediate results.. One needs about 600 bit for IEEE 754 short. We needed about 1000 bit for the /370 format long in the IBM product ACRITH-XSC and about 2000 bit should suffice for 754 double.

The exact dot product is a key operation for verified computing with high accuracy.

Thanks for the attachments. Are the instructions available?

With best regards
Ulrich Kulisch


--
Karlsruher Institut für Technologie (KIT)
Institut für Angewandte und Numerische Mathematik
D-76128 Karlsruhe, Germany
Prof. Ulrich Kulisch
KIT Distinguished Senior Fellow

Telefon: +49 721 608-42680
Fax: +49 721 608-46679
E-Mail:ulrich.kulisch@xxxxxxx
www.kit.edu
www.math.kit.edu/ianm2/~kulisch/

KIT - Universität des Landes Baden-Württemberg
und nationales Großforschungszentrum in der
Helmholtz-Gesellschaft