
Re: IEEEP1788

To: David Lester <dlester@xxxxxxxxxxxx>, Vincent Lefevre <vincent@xxxxxxxxxx>
Subject: Re: IEEEP1788
From: Ulrich Kulisch <ulrich.kulisch@xxxxxxx>
Date: Sat, 2 May 2015 10:51:40 +0200
Cc: stds-1788 <stds-1788@xxxxxxxxxxxxxxxxx>

On 1 May 2015, at 10:48, David Lester wrote:

On 30 Apr 2015, at 21:42, Vincent Lefevre <vincent@xxxxxxxxxx> wrote:

On 2015-04-30 17:49:51 +0200, Ulrich Kulisch wrote:

It computes the exact dot product totally on chip without any memory
involvement.

You need memory on chip for the long accumulator. At least one for
each core.

Just so.

My suggestion to Ulrich is the following approach (which we use in SpiNNaker/Human Brain Project):

Use a minimal processor, attach a small amount of instruction memory (16-32K), and data memory (32-64K) in a Harvard configuration (separate instruction/data paths). That’s your processing node.

Yes, that comes very close to what I claimed. With a small amount of register memory you can compute all dot products exactly and at extreme speed, just by avoiding the memory bottleneck for intermediate results. One needs about 600 bits for IEEE 754 short. We needed about 1000 bits for the /370 format long in the IBM product ACRITH-XSC, and about 2000 bits should suffice for 754 double.

The exact dot product is a key operation for verified computing with high accuracy.
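
To illustrate the idea, here is a minimal software sketch of an exact dot product. It is not the hardware design discussed in this thread: a Python arbitrary-precision integer stands in for the fixed-width on-chip register, the inputs are assumed to be finite binary64 values, and the names `exact_dot`, `naive_dot` and `SCALE_BITS` are hypothetical. Every product is added without rounding into one fixed-point accumulator, and only the final result is rounded.

```python
from fractions import Fraction

# Assumption: inputs are finite binary64 floats. The smallest positive
# binary64 value is 2**-1074, so every product of two such floats is an
# integer multiple of 2**-2148.
SCALE_BITS = 2 * 1074


def exact_dot(xs, ys):
    """Dot product with exact accumulation and one rounding at the end."""
    acc = 0  # fixed-point accumulator, in units of 2**-SCALE_BITS
    for x, y in zip(xs, ys):
        nx, dx = x.as_integer_ratio()  # x == nx / dx, dx a power of two
        ny, dy = y.as_integer_ratio()
        # Align the exact product nx*ny / (dx*dy) to the accumulator scale.
        shift = SCALE_BITS - (dx.bit_length() - 1) - (dy.bit_length() - 1)
        acc += (nx * ny) << shift
    # Single rounding: convert the exact value to the nearest float.
    return float(Fraction(acc, 1 << SCALE_BITS))


def naive_dot(xs, ys):
    """Ordinary floating-point loop, rounding after every operation."""
    total = 0.0
    for x, y in zip(xs, ys):
        total += x * y
    return total


xs = [1e16, 1.0, -1e16]
ys = [1.0, 1.0, 1.0]
print(naive_dot(xs, ys))  # 0.0, the 1.0 is lost to rounding
print(exact_dot(xs, ys))  # 1.0
```

The intermediate sum never leaves the accumulator, which is the software analogue of keeping it in register memory instead of writing rounded partial results back.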


Thanks for the attachments. Are the instructions available?

With best regards
Ulrich Kulisch


--
Karlsruher Institut für Technologie (KIT)
Institut für Angewandte und Numerische Mathematik
D-76128 Karlsruhe, Germany
Prof. Ulrich Kulisch
KIT Distinguished Senior Fellow

Phone: +49 721 608-42680
Fax: +49 721 608-46679
E-mail: ulrich.kulisch@xxxxxxx
www.kit.edu
www.math.kit.edu/ianm2/~kulisch/

KIT - University of the State of Baden-Württemberg
and National Research Center of the
Helmholtz Association
