
Re: Motion P1788/M0009.01_ExactDotProduct

To: rbk@xxxxxxxxxxxx
Subject: Re: Motion P1788/M0009.01_ExactDotProduct
From: James Demmel <demmel@xxxxxxxxxxxxxxx>
Date: Fri, 30 Oct 2009 10:29:32 -0700
Cc: Michael Schulte <schulte@xxxxxxxxxxxxx>, "'stds-1788@xxxxxxxx'" <stds-1788@xxxxxxxx>
Delivered-to: mhonarc@xxxxxxxxxxxxxxxx
In-reply-to: <4AEB1591.7080902@xxxxxxxxxxxx>
References: <C7104FC8.2877%George.Corliss@xxxxxxxxxxxxx> <4AEB012A.1020509@xxxxxxxxxxxx> <003901ca5977$1f06cc80$5d146580$@wisc.edu> <4AEB1591.7080902@xxxxxxxxxxxx>
Sender: stds-1788@xxxxxxxx
User-agent: Thunderbird 1.5.0.14 (Windows/20071210)

There are a number of papers on accurate summations and dot products in
software (the most recent paper I know of is by Zhu and Hayes in SISC, 2009),
exhibiting a big design space with lots of tradeoffs in space, time and
accuracy (correctly rounded vs faithful vs at most a few ulps of error).
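
To make the accuracy classes concrete, here is a minimal Python sketch that measures how far a computed dot product is from the exact one, in ulps of the correctly rounded result. It is only an illustration, not taken from any of the papers mentioned: the names naive_dot, reference_dot and error_in_ulps are hypothetical, the exact reference is obtained with Python's rational arithmetic, and math.ulp assumes Python 3.9 or later.

```python
import math
from fractions import Fraction

def naive_dot(x, y):
    """Plain floating-point loop: one rounding per multiply and per add."""
    s = 0.0
    for a, b in zip(x, y):
        s += a * b
    return s

def exact_fraction(x, y):
    """Exact dot product as a rational number (every float is a rational)."""
    return sum(Fraction(a) * Fraction(b) for a, b in zip(x, y))

def reference_dot(x, y):
    """Exact dot product, rounded once to the nearest double."""
    return float(exact_fraction(x, y))

def error_in_ulps(computed, x, y):
    """Distance of `computed` from the exact value, in ulps of the reference."""
    exact = exact_fraction(x, y)
    ulp = Fraction(math.ulp(float(exact)))
    return abs(Fraction(computed) - exact) / ulp

x = [1e16, 1.0, -1e16]
y = [1.0, 1.0, 1.0]
print(naive_dot(x, y))                    # cancellation loses the 1.0
print(reference_dot(x, y))                # correctly rounded result
print(error_in_ulps(naive_dot(x, y), x, y))
```

On this input the naive loop returns 0.0 while the exact answer is 1.0, so the error is far beyond "a few ulps". A faithful algorithm would have to return one of the two doubles adjacent to the exact value, and a correctly rounded one the nearest.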

Though I have not written this down formally, I think that an algorithm
that is correct for any underlying number of mantissa and exponent bits
must in effect do as much work as sorting (by the exponents). This can be
done in a big hardware register (basically bucket sort) or by distillation
(bucket sort and merge sort have been proposed) or by explicit sorting
(as my algorithm with Hida).
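
The "big register" option can be sketched in software as follows. This is a hedged illustration only, assuming IEEE 754 binary64 inputs that are finite: a Python arbitrary-precision integer stands in for the wide fixed-point register, and each exact product is shifted into place according to its exponent, which is where the bucket-sort flavour comes from. The names decompose, exact_dot and SHIFT are hypothetical and do not come from any proposal in this thread.

```python
import math

# Assumption: binary64. The smallest subnormal is 2**-1074, which frexp
# reports as 0.5 * 2**-1073, so decompose() never returns an exponent
# below -1073 - 53 = -1126. A product of two such values therefore has
# an exponent of at least -2252, and SHIFT makes every shift non-negative.
SHIFT = 2252

def decompose(x):
    """Return integers (m, e) with x == m * 2**e exactly (finite x only)."""
    frac, exp = math.frexp(x)            # x == frac * 2**exp, 0.5 <= |frac| < 1
    return int(math.ldexp(frac, 53)), exp - 53

def exact_dot(x, y):
    """Accumulate all products exactly in one wide fixed-point integer."""
    acc = 0                               # represents acc * 2**-SHIFT
    for a, b in zip(x, y):
        ma, ea = decompose(a)
        mb, eb = decompose(b)
        # The product is exact; the shift places it by exponent.
        acc += (ma * mb) << (ea + eb + SHIFT)
    # The only rounding happens here, when converting back to a double.
    return acc / (1 << SHIFT)

print(exact_dot([1e16, 1.0, -1e16], [1.0, 1.0, 1.0]))
```

Infinities and NaNs are not handled, and a result too large for a double raises OverflowError in the final division. The point of the sketch is only that no information is lost until the single conversion at the end.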

Hopefully any statement of the standard will not discourage implementors
from continuing to explore this space.

Jim Demmel


Ralph Baker Kearfott wrote:

Michael et al,

I'd like to remind people that Ulrich and others have argued in the
past that accurate dot product should be implemented in hardware because
it is otherwise too slow.  On the other hand, for what it's worth,
Rump, Oishi et al have developed "almost accurate" dot product algorithms
that can be implemented efficiently in current IEEE-754 conforming hardware.
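
One well-known algorithm of this kind is Dot2 from Ogita, Rump and Oishi, which uses only ordinary IEEE 754 additions and multiplications to produce a result about as accurate as if the dot product had been computed in twice the working precision. Whether this is the particular algorithm meant above is an assumption. The Python sketch below assumes binary64 arithmetic with round-to-nearest and no overflow or underflow in the intermediate steps.

```python
def two_sum(a, b):
    """Error-free addition: returns (s, e) with s = fl(a + b), s + e = a + b."""
    s = a + b
    z = s - a
    e = (a - (s - z)) + (b - z)
    return s, e

def split(a):
    """Veltkamp splitting of a double into two non-overlapping halves."""
    c = 134217729.0 * a                   # factor is 2**27 + 1
    hi = c - (c - a)
    return hi, a - hi

def two_product(a, b):
    """Error-free multiplication: returns (p, e) with p = fl(a*b), p + e = a*b."""
    p = a * b
    a1, a2 = split(a)
    b1, b2 = split(b)
    e = a2 * b2 - (((p - a1 * b1) - a2 * b1) - a1 * b2)
    return p, e

def dot2(x, y):
    """Dot product with rounding errors accumulated in a second term."""
    p, s = 0.0, 0.0
    for a, b in zip(x, y):
        h, r = two_product(a, b)          # exact product as h + r
        p, q = two_sum(p, h)              # exact sum as p + q
        s += q + r                        # collect the error terms
    return p + s

print(dot2([1e16, 1.0, -1e16], [1.0, 1.0, 1.0]))
```

Unlike an exact accumulator, this is not guaranteed to be correctly rounded for arbitrarily ill-conditioned inputs, which is the sense in which such algorithms are "almost accurate".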

I'm not sure how all this should affect whether or not we require an
accurate dot product, but it's relevant.

Baker

Michael Schulte wrote:
George,
.
.
.
However, since std-1788 does not require that everything be implemented in
hardware, I think we should include exact dot products in the standard and
then people can decide if they want to implement it in hardware or
software. My impression is that a software implementation of exact dot
product would not be prohibitive (except possibly in some very
cost-constrained embedded devices).

Best regards,
Mike
