Thread Links			Date Links
Thread Prev	Thread Next	Thread Index	Date Prev	Date Next	Date Index

Re: Rounded operations: test

To: stds-1788 <stds-1788@xxxxxxxxxxxxxxxxx>
Subject: Re: Rounded operations: test
From: Vincent Lefevre <vincent@xxxxxxxxxx>
Date: Fri, 15 Jul 2011 14:34:22 +0200
Delivered-to: mhonarc@xxxxxxxxxxxxxxxx
In-reply-to: <273AC613528D4468A9272206697C0A95@KLENDATHU>
List-help: <http://listserv.ieee.org/cgi-bin/wa?LIST=STDS-1788>, <mailto:LISTSERV@LISTSERV.IEEE.ORG?body=INFO%20STDS-1788>
List-owner: <mailto:STDS-1788-request@LISTSERV.IEEE.ORG>
List-subscribe: <mailto:STDS-1788-subscribe-request@LISTSERV.IEEE.ORG>
List-unsubscribe: <mailto:STDS-1788-unsubscribe-request@LISTSERV.IEEE.ORG>
Mail-followup-to: stds-1788 <stds-1788@xxxxxxxxxxxxxxxxx>
References: <4E16AC1B.1090006@xxxxxxxxxxxxxx> <CAA9By-3R3uep1PVFZ5sLU-V2La7KjZ5jBnr5pJzxx3=3RCc-eQ@xxxxxxxxxxxxxx> <4E19AB1A.1060303@xxxxxxxxxxxx> <51A39D6FA3034F33A6F2FD0237CF2C59@KLENDATHU> <4E19E5DA.3020703@xxxxxxxxxxxx> <273AC613528D4468A9272206697C0A95@KLENDATHU>
Sender: stds-1788@xxxxxxxx
User-agent: Mutt/1.5.21-6193-vl-r44775 (2011-07-01)

On 2011-07-10 13:05:38 -0500, Nate Hayes wrote:
> For this particular machine, it seems the first test (explicitly
> changing/restoring the rounding mode for each individual addition
> operation) was just a tiny bit slower than the "fixup" method of
> using the nextup() routine; the small performance hit in this first
> test is probably worth the extra accuracy it provides.

Concerning your nextup() test, there are many branches, and since
the code is run on the same data, I suppose that the branches can
be predicted correctly (if this is implemented that way on the
processor). In real codes, this solution might be slower.

> For further comparision, I just ran a third test which simply does
> 1 billion additions in "round to nearest" mode, i.e., I made NO
> attempt whatsoever to implement directed rounding. This test was an
> order of magnitude faster than either of the previous two.
> 
> The moral of the story seems to be what I suspect most of us already
> knew: that rounded operations can be easily emulated in software,
> but real hardware support of these operations at the opcode level of
> the processor will surely give dramatic speed improvements.

Probably. It would be interesting to run your benchmark on other
processors, in particular those supporting static (or semi-static)
rounding modes.

-- 
Vincent Lefèvre <vincent@xxxxxxxxxx> - Web: <http://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <http://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / Arénaire project (LIP, ENS-Lyon)

Follow-Ups:
- Re: Rounded operations: test
  - From: Nate Hayes

References:
- Motion 24.03: NO
  - From: Frédéric Goualard
- Re: Motion 24.03: NO
  - From: Hossam A. H. Fahmy
- Re: Motion 24.03: NO
  - From: Ralph Baker Kearfott
- Rounded operations: test
  - From: Nate Hayes
- Re: Rounded operations: test
  - From: Ralph Baker Kearfott
- Re: Rounded operations: test
  - From: Nate Hayes

Prev by Date: Re: Comments on Motion 27-A "Decorated Intervals"
Next by Date: Comments on Motion 27-A "Decorated Intervals"
Previous by thread: Re: Rounded operations: test
Next by thread: Re: Rounded operations: test
Index(es):
- Date
- Thread