Thread Links			Date Links
Thread Prev	Thread Next	Thread Index	Date Prev	Date Next	Date Index

A proposal for the next motion

To: stds-1788@xxxxxxxxxxxxxxxxx
Subject: A proposal for the next motion
From: John Pryce <j.d.pryce@xxxxxxxxxxxx>
Date: Wed, 13 May 2009 10:03:05 +0100
Delivered-to: mhonarc@xxxxxxxxxxxxxxxx
List-help: <http://listserv.ieee.org/cgi-bin/wa?LIST=STDS-1788>, <mailto:LISTSERV@LISTSERV.IEEE.ORG?body=INFO%20STDS-1788>
List-owner: <mailto:STDS-1788-request@LISTSERV.IEEE.ORG>
List-subscribe: <mailto:STDS-1788-subscribe-request@LISTSERV.IEEE.ORG>
List-unsubscribe: <mailto:STDS-1788-unsubscribe-request@LISTSERV.IEEE.ORG>
Sender: stds-1788@xxxxxxxx

P1788 members

I propose the following motion. Like Motion 3, it is not aboutstructure and procedure of the standard, but about content. I aim inthe next few weeks to turn the text below into a LaTeX document thatcan be submitted for inclusion in the standard text.

Note to the officers: this is significantly changed from the previousdraft. I have responded to various comments, and have tightened upvarious definitions.


John Pryce
====================================

===Motion P1788/M0005.01_Level_2_Mixed_format===
Proposer: John Pryce
Seconder: Dan Zuras

===Motion text===

P1788 shall support mixed-format interval arithmetic (Definition D2)as specified in 1 to 11 of the Rationale below. Specifically:

(1) An implementation (of this standard) shall specify which intervalformats (Definition D1) it supports, and shall provide mixed-formatarithmetic for all supported interval formats.(2) An implementation shall be called 754-conforming if: (i) itsunderlying system is 754-conforming (Definition D3) and if theimplementation supports intervals of each of the five basic 754floating point formats: binary32, binary64, binary128, decimal64, anddecimal128; or (ii) it is functionally indistinguishable from case (i).(3) The relation between a floating point format and its intervalformat shall be as defined in 7 and 8 below.(4) There shall be conversions between any two supported intervalformats, as defined in 9a below.(5) An implementation shall specify a set of "interval-supportedelementary functions" (Definition D7). For any such function and anysupported format F there shall be an interval version of thatfunction in that format, which shall enclose the sharp interval F-extension of that function as defined in 9b below, but is otherwisenot specified by this motion.This motion also leaves it open whether every point elementaryfunction (Definition D3) is interval-supported.(6) A 754-conforming implementation may provide other intervalformats beside the basic five, and appropriate conversions andoperations: for instance, in variable-precision arithmetic.(7) An implementation may provide more than one interval elementaryfunction in a given format: for instance, to offer different trade-offs between speed and tightness.

Constraint: Items (3,4,5) above ensure that the Basic Theorem ofInterval Arithmetic (BTIA) holds for mixed-format calculation in anyconforming implementation, as outlined in 10, 11 below. The BTIA isfundamental to interval computation; therefore, an amendment to thismotion is only acceptable if it preserves the BTIA.


This motion does not cover:

- What elementary functions shall be, or are recommended to be,provided by

  an implementation.

- Issues of the construction of intervals, from real numbers, textstrings, etc.- Issues of definition of real-valued functions of intervals, such aswhetherfinding the midpoint of the whole line should return NaN, orgenerate an

  exception, etc.

- Issues of tightness of enclosure, as in the 'accuracy modes' ofVienna§3.2.- Flags, exceptions or other means of signalling that an elementaryfunction

  has been evaluated outside its domain.

===Rationale===
===============

This divides into Foundations; Terminology; Definitions;Introduction; Level 1 description; and Level 2 description.


Foundations.
------------

Motion 2 and the associated position paper, defining a levelsstructure as a guideline for the standard.


Motion 3, which defines P1788's number system and set of intervals.

Terminology.
------------

754§X denotes section X in the standard IEEE754-2008. Vienna§X standsfor section X in the final version of the Vienna proposal, 19December 2008.


Definitions.
------------

[Note. These are intended to be part of a standard set ofdefinitions, abbreviations and acronyms of this standard, similar to754§2. I have not yet put them in any systematic order.]

D1. An "infsup interval format" means the set IR_F of F-intervals forsome floating point (abstract) format F, as defined in 7 below. Whenused without qualification, "interval format" means an infsup format-- by contrast with, say, a midrad (midpoint-radius) format.

D2. "Mixed-format" interval arithmetic refers to operations onintervals where the formats of the operand interval(s) and thedestination interval may not all be the same.Note. It is implied by the theory below that the formats of the lowerand upper bounds of an individual interval are the same.

D3. A "754-conforming system" is a programming environment in whichthe default floating point arithmetic (whether implemented inhardware, software or a mixture) conforms to IEEE754-2008.[Note. A hardware processor (e.g. the Cell processor) may not be754-conforming in itself, but a 754-conforming system may be built ontop of it with software assistance.]

D4. A "point elementary function" is a point function, in the senseof 4 below, that is provided by an implementation. The set of theseis the implementation's "point elementary function library" ("pointlibrary" for short). These terms may be qualified by a format, e.g."binary64 point library".[Note. The four basic arithmetic operations count as elementaryfunctions.]

D5. An "interval elementary function" and the "interval elementaryfunction library" ("interval library" for short), possibly qualifiedby a format, are defined similarly.

D6. An interval elementary function is "algebraic" if it is aninterval extension of a point function. See 5 below.

D7. A point function is "interval-supported" if an interval extensionof it exists in the interval library.[Note. An interval-supported function need not be in the pointlibrary. A function in the point library need not be interval-supported. It is left open whether the standard makes requirements onthese issues.]


Introduction.
-------------

The theory below aims to provide a simple foundation for intervalcomputation in mixed formats, for instance mixing binary and decimalintervals in arithmetic expressions, as well as potential compile-time "exact denotations" of intervals such as pi + [-0.1,0.1].

We believe the concepts given here, though general, are easilygrasped and easily taught. They show that the theoreticaldifficulties of mixed-format interval computation are none, and thepractical difficulties -- illustrated by the example in 10 below --are minor.

The notation used is similar to that of the Vienna proposal, which issuitable for a plain-text document such as this. I believe thefunctionality defined below is essentially what the Vienna proposaldefines, or rather what it would define if it supported mixed-formatcomputation explicitly: see the end of Vienna§1.1.

I have avoided the Vienna terms "L-numeral" and "B-numeral", whichare valuable notions but place an emphasis on representations that Ithink is unnecessary in this context.

I believe there is nothing in this motion and the theory below thathinders the implementation of various forms of non-standard intervals-- Kahan, modal, etc. -- as discussed at the end of Vienna§1.2.


Level 1 description.
--------------------
This section recapitulates what has already been decided.

1. R is the set of reals. R* is the set of extended reals
    R* = R union {-oo, +oo},
  where we use -oo and +oo to denote negative and positive infinity.

Following 754, any member of R* is called a number: it is a"finite number"

  if it is in R, else an "infinite number".

2. Following Motion 3, the set of "textbook" intervals (Vienna§1.2)in R,

  denoted IR, comprises the empty set together with all intervals
    x = { x in R | xlo <= x <= xhi }               (*)
  where xlo <= xhi, -oo <= xlo < +oo, -oo < xhi <= +oo.

  Following standard mathematical notation, the interval (*) is denoted

[xlo, xhi] if xlo and xhi are finite, but denoted (-oo, xhi] or[xlo, +oo)

  or (-oo, +oo) in the case of infinite endpoints, to indicate that -oo
  and/or +oo is not a member of the set.

The empty set is denoted by Empty. The whole of R is denoted byEntire.

3. The (interval) hull of an arbitrary subset ss of R, written hull(ss), is the

  smallest member of IR that contains ss.

4. A "point function" is a (possibly partial) multivariate realfunction: that is,a mapping f from a subset D of R^n to R^m for some integers n,m >=0. (When n=0

  and m=1, we have a "named real constant".)

When not otherwise specified, a scalar function is assumed, i.e.m=1. If m>1,

  the function is called a vector function.

  D = D_f is the domain of f.

  To specify n, call f an "n-variable point function", or denote f as
    f(x_1, ..., x_n).

The "range" (or "exact range") of f over an arbitrary subset ss ofR^n is the

  set
      range(f, ss) = {f(x) | x in ss and x in D_f}.

Note. Here, f is a mapping, not an expression. Expressions aredealt with in

  11 below.

5. Unless otherwise specified, an "interval function" is a mapping fffrom(IR)^n to (IR)^m for some n >= 0. To specify n, call ff an "n-variable

  interval function", or denote ff as
    ff(xx_1, ..., xx_n).
  As with point functions, m=1 is assumed unless specified otherwise.

An interval function is "algebraic" if it is an interval extensionof somepoint function, as defined next. Examples of non-algebraicfunctions are

  the interval hull and intersection functions.

Given an n-variable point function f, an "interval extension" of fis any

  interval function ff such that
    ff(ss) contains range(f,ss)  for any subset ss of R^n.
  The "sharp interval extension" of f is defined by
    ff(ss) = hull(range(f,ss))   for any subset ss of R^n.

Equivalently, for the case where ff has separate argumentsss_1, ..., ss_n,

  each being a subset of R, an interval extension satisfies
    ff(ss) contains range(f,ss_1, ..., ss_n),
  and the sharp interval extension satisfies
    ff(ss_1, ..., ss_n) = hull(range(f,ss_1, ..., ss_n)),
  for any ss_1, ..., ss_n.

This is an alternative notation for the case where ss is thecartesian

  product of the ss_i.

In particular, when f is a binary operator "op" written in infixnotation,this gives the familiar definition of its (sharp) intervalextension asxx op yy = hull({x op y | x in xx, y in yy, and x op y isdefined}).


  Notes.
  - Example. With these definitions
      xx*{0} = {0} and xx/{0} = Empty for any nonempty interval xx.
  - All interval functions used here are automatically defined for all

arguments (e.g. sqrt([-1,4]) = [0,2], sqrt([-2,-1]) = Empty). Itis left

    open what exceptional action may be taken by an implementation on

evaluating ff(ss), when ff is an extension of a point functionf, and

    ss is not a subset of D_f.

Level 2 description.
--------------------

6. A floating point "abstract format" ("aformat") F is a finitesubset of R*

  such that at least -oo and +oo are elements of F.

<754>A format in the 754 sense shall be identified with theaformat comprisingthose extended-real numbers that are exactly representable in thatformat,

  where -0 and +0 both represent 0.

  Notes.

- In view of this definition, 754's -0 and +0 are consideredidentical. Also,in 754 decimal formats, numbers in the same cohort areconsidered identical.- For examples, we use the abbreviations b64 to mean 754's 64-bitbinaryformat (C's "double"), and d64 for 64-bit decimal. Similarly b32is 32-bit

    binary (C's "float").
  </754>

7. Given an aformat F, the set of F-intervals in R, denoted IR_F,comprises theempty set together with all textbook intervals whose endpoints arein F.


  Notes.
  - This definition implies some form of infsup, that is (lower bound,

upper bound) representation of intervals. Variants of this exist,see for

  instance Vienna§1.6.

- It also implies that it is not just wrong, but actuallymeaningless atthis level, to speak of an interval that has NaN as an endpoint orhas its

  lower bound greater than its upper bound.

An "F-interval datum", by analogy with the definition of floating-point datumin 754§3.2, is either an F-interval or possibly, should P1788support it,

  an abstract object NaI, "Not an Interval".

  An "interval datum" is an F-interval datum of any relevant aformat F.

At this level we consider there is just one NaI for all aformats.(At the

  representation levels this would probably not be the case.)

8. The (interval) F-hull of an arbitrary subset ss of R, writtenhull_F(ss), is

  the smallest F-interval that contains ss.

  Notes.

- The set hull_F(ss) always exists, because F is finite andcontains -oo

    and +oo.

- Always, hull_F(ss) contains hull(ss). If aformat G containsaformat Fin the sense of subsets of R* -- equivalently in the 754 case,if formatG is wider than F in the sense of 754§2.1.36 -- then hull_F(ss)contains

    hull_G(ss).

9. We now give a mathematical characterisation of an ideal "mixed format

interval arithmetic". To illustrate how such mixed formatarithmetic maybe implemented in practice, we present examples assuming a 754-conformingsystem, in which "format" shall mean a 754 supported format(754§2.1.52).

  Text that assumes this is enclosed between the tags
    <754>  and  </754>.

It is assumed that the compiler (or other relevant processor)knows the

  radix of any format.

Relevant features of 754 are that floating point computationaloperations(754§2.1.11) are regarded as having a specified destinationformat. Further,many operations are "formatOf" operations (754§5.1), for which acorrectlyrounded result (in a chosen rounding direction) is produced forany operands

  _of the same radix_ as the destination format.

9a. Interval format conversion.

If F is an aformat, the "conversion" to format F shall mean theoperation

  that maps an interval xx of any other supported format to its F-hull,
      yy = hull_F(xx).

<754>This interval operation can in all cases (whether xx has thesame radixas F or not) be implemented in terms of one of the floating pointoperations

      formatOf-convertFormat
  defined in 754§5.4.2, with the appropriate outward rounding.
  </754>

9b. Mixed-format interval elementary operations.
  Given an aformat F and an n-variable point function f, an "interval

F-extension" of f is any interval extension of f whose values arealways

  F-intervals. The "sharp interval F-extension" ff_F of f is defined by
      ff_F(ss) = hull_F(range(f,ss))    for any subset ss of R^n.
  For the case where ss is the cartesian product of ss_1, ..., ss_n,
  each ss_i being a subset of R, this is written as
      ff_F(ss_1, ..., ss_n) = hull_F(range(f,ss_1, ..., ss_n)).

  If xx_1, ..., xx_n are interval datums, of arbitrary formats, then
      ff_F(xx_1, ..., xx_n)             (1)

is given by the previous equation with the xx_i substituted forthe ss_i, if

  all the xx_i are intervals (and therefore are subsets of R).

If not -- that is if at least one argument is NaI -- then, bydefinition, the

  result is NaI. The NaI case will not be considered further.

When f is an operator written in infix form, a correspondingnotation is used,

  e.g.
  <754>
      zz = xx +_b64 yy

means that the intervals xx and yy, of arbitrary formats, arecombined by themathematical interval addition operation to give an intermediateresult

      xx + yy,

that is enclosed in the smallest b64 (double-precision) intervalto give

  the delivered result zz.
  </754>

10. <754>Suppose f is a "formatOf" elementary floating pointoperation. When allthe operands xx_i have a format with the same radix as F, then theintervalresult (1) can be evaluated in terms of the "formatOf" operation fwith theappropriate outward rounding. (This is clear for the four basicoperations

  and SQRT; it may be less obvious for transcendental functions.)

If some xx_i have a different radix from that of F, formatconversions asdefined in 9a are necessary and the situation becomes morecomplicated. As anexample, consider forming the product of two d64 (decimal64)intervals xx andyy, to give a b32 (binary32) result zz. We may convert xx and yyto b32 first,

  which may be written:
      zz1 = hull_b32(xx) *_b32 hull_b32(yy)
  or may form the product in d64 and then convert:
      zz2 = hull_b32(xx *_d64 yy).

Example. Let xx and yy be the singleton intervals [x,x] and [y,y]wherex = 5^16 and y = 5^(-16), both of which are exactly represented ind64. When

  the b32 product is formed in each of the two ways above we find
      zz1 = [3f7ffffd, 3f800001]  in hexadecimal
  (note the number 1 is 3f800000 hex), whereas
      zz2 = [1,1]  exactly.
  Thus zz1 is too wide by 3 ulps at its lower bound.
  </754>

It is probably not the role of P1788 to say which of thesepatterns of radix

  conversion, or something else, should be done in practice.

What is important in the example above is that each of zz1 and zz2enclosesthe mathematical (and impractical to compute) result of the sharpinterval

  b32-extension of the multiplication function as defined in 9b:
    zz = xx *_b32 yy.

For zz1 this is because hull_b32(xx) and hull_b32(yy) enclose xxand yyrespectively. For zz2 it is because xx *_d64 yy encloses themathematical

  result xx * yy.

The property being used here is that for any F, the sharp F-extension isinclusion-isotonic. This argument is completely general (whetherthe system

  is 754-conforming or not) and underlies the next item.

11. Interval evaluation of expressions.

Let E be an explicit real expression in arguments x_1, ..., x_n.It can be

  written as a sequence of steps
      v = e(u_1, ..., u_k)              (2)

where e is a point elementary function, and each u_i is either oneof theinputs x_j or a previously calculated v. One (or for a vectorexpression,more) of the v's is nominated as the output of E. This defines apoint

  function from R^n to R^m:
      (y_1, ..., y_m) = y = f(x) = f(x_1, ..., x_n).
  The (natural) domain D_f of f is the set of points x in R^n for which

evaluation succeeds, in the sense that for each step (2), theargument

  (u_1, ..., u_k) lies in the domain of e.

  Let an interval aformat be specified for each of the input arguments

x_1, ..., x_n. For each e in the sequence of steps (2), let aninterval

  aformat be specified as its destination format. Then, given arbitrary

interval inputs xx_1, ..., xx_n of the specified formats, the"interval

  evaluation" of E consists of a sequence of steps
      vv = ee_F(uu_1, ..., uu_k)              (3)
  where
  *     ee_F is an interval F-extension of e.
  *     uu_j are interval results from previous steps.
  This defines a (possibly vector) interval function

(yy_1, ..., yy_m) = yy = ff(xx) = ff(xx_1, ...,xx_n) (4)where each input xx_j has the aformat specified for x_j, and eachyy_i hasthe destination aformat specified for the operation e thatproduces it.


  Note.

- This discussion only applies to interval expressions that arealgebraic,in the sense of 5 above. Expressions that contain, hull orintersectionoperations, or involve point-valued functions of intervals suchas the

    midpoint, are excluded.

It is left open here -- but has a significant influence on qualityof implem-entation -- just what interval F-extension ee_F is chosen in eachstep (3).For expressions just involving the four basic operations, and whenonly one

  format is involved, the standard may require that ee_F is the sharp

F-extension. In more complicated situations the choice of ee_F maydepend onthe compiler, the optimisation level, switches or pragmas definedby the

  standard, or other factors.

<754>At a step (3) where the input intervals have the same radixas the output,it is natural that ee_F be implemented by an interval version of a"formatOf"operation as described in item 10, but this is not necessary. At astep wheresome input has a different radix from the output, it is leftunspecified howthe needed radix conversions are done (the example in item 10illustrates

  alternatives).
  </754>

However these ee_F operations are implemented, it is easy to seethat Moore's

  Basic Theorem of Interval Arithmetic holds for any such mixed-format
  calculation:

Theorem (BTIA). For any interval inputs xx = (xx_1, ..., xx_n),the resultyy = ff(xx) of (4) encloses the exact range, range(f, xx) of thereal function

  f over xx.

Follow-Ups:
- Re: A proposal for the next motion
  - From: J. Wolff v. Gudenberg
- Re: A proposal for the next motion
  - From: Arnold Neumaier
- Re: A proposal for the next motion
  - From: Vincent Lefevre

Prev by Date: Re: A proposal for the next motion
Next by Date: Re: Motion P1788/M0004.01_Keep_to_754 NO
Previous by thread: Motion P1788/M0004.01_Keep_to_754 WITHDRAWN
Next by thread: Re: A proposal for the next motion
Index(es):
- Date
- Thread