Thread Links | Date Links | ||||
---|---|---|---|---|---|
Thread Prev | Thread Next | Thread Index | Date Prev | Date Next | Date Index |
Ulrich, If I understand what you have previously written, what you envisage is that an EDP for an arbitrary length pair of vectors will be executed to completion without interrupts. This makes the execution of real-time devices impossible. It also messes up the OS rendering of the screen on your desk-top/lap-top. So, what I think you want is that there is the possibility of interrupts after each pair of float/double data reads. In which case, for a general purpose supercomputer, the long accumulator needs to be flush-able so that the Operating System scheduler can schedule someone else’s EDP-ridden program. And so forth. And because there will always be only a limited number of long accumulators on a processor, this will inevitably — in the worst case — cause flushing to main-memory. A typical minimal modern processor (ARM6) has only 35,000 transistors. Each bit of SRAM (your long accumulator) has 6 transistors. Thus just six of your long accumulators will _double_ the size of the core’s foot-print. Personally I’d go with the idea of using all that extra SRAM in a more general way as caches or scratchpad, but that’s just me. The alternative is that we are considering a specialised piece of single-user hardware which is there only for Matrix/Vector processing. If that’s the case then by all means build it — it won’t cost much, say $0.5-1M. A sort of bolt-on hardware accelerator, as it were. But I’m struggling to see the usefulness of EDP in a standard for general purpose processors. Dave Lester
|