|
It was on the list to do, but it fell through the cracks...
|
|
|
|
|
I remember reading that someone had performed such an analysis, but I can't find any pointers to it.
The idea was that additions/subtractions are more common than multiplications, which in turn are much more common than divisions/square root. This implies that optimizing the less common operations is likely to give a lower return than optimizing the more common operations.
As I said, my Google-fu is non-functional today.
If you have an important point to make, don't try to be subtle or clever. Use a pile driver. Hit the point once. Then come back and hit it again. Then hit it a third time - a tremendous whack.
--Winston Churchill
|
|
|
|
|
If the less common operations are dramatically slower than the common ones, it still may be worth it to optimize them. Take a look at the speed comparisons at Integer and Floating-Point Arithmetic Speed vs Precision[^].
Consider the Core i7-4770 floating-point graph for 32-bit operations, which indicates that multiplication takes about 3 times as long as addition. If addition occurs 75% of the time and multiplication 25%, you will spend the same total time on each: 0.75 × 1 = 0.25 × 3, in units of addition time.
The decision might be influenced by which operation would be easier to optimize and which would produce the greater gain once optimized. (I see Jochen Arndt gave similar advice. This puts some numbers to it for you.)
|
|
|
|
|
Since multiplication can be done via additions, and division via subtractions and hardware shifts, it makes a lot of sense that there are more additions and subtractions than other operations. Roots can be done via smart algorithms using multiplication, division, and subtraction.
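To make that concrete, here is a minimal sketch of both reductions in C++ (my own illustration on plain unsigned integers, not any particular FPU's design; assumes d != 0 for the division):

```cpp
#include <cstdint>

// Shift-and-add multiplication: for each set bit of b, add the
// correspondingly shifted a into the result.
uint64_t mul_shift_add(uint64_t a, uint64_t b) {
    uint64_t result = 0;
    while (b != 0) {
        if (b & 1)      // low bit of b set: accumulate the current shift of a
            result += a;
        a <<= 1;        // the next bit of b weighs twice as much
        b >>= 1;
    }
    return result;
}

// Restoring division: compare/subtract and shift, one quotient bit per step.
uint32_t div_shift_sub(uint32_t n, uint32_t d) {
    uint32_t q = 0, r = 0;
    for (int i = 31; i >= 0; --i) {
        r = (r << 1) | ((n >> i) & 1);  // bring down the next dividend bit
        if (r >= d) {                   // the divisor fits: subtract it
            r -= d;
            q |= 1u << i;               // record a 1 in the quotient
        }
    }
    return q;
}
```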
|
|
|
|
|
Are you looking for exact values (measured values) or statistics?
For statistical purposes it is 25% each
Skipper: We'll fix it.
Alex: Fix it? How you gonna fix this?
Skipper: Grit, spit and a whole lotta duct tape.
|
|
|
|
|
Kornfeld Eliyahu Peter wrote: For statistical purposes it is 25% each
Actually, it isn't. A review of floating-point programs that I have written shows that addition/subtraction is more common than multiplication, and these are much more common than division/square root.
I am writing various floating-point libraries, and would like this information so I can know where to spend my optimization time.
If you have an important point to make, don't try to be subtle or clever. Use a pile driver. Hit the point once. Then come back and hit it again. Then hit it a third time - a tremendous whack.
--Winston Churchill
|
|
|
|
|
|
Looks to me like everybody is wrong. There are clearly more zeros than ones.
Each byte is packed with leading zeros. The ones are big-time losers.
QED.
|
|
|
|
|
|
Whoops! Yes, it should be the thread below.
You will understand my difficulty when you see my next thread.
|
|
|
|
|
Not quite, the thread below the thread below...
Take a step away from the keyboard...
|
|
|
|
|
That isn't a thread - it's just a single post.
Anyway, I don't use a keyboard, I just use my psychic powers to make the words appear on the screen.
|
|
|
|
|
|
I'm writing a floating-point package in C++ that provides:
- A full implementation of the binary part of the IEEE-754-2008 Standard for Floating-Point Arithmetic (single-, double- and quad-precision)
- Implementation of higher-precision formats, compatible with the Standard (up to binary1024).
I have a basic implementation written using the "standard" algorithms, and would like some idea of where to invest time on improvements. Obviously, spending a lot of time on an operation that is rarely executed is not the best use of my time...
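For a feel of what I mean, a software binary128 value might be laid out roughly like this; this is just a sketch of the Standard's parameters, not my final representation:

```cpp
#include <array>
#include <cstdint>

// One plausible layout for a software IEEE-754 binary128 value.
// Per the Standard: 1 sign bit, 15 exponent bits, and a 113-bit
// significand (112 stored bits plus the implicit leading 1 for
// normal numbers); the exponent bias is 2^(15-1) - 1 = 16383.
struct Binary128 {
    bool     negative;                    // sign bit
    uint32_t exponent;                    // biased exponent
    std::array<uint64_t, 2> significand;  // 113 significand bits in two limbs

    static constexpr int      exponent_bits = 15;
    static constexpr uint32_t bias          = 16383;
    static constexpr int      precision     = 113;  // includes the hidden bit
};
```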
If you have an important point to make, don't try to be subtle or clever. Use a pile driver. Hit the point once. Then come back and hit it again. Then hit it a third time - a tremendous whack.
--Winston Churchill
|
|
|
|
|
Daniel Pfeffer wrote: I'm writing a floating-point package in C++
That was not clear from your original question.
So I will dig in here:
I would not overthink that. All the basic operations will be used often (more or less) and should therefore be optimised as far as possible.
Because division is the slowest operation, it might be the first candidate, even though it is probably used less than the other operations. When a calculation uses divisions, optimising them would probably reduce the overall calculation time by a greater factor than optimising only addition and multiplication.
|
|
|
|
|
OK, that makes sense. Thanks.
If you have an important point to make, don't try to be subtle or clever. Use a pile driver. Hit the point once. Then come back and hit it again. Then hit it a third time - a tremendous whack.
--Winston Churchill
|
|
|
|
|
You are welcome.
It is an interesting and challenging topic.
Do you plan to publish it as an article?
|
|
|
|
|
Eventually - yes.
The code works for the few problems that I've thrown at it, but that's not good enough (see the Pentium bug...). My biggest problem is finding an appropriate test suite; most of them cost an arm and a leg, and I can't justify spending that sort of money on a hobby.
If you have an important point to make, don't try to be subtle or clever. Use a pile driver. Hit the point once. Then come back and hit it again. Then hit it a third time - a tremendous whack.
--Winston Churchill
|
|
|
|
|
The distribution of operations depends on the problem set. However, you might be able to take some general guidelines from the evolution of computers themselves. Addition/subtraction came first, with floating point units being added later. If you look at those floating point units, you'll probably see that later ones implemented more operators.
On the other hand, if you look at GPUs, they've always had floating point hardware -- those problem sets were never tractable in real time until floating point hardware existed.
As for testing, the best way I found was to look at the architecture of the hardware, and design a test that tested it. For example, the old VAX FPUs used a nibble lookup table for multiplication, so I concluded that I needed to test every pattern in that lookup table to know if the hardware was OK. That did not reliably happen by simply pounding a lot of math-happy code at the FPU -- it required a specially created dataset that could be proven to be exercising each entry in the lookup table. If your hardware doesn't use a nibble lookup table, that test would likely be useless since it might not achieve full coverage.
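In modern terms, a directed test of that kind might look roughly like this; an integer-multiply illustration of the coverage idea, not the actual VAX procedure (the VAX table operated on floating-point significands):

```cpp
#include <cstdint>
#include <cstdio>

// Software shift-and-add multiply, used as the reference oracle.
static uint64_t ref_mul(uint32_t a, uint32_t b) {
    uint64_t result = 0, aa = a;
    while (b) {
        if (b & 1) result += aa;
        aa <<= 1;
        b >>= 1;
    }
    return result;
}

// Present every (nibble, nibble) pair at every digit position by replicating
// each nibble across the word, then compare the hardware product against the
// software oracle.
int main() {
    for (uint32_t i = 0; i < 16; ++i) {
        for (uint32_t j = 0; j < 16; ++j) {
            uint32_t a = i * 0x11111111u;  // e.g. i = 5 -> 0x55555555
            uint32_t b = j * 0x11111111u;
            if (static_cast<uint64_t>(a) * b != ref_mul(a, b)) {
                std::printf("table entry (%u, %u) is bad\n", i, j);
                return 1;
            }
        }
    }
    std::puts("all 256 nibble combinations exercised");
    return 0;
}
```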
We can program with only 1's, but if all you've got are zeros, you've got nothing.
|
|
|
|
|
The most floating point math I have seen recently was in a mapping package.
It was heavily loaded with trigonometry functions, as you can imagine. I could see the optimizations for those functions varying heavily depending on the bit size (per your 1024-bit precision capability).
|
|
|
|
|
Even "real" numbers can turn out to be misleading, if the background for the figures are not completely understood. Such as: 30+ years ago I was working on a computer which had an extreme FPU (it filled about half a square meter of circuit board). It was so fast that for integer multiply and divide, the 32 bit integer value was internally converted to a 64 bit floating point value, the operation performed by the FPU, and the result converted back to integer format. So a count of FP multiply/divide operations would count integer operations as well.
Another case: At my university, the IT people running the huge mainframe (this was many years ago) attached a counter to the Divide by Zero flag, and discovered that every single day, literally tens of millions of divides by zero were performed. For a few days, there was a big uproar in the IT department over the "low code quality" causing so many exceptions - until one of the mechanical engineering guys noticed the worries and explained that this was quite normal and expected: some of the standard matrix operations would generate partial results where a number was indeed divided by zero, but the algorithm did not make use of those partial results. So there was no "real" need to perform those divisions at all; it was just a consequence of using a standard matrix library operating on all elements rather than only those actually used.
If you didn't know, you might have spent lots of time speeding up the processing of the Divide by Zero exception, which might have been a waste.
When you ask for other people's use of a certain mechanism, you will not know the context from which these figures were drawn. If you collect data from two dozen independent sources, you might get an idea about the "typical" figures, but they might be completely off for one specific application domain.
To illustrate: this machine with the half-square-meter FPU was mostly used in engineering applications, where FP performance was at a premium. For business use, you could choose the BCD option. Business applications hardly do division at all, so there was no BCD divide hardware - it was implemented purely in microcode, and it was dead slow! But no customer ever complained: they never noticed, because they never used BCD divide.
For comparison: FP divide started with a table lookup on the two operands, giving the first 11 bits correct, followed by 1-clock-cycle iterations, each iteration doubling the number of correct result bits. Finally, 1 cycle was required for normalizing the result value. So the total time for a 32-bit divide was four clock cycles; for a 64-bit divide, five clock cycles. They won a couple of design awards for that FPU in the early 1980s (and a number of prestigious engineering contracts, like with CERN and the F-16 fighter project).
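In software, that scheme is essentially the classic Newton-Raphson reciprocal iteration. A sketch of the idea only; the seed here is the textbook linear fit, not that FPU's actual 11-bit table:

```cpp
#include <cmath>

// Quadratic-convergence reciprocal: seed an estimate of 1/m (the linear fit
// 48/17 - 32/17*m is good to about 4 bits), then let each Newton-Raphson
// step x <- x*(2 - m*x) square the relative error, i.e. double the number
// of correct bits: roughly 4 -> 8 -> 16 -> 33 -> 66.
// Assumes d is positive, finite, and non-zero.
double reciprocal(double d) {
    int exp;
    double m = std::frexp(d, &exp);              // d = m * 2^exp, m in [0.5, 1)
    double x = 48.0 / 17.0 - (32.0 / 17.0) * m;  // seed: relative error <= 1/17
    for (int i = 0; i < 4; ++i)
        x = x * (2.0 - m * x);                   // refine the estimate of 1/m
    return std::ldexp(x, -exp);                  // 1/d = (1/m) * 2^-exp
}
```

With an 11-bit table seed instead, two iterations already give 44 correct bits, which is why the 32-bit divide fit in four cycles (lookup, two iterations, normalize).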
|
|
|
|
|
Wouldn't that vary from one application to the next, depending on the purpose of the app?
If you think 'goto' is evil, try writing an Assembly program without JMP.
|
|
|
|
|
|
THE MOST AND LEAST FREQUENT DIGITS IN THE NUMBERS FROM 1 TO 1000
It is not about writing code - though that is possible too - but about finding a nice logical explanation...
So which is the most frequent digit in the list of numbers from 1 to 1000?
And the least frequent?
Why?
And even though Google is our friend, it would be nice not to ask it about this one...
Skipper: We'll fix it.
Alex: Fix it? How you gonna fix this?
Skipper: Grit, spit and a whole lotta duct tape.
|
|
|
|
|
OK, I'll be the first in.
1 is the most frequent only because you are going from 1 to 1000. If it were 1 to 999 or 2 to 1000, there would be the same number of 1's as of every other non-zero digit.
0 is the least frequent because numbers do not start with a 0 (except 0 itself, which is not included).
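A quick brute-force count bears this out:

```cpp
#include <array>
#include <iostream>

// Count every decimal digit appearing in the numbers 1..1000,
// written without leading zeros.
int main() {
    std::array<long, 10> count{};
    for (int n = 1; n <= 1000; ++n)
        for (int v = n; v > 0; v /= 10)
            ++count[v % 10];
    for (int d = 0; d <= 9; ++d)
        std::cout << d << ": " << count[d] << '\n';
    // Prints 0: 192, 1: 301, and 300 each for 2..9 -- the extra '1's
    // come only from "1000", and '0' never appears as a leading digit.
}
```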
Brent
|
|
|
|