I was wondering whether double might be faster than float on some machines.
However, the operations I am performing really only require float precision. They are image-processing operations, and I would like to use the fastest type possible.
Can I use float everywhere and trust that the optimizing VC++ 2008 compiler will convert it to double if it deems that more appropriate? I don't see how this would break the code.
Thanks in advance!
No, the compiler will not change a fundamental type like float to a double for optimization.
If you think such a switch is likely, use a typedef for your floating-point type in a common header, e.g. typedef float FASTFLOAT;
and use FASTFLOAT (or whatever you name it) throughout your code. You can then change the type everywhere by editing that one central typedef, as sketched below.
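For example, a minimal sketch of that approach (the header and function names here are illustrative, not from the original answer):

    // fast_float.h -- one central place to pick the floating-point type
    #ifndef FAST_FLOAT_H
    #define FAST_FLOAT_H

    typedef float FASTFLOAT;   // change to double here if it ever proves faster

    #endif

    // usage elsewhere:
    #include "fast_float.h"

    FASTFLOAT blend(FASTFLOAT a, FASTFLOAT b, FASTFLOAT t)
    {
        return a + t * (b - a);   // all arithmetic follows the chosen type
    }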
My own experience is that float and double are basically comparable in performance for math operations on current x86/x64 platforms, and I tend to prefer double. If you are processing a lot of data (and hitting memory bandwidth limits rather than being compute-bound), you may get some performance benefit from the fact that floats are half the size of doubles.
You will also want to explore the effects of the various optimization flags. Depending on your target platform requirements, you may be able to optimize more aggressively.
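As a rough example of what to experiment with on VC++ 2008 (the source file name is a placeholder; measure the accuracy impact before relying on these switches):

    cl /O2 /fp:fast /arch:SSE2 filter.cpp

/O2 turns on general optimizations, /fp:fast relaxes strict IEEE floating-point semantics so the compiler can reorder math, and /arch:SSE2 lets 32-bit builds use SSE registers and instructions instead of the x87 FPU (x64 builds use SSE2 by default).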
Firstly, the compiler doesn't change float types unless it has to, and never in storage declarations.
float will be no slower than double, but if you really want fast processing, you need to look into either using a compiler that can generate SSE2 or SSE3 code or writing your heavy-processing routines with those instructions yourself. IIRC, there are tools that can help you micromanage the processor's pipeline if necessary. Last I messed with this (years ago), Intel had a library called IPP that could help as well by vectorizing your math.
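To give a flavor of that kind of vectorization, here is a hand-rolled sketch using the SSE intrinsics from xmmintrin.h (not code from IPP or from the answer above); it scales a buffer of floats four at a time:

    #include <cstddef>      // std::size_t
    #include <xmmintrin.h>  // SSE intrinsics

    // Scale `count` floats by `scale`, four at a time.
    // Assumes `data` is 16-byte aligned and `count` is a multiple of 4;
    // a real routine would also handle the unaligned head/tail scalar-wise.
    void scale_buffer(float* data, std::size_t count, float scale)
    {
        __m128 s = _mm_set1_ps(scale);                 // broadcast scale into all 4 lanes
        for (std::size_t i = 0; i < count; i += 4)
        {
            __m128 v = _mm_load_ps(data + i);          // load 4 aligned floats
            _mm_store_ps(data + i, _mm_mul_ps(v, s));  // multiply and store back
        }
    }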
I have never heard of an architecture where float was slower than double, if only for the fact that memory bandwidth requirements double when you use double. Any FPU that can do a single-cycle double operation can do a single-cycle float operation with a little modification at most.
Mark's got a good idea, though: profile your code if you think it's slow. You might find the real problem is somewhere else, like hidden type casts or function-call overhead from something you expected to be inlined but wasn't.
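If you want a quick sanity check before reaching for a real profiler, a crude timing sketch along these lines (using clock(), so results are only indicative) will show whether float versus double even matters for your data sizes:

    #include <cstdio>
    #include <ctime>
    #include <vector>

    // Crude sketch: time one pass over n elements for a given type T.
    // Results are rough; a real profiler gives a far better picture.
    template <typename T>
    double time_pass(std::size_t n)
    {
        std::vector<T> v(n, T(1.5));
        std::clock_t start = std::clock();
        T sum = 0;
        for (std::size_t i = 0; i < n; ++i)
            sum += v[i] * T(0.99);
        std::clock_t end = std::clock();
        std::printf("checksum %f\n", double(sum));  // keep the loop from being optimized away
        return double(end - start) / CLOCKS_PER_SEC;
    }

    int main()
    {
        const std::size_t n = 50 * 1000 * 1000;
        std::printf("float:  %.3f s\n", time_pass<float>(n));
        std::printf("double: %.3f s\n", time_pass<double>(n));
        return 0;
    }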
When the code needs to store the variable in memory, chances are that on most architectures it will take 32 bits for a float and 64 bits for a double. Converting the storage size behind your back would get in the way of fully optimizing such code.
Are you sure that the floating point math is the bottleneck in your application? Perhaps profiling would reveal another possible source of improvement.