Question
In Haskell, the base
libraries and Hackage packages provide several means of converting binary IEEE-754 floating point data to and from the lifted Float
and Double
types. However, the accuracy, performance, and portability of these methods are unclear.
For a GHC-targeted library intended to (de)serialize a binary format across platforms, what is the best approach for handling IEEE-754 floating point data?
Approaches
These are the methods I've encountered in existing libraries and online resources.
FFI Marshaling
This is the approach used by the data-binary-ieee754
package. Since Float
, Double
, Word32
and Word64
are each instances of Storable
, one can poke
a value of the source type into an external buffer, and then peek
a value of the target type:
toFloat :: (F.Storable word, F.Storable float) => word -> float
toFloat word = F.unsafePerformIO $ F.alloca $ \buf -> do
F.poke (F.castPtr buf) word
F.peek buf
On my machine this works, but I cringe to see allocation being performed just to accomplish the coercion. Also, although not unique to this solution, there's an implicit assumption here that IEEE-754 is actually the in-memory representation. The tests accompanying the package give it the "works on my machine" seal of approval, but this is not ideal.
unsafeCoerce
With the same implicit assumption of in-memory IEEE-754 representation, the following code gets the "works on my machine" seal as well:
toFloat :: Word32 -> Float
toFloat = unsafeCoerce
This has the benefit of not performi开发者_开发知识库ng explicit allocation like the approach above, but the documentation says "it is your responsibility to ensure that the old and new types have identical internal representations". That implicit assumption is still doing all the work, and is even more strenuous when dealing with lifted types.
unsafeCoerce#
Stretching the limits of what might be considered "portable":
toFloat :: Word -> Float
toFloat (W# w) = F# (unsafeCoerce# w)
This seems to work, but doesn't seem practical at all since it's limited to the GHC.Exts
types. It's nice to bypass the lifted types, but that's about all that can be said.
encodeFloat
and decodeFloat
This approach has the nice property of bypassing anything with unsafe
in the name, but doesn't seem to get IEEE-754 quite right. A previous SO answer to a similar question offers a concise approach, and the ieee754-parser
package used a more general approach before being deprecated in favor of data-binary-ieee754
.
There's quite a bit of appeal to having code that needs no implicit assumptions about underlying representation, but these solutions rely on encodeFloat
and decodeFloat
, which are apparently fraught with inconsistencies. I've not yet found a way around these problems.
Simon Marlow mentions another approach in GHC bug 2209 (also linked to from Bryan O'Sullivan's answer)
You can achieve the desired effect using castSTUArray, incidentally (this is the way we do it in GHC).
I've used this option in some of my libraries in order to avoid the unsafePerformIO
required for the FFI marshalling method.
{-# LANGUAGE FlexibleContexts #-}
import Data.Word (Word32, Word64)
import Data.Array.ST (newArray, castSTUArray, readArray, MArray, STUArray)
import GHC.ST (runST, ST)
wordToFloat :: Word32 -> Float
wordToFloat x = runST (cast x)
floatToWord :: Float -> Word32
floatToWord x = runST (cast x)
wordToDouble :: Word64 -> Double
wordToDouble x = runST (cast x)
doubleToWord :: Double -> Word64
doubleToWord x = runST (cast x)
{-# INLINE cast #-}
cast :: (MArray (STUArray s) a (ST s),
MArray (STUArray s) b (ST s)) => a -> ST s b
cast x = newArray (0 :: Int, 0) x >>= castSTUArray >>= flip readArray 0
I inlined the cast function because doing so causes GHC to generate much tighter core. After inlining, wordToFloat
is translated to a call to runSTRep and three primops (newByteArray#
, writeWord32Array#
, readFloatArray#
).
I'm not sure what performance is like compared to the FFI marshalling method, but just for fun I compared the core generated by both options.
Doing FFI marshalling is a fair bit more complicated in this regard. It calls unsafeDupablePerformIO and 7 primops (noDuplicate#
, newAlignedPinnedByteArray#
, unsafeFreezeByteArray#
, byteArrayContents#
, writeWord32OffAddr#
, readFloatOffAddr#
, touch#
).
I've only just started learning how to analyse core, perhaps someone with more experience can comment on the cost of these operations?
All modern CPUs use IEEE754 for floating point, and this seems very unlikely to change within our lifetime. So don't worry about code making that assumption.
You are very definitely not free to use unsafeCoerce
or unsafeCoerce#
to convert between integral and floating point types, as this can cause both compilation failures and runtime crashes. See GHC bug 2209 for details.
Until GHC bug 4092, which addresses the need for int↔fp coercions, is fixed, the only safe and reliable approach is via the FFI.
I'm the author of data-binary-ieee754
. It has at some point used each of the three options.
encodeFloat
and decodeFloat
work well enough for most cases, but the accessory code required to use them adds tremendous overhead. They do not react well to NaN
or Infinity
, so some GHC-specific assumptions are required for any casts based upon them.
unsafeCoerce
was an attempted replacement, to get better performance. It was really fast, but reports of other libraries having significant problems made me eventually decide to avoid it.
The FFI code has so far been the most reliable, and has decent performance. The overhead of allocation isn't as bad as it seems, likely due to the GHC memory model. And it actually doesn't depend on the internal format of floats, merely on the behavior of the Storable
instance. The compiler can use whatever representation it wants as long as Storable
is IEEE-754. GHC uses IEEE-754 internally anyway, and I don't worry much about non-GHC compilers any more, so it's a moot point.
Until the GHC developers see fit to bestow upon us unlifted fixed-width words, with associated conversion functions, FFI seems the best option.
I'd use the FFI method for conversion. But be sure to use the alignment when allocating memory so you get memory that is acceptable for load/store for both the floating point number and the integer. You should also put in some assert about the sizes of the float and word being the same so you can detect if anything goes wrong.
If allocating memory makes you cringe you should not be using Haskell. :)
精彩评论