Abstract
Numbers from \(\mathbb {R}\) cannot be handled directly by computers, as they can have infinitely many digits after the decimal point. We have to truncate their representation. Storage schemes with a finite number of digits before and after the decimal point struggle: There are always regimes of values where they work prefectly fine and regimes where they cannot represent values accurately. We therefore introduce the concept of floating point numbers and discuss the IEEE standard. Its concept of normalisation allows us to establish the notion of machine precision, as we realise that the truncation into a finite number of digits equals rounding and introduces a bounded relative error. Since we work with approximations to real numbers, we have to be careful when we compare floating point numbers in our codes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Storing the significant bits—note the d versus t.
- 2.
From Latin, I consider datum to be singular and data plural.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Weinzierl, T. (2021). Floating Point Numbers. In: Principles of Parallel Scientific Computing. Undergraduate Topics in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-030-76194-3_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-76194-3_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-76193-6
Online ISBN: 978-3-030-76194-3
eBook Packages: Computer ScienceComputer Science (R0)