Computational Physics I
Luigi Scorzato
Lecture 2: Floating point arithmetic
Computer memories are finite:
1. How can we represent numbers (integers, reals, ...) on a computer?
2. To what extent can such representation(s) be trusted?
Representation
Usually computers assign 32 or 64 bits to each single number, but there are two main strategies to do it:
Fixed Point (used for Integers):
$$ n = (-1)^{a_0}\,\left(a_1\, 2^{0} + a_2\, 2^{1} + \dots + a_{M-1}\, 2^{M-2}\right) $$
M is the number of bits available for a single number (typical choices are M = 32 or 64) and $a_i = 0,1$. The largest representable integer is $n_{\max} \simeq 2^{M-1}$, i.e. (with 64 bits) about $9.2 \times 10^{18}$. This means that $9 \times 10^{18}$ and $9 \times 10^{18} + 1$ are both represented and distinguishable; but $10^{19}$ cannot be represented (overflow).
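A minimal sketch (MATLAB/Octave, my own illustration) of these integer limits; note that MATLAB/Octave integer arithmetic saturates at intmax rather than wrapping around.

% Fixed point (integer) limits in MATLAB/Octave.
intmax('int64')                  % 9223372036854775807, about 9.2e18
int64(9e18) + 1 > int64(9e18)    % true: neighbouring 64-bit integers remain distinguishable
intmax('int64') + 1              % exceeds the range; MATLAB/Octave saturate at intmax
                                 % (other languages wrap around or signal an error)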
Floating Point (used for Reals, ...)
$$ x = (-1)^{s}\; 1.f \;\times\; 2^{\,e - \mathrm{bias}} $$
IEEE754 standard: s is 1 bit for the sign; f is the 23-bit (single precision) or 52-bit (double precision) mantissa, which together with the implicit leading 1 gives 24 or 53 significant bits; e is the 8-bit (single) or 11-bit (double) exponent. The length of the mantissa defines roughly the relative precision, that of the exponent the range. Both are represented as integers.
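As a quick check, a minimal sketch (MATLAB/Octave; eps, realmax, realmin and typecast are standard functions) of what these field widths mean in practice:

% Precision and range implied by the IEEE754 fields.
eps                          % 2^-52 ~ 2.2e-16: relative spacing of doubles (mantissa length)
eps('single')                % 2^-23 ~ 1.2e-7 : relative spacing of singles
realmax, realmin             % ~1.8e308 and ~2.2e-308: range fixed by the (double) exponent
% Raw bit fields s|e|f of a single precision number:
dec2bin(typecast(single(-1.5), 'uint32'), 32)
% -> 1 01111111 10000000000000000000000
%    sign = 1, exponent field = 127 (the bias, so e - bias = 0), fraction = 100...0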
Limitations of Floating Point Arithmetic
Commutativity and the additive inverse are OK in IEEE arithmetic: a+b = b+a; a*b = b*a; a-a = 0 (less trivial than you may think). But these are the last good news...
Addition is not associative: (a+b)+c != a+(b+c)
The distributive law does not hold: (a+b)*c != a*c + b*c
The multiplicative inverse may not exist: a*(1/a) != 1
Most simple numbers in decimal notation are not mapped exactly.
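A minimal sketch (MATLAB/Octave, double precision) of the failures listed above; the specific values are my own choices.

a = 0.1; b = 0.2; c = 0.3;
(a + b) + c == a + (b + c)       % false: addition is not associative
(a + b) * c == a*c + b*c         % false: the distributive law fails
0.1 + 0.2 == 0.3                 % false: 0.1, 0.2, 0.3 have no exact binary representation
n = 1:100;
find(n .* (1 ./ n) ~= 1)         % integers n for which n*(1/n) is not exactly 1
                                 % (e.g. n = 49 on typical IEEE systems)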
Typical mechanism that produces errors:
shift of the mantissa when summing numbers of very different magnitude, e.g. (see the sketch below):
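A minimal sketch (MATLAB/Octave) of this mechanism; 1e16 and 1 are my own example values.

big = 1e16;  small = 1;
(big + small) - big     % gives 0, not 1: doubles near 1e16 are spaced 2 apart,
                        % so 1e16 + 1 rounds back to 1e16
eps(1e16)               % spacing of doubles around 1e16 (here 2), already larger than 'small'
% To align the exponents before the addition, the mantissa of 'small' is shifted right;
% here the shift pushes all of its bits beyond the 53 significant bits kept in double precision.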
A Model
Instead of ||float(A op B) - (A op B)|| = 0, we can only assume that ||float(A op B) - (A op B)|| <= u ||A op B|| (where op = +, -, *, / acts on individual floating point numbers and u is the machine precision, or unit roundoff). We can use this model to predict which errors to expect. For example, for the scalar product one finds (Golub & Van Loan):
$$ \left|\; \mathrm{fl}\!\left(\sum_{k=1}^{N} x_k\, y_k\right) \;-\; \sum_{k=1}^{N} x_k\, y_k \;\right| \;\le\; N\, u \sum_{k=1}^{N} |x_k\, y_k| \;+\; O(u^2) $$
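The bound can be checked empirically. A minimal sketch (MATLAB/Octave, my own construction) that accumulates the dot product in single precision and uses a double-precision evaluation of the same data as the reference:

N = 10000;
x = single(rand(N,1));  y = single(rand(N,1));
s = single(0);
for k = 1:N
  s = s + x(k)*y(k);                         % fl(...): every operation rounded to single
end
ref   = sum(double(x) .* double(y));         % reference value, computed in double precision
u     = eps('single') / 2;                   % unit roundoff of single precision
err   = abs(double(s) - ref);
bound = N * u * sum(abs(double(x) .* double(y)));
fprintf('error = %.2e   bound N*u*sum|x_k y_k| = %.2e\n', err, bound);

The observed error is usually far below the worst-case bound, since individual rounding errors partially cancel.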
Simple exercises
Exponential function [see my_exp.m], also as a limit [my_exp_seq.m]:
$$ e^{x} = \sum_{n=0}^{\infty} \frac{x^{n}}{n!} \;, \qquad e^{x} = \lim_{n\to\infty}\left(1 + \frac{x}{n}\right)^{n} $$
Accumulating sums, e.g. the harmonic series:
$$ \sum_{k=1}^{N} \frac{1}{k} \;\simeq\; \ln N + \gamma_{\rm Euler} \qquad (N \gg 1) $$
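A minimal sketch (MATLAB/Octave) in the spirit of my_exp.m; this is a hypothetical reconstruction, the actual course file may differ. It sums the Taylor series of exp(x) term by term and compares with the built-in exp().

x = -20;                         % for large |x| with x < 0 the terms cancel strongly
term = 1;  s = 1;                % n = 0 term
for n = 1:200
  term = term * x / n;           % x^n / n!, built up recursively
  s = s + term;
end
fprintf('series = %.6e   exp(x) = %.6e   relative error = %.1e\n', ...
        s, exp(x), abs(s - exp(x)) / exp(x));
% The largest terms are of order 20^20/20! ~ 4e7, so roundoff of order 4e7 * eps ~ 1e-8
% swamps the true result exp(-20) ~ 2e-9: the relative error is of order one.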
Message:
Trying to understand precisely the origin of rounding errors is often frustrating and as hard as solving analytically the problem that we want to solve numerically. What we can do is to check a posteriori:
check the correctness against known exact results which have the same numerical difficulties;
check consistency when changing conditions by negligible amounts (when you know that they should not matter: sometimes high sensitivity is physical);
check consistency when changing the numerical precision of the operations.
Compute the derivative of a function
Notation:
$$ f_n \equiv f(t_0 + n\,h) $$
Naive 1st derivative:
$$ f^{(1),\,\mathrm{order}=1} = \frac{f_1 - f_0}{h} + O(h) $$
One can do better (remember Taylor):
$$
\begin{aligned}
f_{1} &= f_0 + h f^{(1)} + \frac{h^2}{2} f^{(2)} + \frac{h^3}{3!} f^{(3)} + \frac{h^4}{4!} f^{(4)} + O(h^5) \\
f_{2} &= f_0 + 2h f^{(1)} + \frac{(2h)^2}{2} f^{(2)} + \frac{(2h)^3}{3!} f^{(3)} + \frac{(2h)^4}{4!} f^{(4)} + O((2h)^5) \\
f_{1} - f_{-1} &= 2h f^{(1)} + \frac{1}{3} h^3 f^{(3)} + O(h^5) \\
f_{2} - f_{-2} &= 4h f^{(1)} + \frac{8}{3} h^3 f^{(3)} + O(h^5) \\
f^{(1),\,o=2} &= \frac{f_1 - f_{-1}}{2h} + O(h^2) \\
f^{(1),\,o=4} &= \frac{8\,(f_1 - f_{-1}) - (f_2 - f_{-2})}{12\,h} + O(h^4)
\end{aligned}
$$
However, smaller h and higher orders are not necessarily better: see the following example.
Exercise: write a program that computes the derivative of sin(ω x) for:
different orders of approximation;
different values of h;
different values of ω.
Compare with [numdiff.m]
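A minimal sketch (MATLAB/Octave) of such a program, using the three formulas derived above; numdiff.m may be organized differently, and omega, t0 and the range of h are my own choices.

omega = 1;  t0 = 1;
f      = @(t) sin(omega * t);
dexact = omega * cos(omega * t0);            % exact derivative, for comparison
for h = 10.^(-1:-2:-13)
  d1 = (f(t0+h) - f(t0)) / h;                                          % order 1
  d2 = (f(t0+h) - f(t0-h)) / (2*h);                                    % order 2
  d4 = (8*(f(t0+h) - f(t0-h)) - (f(t0+2*h) - f(t0-2*h))) / (12*h);     % order 4
  fprintf('h = %.0e   errors: %.1e  %.1e  %.1e\n', ...
          h, abs(d1-dexact), abs(d2-dexact), abs(d4-dexact));
end
% The truncation error decreases with h, but the rounding error grows roughly like eps/h:
% below some optimal h (which depends on the order) the results get worse again.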