SlideShare a Scribd company logo
1 of 75
Chapter 4
Addition/subtraction of signed numbers
si =
ci +1 =
13
7
+ Y
1
0
0
0
1
0
1
1
0
0
1
1
0
1
1
0
0
1
1
0
1
0
0
1
0
0
0
0
1
1
1
1
0
0
0
0
1
1
1
1
Example:
1
0
= = 0
0
1 1
1
1 1 0 0
1
1 1 1
0
Legend for stage i
xi yi Carry-in ci Sumsi Carry-outci+1
X
Z
+ 6 0
+
xi
yi
si
Carry-out
ci+1
Carry-in
ci
xi
yi
ci
xi
yi
ci
xi
yi
ci
xi
yi
ci xi yi ci
 
=
+ + +
yi
ci
xi
ci
xi
yi
+ +
At the ith stage:
Input:
ci is the carry-in
Output:
si is the sum
ci+1 carry-out to (i+1)st
state
Addition logic for a single stage
Full adder
(FA)
ci
ci 1
+
s
i
Sum Carry
yi
xi
c
i
yi
xi
c
i
yi
x
i
xi
ci
yi
si
c
i 1
+
Full Adder (FA): Symbol for the complete circuit
for a single stage of addition.
n-bit adder
•Cascade n full adder (FA) blocks to form a n-bit adder.
•Carries propagate or ripple through this cascade, n-bit ripple carry adder.
FA c0
y1
x1
s1
FA
c1
y0
x0
s0
FA
cn 1
-
yn 1
-
xn 1
-
cn
sn 1
-
Most significant bit
(MSB) position
Least significant bit
(LSB) position
Carry-in c0 into the LSB position provides a convenient way to
perform subtraction.
K n-bit adder
K n-bit numbers can be added by cascading k n-bit adders.
n-bit c
0
yn
xn
s
n
cn
y0
xn 1
-
s
0
ckn
s
k 1
-
( )n
x0
yn 1
-
y2n 1
-
x2n 1
-
ykn 1
-
s
n 1
-
s
2n 1
-
s
kn 1
-
xkn 1
-
adder
n-bit
adder
n-bit
adder
Each n-bit adder forms a block, so this is cascading of blocks.
Carries ripple or propagate through blocks, Blocked Ripple Carry Adder
n-bit subtractor
FA 1
y1
x1
s1
FA
c1
y0
x0
s0
FA
cn 1
-
yn 1
-
xn 1
-
cn
sn 1
-
Most significant bit
(MSB) position
Least significant bit
(LSB) position
•Recall X – Y is equivalent to adding 2’s complement of Y to X.
•2’s complement is equivalent to 1’s complement + 1.
•X – Y = X + Y + 1
•2’s complement of positive and negative numbers is computed similarly.
n-bit adder/subtractor (contd..)
Add/Sub
control
n-bit adder
x
n 1
-
x
1
x
0
c
n
s
n 1
- s
1
s
0
c
0
y
n 1
-
y
1
y
0
•Add/sub control = 0, addition.
•Add/sub control = 1, subtraction.
Detecting overflows
 Overflows can only occur when the sign of the two operands is
the same.
 Overflow occurs if the sign of the result is different from the
sign of the operands.
 Recall that the MSB represents the sign.
 xn-1, yn-1, sn-1 represent the sign of operand x, operand y and result s
respectively.
 Circuit to detect overflow can be implemented by the
following logic expressions:
1
1
1
1
1
1 




 
 n
n
n
n
n
n s
y
x
s
y
x
Overflow
1


 n
n c
c
Overflow
Computing the add time
Consider 0th stage:
x0
y0
c0
c1
s0
FA
•c1 is available after 2 gate delays.
•s1 is available after 1 gate delay.
c
i
yi
xi
c
i
yi
x
i
xi
ci
yi
si
c
i 1
+
Sum Carry
Computing the add time (contd..)
x0
y0
s2
FA
x0 y0
x0
y0
s1
FA
c2
s0
FA
c1
c3
c0
x0
y0
s3
FA
c4
Cascade of 4 Full Adders, or a 4-bit adder
•s0 available after 1 gate delays, c1 available after 2 gate delays.
•s1 available after 3 gate delays, c2 available after 4 gate delays.
•s2 available after 5 gate delays, c3 available after 6 gate delays.
•s3 available after 7 gate delays, c4 available after 8 gate delays.
For an n-bit adder, sn-1 is available after 2n-1 gate delays
cn is available after 2n gate delays.
Fast addition
Recall the equations:
i
i
i
i
i
i
i
i
i
i
i
c
y
c
x
y
x
c
c
y
x
s






1
Second equation can be written as:
i
i
i
i
i
i c
y
x
y
x
c )
(
1 



We can write:
i
i
i
i
i
i
i
i
i
i
y
x
P
and
y
x
G
where
c
P
G
c





1
•Gi is called generate function and Pi is called propagate function
•Gi and Pi are computed only from xi and yi and not ci, thus they can
be computed in one gate delay after X and Y are applied to the
inputs of an n-bit adder.
Carry lookahead
ci 1  Gi  Pici
ci  Gi1  Pi1ci1
 ci1  Gi  Pi(Gi1  Pi 1ci 1)
continuing
 ci1  Gi  Pi(Gi1  Pi 1(Gi 2  Pi 2ci 2 ))
until
ci1  Gi  PiGi 1  Pi Pi1Gi 2  ..  PiPi1..P1G0  Pi Pi 1...P0c0
•All carries can be obtained 3 gate delays after X, Y and c0 are applied.
-One gate delay for Pi and Gi
-Two gate delays in the AND-OR circuit for ci+1
•All sums can be obtained 1 gate delay after the carries are computed.
•Independent of n, n-bit addition requires only 4 gate delays.
•This is called Carry Lookahead adder.
Carry-lookahead adder
Carry-lookahead logic
B cell B cell B cell B cell
s
3
P3
G3
c
3
P2
G2
c
2
s
2
G
1
c
1
P
1
s
1
G
0
c
0
P
0
s
0
.
c4
x
1
y
1
x
3
y
3
x
2
y
2
x
0
y
0
G i
c
i
.
.
.
P i
s i
x i
y i
B cell
4-bit
carry-lookahead
adder
B-cell for a single stage
Carry lookahead adder (contd..)
 Performing n-bit addition in 4 gate delays independent of n
is good only theoretically because of fan-in constraints.
 Last AND gate and OR gate require a fan-in of (n+1) for a n-
bit adder.
 For a 4-bit adder (n=4) fan-in of 5 is required.
 Practical limit for most gates.
 In order to add operands longer than 4 bits, we can cascade
4-bit Carry-Lookahead adders. Cascade of Carry-Lookahead
adders is called Blocked Carry-Lookahead adder.
ci1
 Gi
 Pi
Gi1
 Pi
Pi1
Gi2
 ..  Pi
Pi1
..P
1
G0
 Pi
Pi1
...P0
c0
4-bit carry-lookahead Adder
Blocked Carry-Lookahead adder
c4  G3  P
3G2  P
3P2G1  P
3P
2 P
1G0  P
3P
2 P
1P0c0
Carry-out from a 4-bit block can be given as:
Rewrite this as:
P
0
I
 P
3P
2 P
1P
0
G0
I
 G3  P
3G2  P
3P
2G1  P
3P2P
1G0
Subscript I denotes the blocked carry lookahead and identifies the block.
Cascade 4 4-bit adders, c16 can be expressed as:
c16  G3
I
 P
3
I
G2
I
 P
3
I
P
2
I
G1
I
 P3
I
P
2
I
P
1
0
G0
I
 P
3
I
P
2
I
P
1
0
P
0
0
c0
Blocked Carry-Lookahead adder
Carry-lookahead logic
4-bit adder 4-bit adder 4-bit adder 4-bit adder
s15-12
P3
I
G3
I
c12
P2
I
G2
I
c8
s11-8
G1
I
c4
P1
I
s7-4
G0
I
c0
P0
I
s3-0
c16
x15-12
y15-12
x11-8
y11-8
x7-4
y7-4
x3-0
y3-0
.
After xi, yi and c0 are applied as inputs:
- Gi and Pi for each stage are available after 1 gate delay.
- PI is available after 2 and GI after 3 gate delays.
- All carries are available after 5 gate delays.
- c16 is available after 5 gate delays.
- s15 which depends on c12 is available after 8 (5+3)gate delays
(Recall that for a 4-bit carry lookahead adder, the last sum bit is
available 3 gate delays after all inputs are available)
Multiplication of unsigned numbers
Product of 2 n-bit numbers is at most a 2n-bit number.
Unsigned multiplication can be viewed as addition of shifted
versions of the multiplicand.
Multiplication of unsigned
numbers (contd..)
 We added the partial products at end.
 Alternative would be to add the partial products at each stage.
 Rules to implement multiplication are:
 If the ith bit of the multiplier is 1, shift the multiplicand and add the
shifted multiplicand to the current value of the partial product.
 Hand over the partial product to the next stage
 Value of the partial product at the start stage is 0.
Multiplication of unsigned numbers
ith multiplier bit
carry in
carry out
jth multiplicand bit
ith multiplier bit
Bit of incoming partial product (PPi)
Bit of outgoing partial product (PP(i+1))
FA
Typical multiplication cell
Combinatorial array multiplier
Multiplicand
m3 m2 m1 m0
0 0 0 0
q3
q2
q1
q0
0
p2
p1
p0
0
0
0
p3
p4
p5
p6
p7
PP1
PP2
PP3
(PP0)
,
Product is: p7,p6,..p0
Multiplicand is shifted by displacing it through an array of adders.
Combinatorial array multiplier
Combinatorial array multiplier
(contd..)
 Combinatorial array multipliers are:
 Extremely inefficient.
 Have a high gate count for multiplying numbers of practical size such as 32-
bit or 64-bit numbers.
 Perform only one function, namely, unsigned integer product.
 Improve gate efficiency by using a mixture of
combinatorial array techniques and sequential
techniques requiring less combinational logic.
Sequential multiplication
 Recall the rule for generating partial products:
 If the ith bit of the multiplier is 1, add the appropriately shifted multiplicand
to the current partial product.
 Multiplicand has been shifted left when added to the partial product.
 However, adding a left-shifted multiplicand to an
unshifted partial product is equivalent to adding an
unshifted multiplicand to a right-shifted partial
product.
Sequential Circuit Multiplier
q
n 1
-
m
n 1
-
n-bit
Adder
Multiplicand M
Control
sequencer
Multiplier Q
0
C
Shift right
Register A (initially 0)
Add/Noadd
control
a
n 1
-
a
0
q
0
m
0
0
MUX
Sequential multiplication (contd..)
1 1 1 1
1 0 1 1
1 1 1 1
1 1 1 0
1 1 1 0
1 1 0 1
1 1 0 1
Initial configuration
Add
M
1 1 0 1
C
First cycle
Second cycle
Third cycle
Fourth cycle
No add
Shift
Shift
Add
Shift
Shift
Add
1 1 1 1
0
0
0
1
0
0
0
1
0
0 0 0 0
0 1 1 0
1 1 0 1
0 0 1 1
1 0 0 1
0 1 0 0
0 0 0 1
1 0 0 0
1 0 0 1
1 0 1 1
Q
A
Product
Signed Multiplication
 Considering 2’s-complement signed operands, what will happen to (-
13)(+11) if following the same method of unsigned multiplication?
Sign extension of negative multiplicand.
1
0
1
1 1
1 1 1 0 0 1 1
1
1
0
1
1
0
1
0
1
0
0
0
1
1
1
0
1
1
0
0
0
0
0
0
1
1
0
0
1
1
1
0
0
0
0
0
0
0
0
1
1
0
0
1
1
1
1
1
13
-
( )
143
-
( )
11
+
( )
Sign extension is
shown in blue
Signed Multiplication
 For a negative multiplier, a straightforward solution is
to form the 2’s-complement of both the multiplier and
the multiplicand and proceed as in the case of a
positive multiplier.
 This is possible because complementation of both
operands does not change the value or the sign of the
product.
 A technique that works equally well for both negative
and positive multipliers – Booth algorithm.
Booth Algorithm
 Consider in a multiplication, the multiplier is positive
0011110, how many appropriately shifted versions of
the multiplicand are added in a standard procedure?
0
0 0
1 0 1 1 0 1
0
0 0 0 0 0 0
1
0
0
1
1
0
1
0
1
0
1
1
0
1
0
1
0
1
1
0
1
0
1
0
1
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
1
0
1
0
1
0
0
0
0
1
+ 1
+ 1
+ 1
+
Booth Algorithm
 Since 0011110 = 0100000 – 0000010, if we use the
expression to the right, what will happen?
0
1
0 1 0 1 1 1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1 1 1 1 1 1 1 0 1 0 0 1
0
0
0
0 0 0 1 0 1 1 0 1
0 0 0 0 0 0 0 0
0
1
1
0
0
0
1
0
0
1
0
0
0 1
2's complement of
the multiplicand
0
0
0
0
1
+ 1
-
0
0
0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0
Booth Algorithm
 In general, in the Booth scheme, -1 times the shifted multiplicand is
selected when moving from 0 to 1, and +1 times the shifted
multiplicand is selected when moving from 1 to 0, as the multiplier
is scanned from right to left.
Booth recoding of a multiplier.
0
0
1
1
0
1
0
1
1
1
0
0
1
1
0
1
0
0
0
0
0
0
0
0
0
0 1
+ 1
-
1
-
1
+
1
-
1
+
1
-
1
+
1
-
1
+
Booth Algorithm
Booth multiplication with a negative multiplier.
0
1
0
1 1 1 1 0 1 1
0 0 0 0 0 0 0 0 0
0
0
0
1
1
0
0 0 0 0 1 1 0
1
1
0
0
1
1
1
0 0 0 0 0 0
0
1
0
0
0 1
1
1
1
1
1
1
0 1 1 0 1
1 1 0 1 0 6
-
( )
13
+
( )
X
78
-
( )
+1
1
- 1
-
Booth Algorithm
Multiplier
Bit i Bit i 1
-
Version of multiplicand
selected by bit
i
0
1
0
0
0
1
1 1
0 M
1
+ M
1
 M
0 M
Booth multiplier recoding table.
X
X
X
X
Booth Algorithm
 Best case – a long string of 1’s (skipping over 1s)
 Worst case – 0’s and 1’s are alternating
1
0
1
1
1
0
0
0
0
1
1
1
1
1
0
0
0
0
0
0
1
1
1
1
0
1
1
0
1
0
0
0
1
1 0 1 0 1 0 1 0 1 0 1 0 1 0 1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
-
1
- 1
- 1
- 1
-
1
-
1
-
1
+ 1
+ 1
+ 1
+ 1
+ 1
+ 1
+ 1
+
1
+
1
+
1
+
1
+
1
+
Worst-case
multiplier
Ordinary
multiplier
Good
multiplier
Flow chart
Bit-Pair Recoding of Multipliers
 Bit-pair recoding halves the maximum number of
summands (versions of the multiplicand).
1
+
1

(a) Example of bit-pair recoding derived from Booth recoding
0
0
0
0
1 1 0 1 0
Implied 0 to right of LSB
1
0
Sign extension
1
2
1
 

Bit-Pair Recoding of Multipliers
i 1
+ i 1
(b) Table of multiplicand selection decisions
selected at position i
Multiplicand
Multiplier bit-pair
i
0
0
1
1
1
0
1
0
1
1
1
1
0
0
0
1
1
0
0
1
0
0
1
Multiplier bit on the right
0 0 X M
1
+
1
1
+
0
1
2
2
+




X M
X M
X M
X M
X M
X M
X M
Bit-Pair Recoding of Multipliers
40
1
-
0
0
0
0
1 1 1 1 1 0
0 0 0 0 1
1
1 1 1 1 1
0 0
0 0 0 0 0 0
0
0
0
0 1
1
1
1
1
1
0 1 1 0 1
0
1 0
1
0
0
1
1
1
1
1
1 1 1 1 0 0 1 1
0 0 0 0 0 0
1 1 1 0 1 1 0 0 1 0
0
1
0 0
1 0
1
0 0
0
0 1
0
0 1
1
0
0
0
1
0
0 1 1 0 1
1
1
1
-
6
-
( )
13
+
( )
1
+
78
-
( )
1
- 2
-
´
Figure 6.15. Multiplication requiring only n/2 summands.
Carry-Save Addition of Summands
 CSA speeds up the addition process.
41
P7 P6 P5 P4 P3 P2 P1 P0
Carry-Save Addition of Summands(Cont.,)
P3 P2 P1 P0
P5 P4
P7 P6
Carry-Save Addition of Summands(Cont.,)
 Consider the addition of many summands, we can:
 Group the summands in threes and perform carry-save addition on
each of these groups in parallel to generate a set of S and C vectors in
one full-adder delay
 Group all of the S and C vectors into threes, and perform carry-save
addition on them, generating a further set of S and C vectors in one
more full-adder delay
 Continue with this process until there are only two vectors remaining
 They can be added in a RCA or CLA to produce the desired product
Carry-Save Addition of Summands
Figure 6.17. A multiplication example used to illustrate carry-save addition as shown in Figure 6.18.
1
0
0 1 1
1
1
0
0 1 1
1
1
0
0 1 1
1
1
1
1
1
1 1
1
0
0 1 1
1 M
Q
A
B
C
D
E
F
(2,835)
X
(45)
(63)
1
0
0 1 1
1
1
0
0 1 1
1
1
0
0 1 1
1
0
0
0 1 1
1 1
1
1 0 0
0 Product
Figure 6.18. The multiplication example from Figure 6.17 performed using
carry-save addition.
0
0
0
0
0
1
0
1 0 1
0
1
0
0
1
0
0
0
0 1 1
1 1
+
1
0
0
0
0
1
1 1
1
0
0
1
0
1
1
1 0 1
0 1
0
1
1
0 1 1
0 0
0
0
0
1
1
0
1
0 0 0
0
1
0
0
0
1
0
1
1 1 0 1
1
1
0
0
0
1 1 0
0
0
1
1
1
1
0
0
0
0
1
1
0 1 1
0
1
1
0
0
1 0 0
1
1
0
0 1 1
1
1
0
0 1 1
1
1
0
0 1 1
1
0
0
1
1
0 1 1
0
1
1
0
0
1 0 0
1
1
0
0 1 1
1
1
0
0 1 1
1
1
0
0 1 1
1
1
1
1
1
1 1
1
0
0 1 1
1 M
Q
A
B
C
S
1
C
1
D
E
F
S
2
C
2
S1
C
1
S2
S
3
C3
C2
S4
C
4
Product
x
Manual Division
Longhand division examples.
1101
1
13
14
26
21
274 100010010
10101
1101
1
1110
1101
10000
13 1101
Longhand Division Steps
 Position the divisor appropriately with respect to the
dividend and performs a subtraction.
 If the remainder is zero or positive, a quotient bit of 1
is determined, the remainder is extended by another
bit of the dividend, the divisor is repositioned, and
another subtraction is performed.
 If the remainder is negative, a quotient bit of 0 is
determined, the dividend is restored by adding back
the divisor, and the divisor is repositioned for another
subtraction.
Circuit Arrangement
Figure 6.21. Circuit arrangement for binary division.
qn-1
Divisor M
Control
Sequencer
Dividend Q
Shift left
N+1 bit
adder
q0
Add/Subtract
Quotient
Setting
A
m0
0 mn-1
a0
an
an-1
Restoring Division
 Shift A and Q left one binary position
 Subtract M from A, and place the answer back in A
 If the sign of A is 1, set q0 to 0 and add M back to A
(restore A); otherwise, set q0 to 1
 Repeat these steps n times
Examples
1
0
1
1
1
Figure 6.22. A restoring-division example.
1 1 1 1 1
0
1
1
1
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
1 0
1
1
1
1 1
0
1
0
0
0
1
Subtract
Shift
Restore
1 0
0
0
0
1 0
0
0
0
1 1
Initially
Subtract
Shift
1
0
1
1
1
1
0
0
0
0
1
1
0
0
0
0
0
0
0
0
Subtract
Shift
Restore
1
0
1
1
1
0
1
0
0
0
1
0
0
0
0
1 1
Quotient
Remainder
Shift
1
0
1
1
1
1 0
0
0
0
Subtract
Second cycle
First cycle
Third cycle
Fourth cycle
0
0
0
0
0
0
1
0
1
1
0
0
0
0
1 1
1 0
0
0
0
1
1
1
1
1
Restore
q0
Set
q0
Set
q0
Set
q0
Set
Nonrestoring Division
 Avoid the need for restoring A after an
unsuccessful subtraction.
 Any idea?
 Step 1: (Repeat n times)
If the sign of A is 0, shift A and Q left one bit position and
subtract M from A; otherwise, shift A and Q left and add M
to A.
Now, if the sign of A is 0, set q0 to 1; otherwise, set q0 to 0.
 Step2: If the sign of A is 1, add M to A
Examples
A nonrestoring-division example.
Add
Restore
remainder
Remainder
0 0 0 0
1
1 1 1 1 1
0 0 0 1 1
1
Quotient
0 0 1 0
1 1 1 1 1
0 0 0 0
1 1 1 1 1
Shift 0 0 0
1
1
0
0
0
0
1
1
1
1
Add
0 0 0 1 1
0 0 0 0 1 0 0 0
1 1 1 0 1
Shift
Subtract
Initially 0 0 0 0 0 1 0 0 0
1 1 1 0 0
0
0
0
1 1 1 0 0
0 0 0 1 1
0 0 0
Shift
Add
0 0 1
0 0 0 0
1
1 1 1 0 1
Shift
Subtract
0 0 0 1
1
0
0
0
0
Fourth cycle
Third cycle
Second cycle
First cycle
q
0
Set
q
0
Set
q
0
Set
q
0
Set
Fractions
If b is a binary vector, then we have seen that it can be interpreted as
an unsigned integer by:
V(b) = b31.231 + b30.230 + bn-3.229 + .... + b1.21 + b0.20
This vector has an implicit binary point to its immediate right:
b31b30b29....................b1b0. implicit binary point
Suppose if the binary vector is interpreted with the implicit binary point is
just left of the sign bit:
implicit binary point .b31b30b29....................b1b0
The value of b is then given by:
V(b) = b31.2-1 + b30.2-2 + b29.2-3 + .... + b1.2-31 + b0.2-32
Range of fractions
The value of the unsigned binary fraction is:
V(b) = b31.2-1 + b30.2-2 + b29.2-3 + .... + b1.2-31 + b0.2-32
The range of the numbers represented in this format is:
In general for a n-bit binary fraction (a number with an assumed binary
point at the immediate left of the vector), then the range of values is:
9999999998
.
0
2
1
)
(
0 32



 
b
V
n
b
V 


 2
1
)
(
0
Scientific notation
•Previous representations have a fixed point. Either the point is to the immediate
right or it is to the immediate left. This is called Fixed point representation.
•Fixed point representation suffers from a drawback that the representation can
only represent a finite range (and quite small) range of numbers.
A more convenient representation is the scientific representation, where
the numbers are represented in the form:
x  m1.m2m3m4  be
Components of these numbers are:
Mantissa (m), implied base (b), and exponent (e)
Significant digits
A number such as the following is said to have 7 significant digits
x  0.m1m2m3m4m5m6m7  be
Fractions in the range 0.0 to 0.9999999 need about 24 bits of precision
(in binary). For example the binary fraction with 24 1’s:
111111111111111111111111 = 0.9999999404
Not every real number between 0 and 0.9999999404 can be represented
by a 24-bit fractional number.
The smallest non-zero number that can be represented is:
000000000000000000000001 = 5.96046 x 10-8
Every other non-zero number is constructed in increments of this value.
Sign and exponent digits
•In a 32-bit number, suppose we allocate 24 bits to represent a fractional
mantissa.
•Assume that the mantissa is represented in sign and magnitude format,
and we have allocated one bit to represent the sign.
•We allocate 7 bits to represent the exponent, and assume that the
exponent is represented as a 2’s complement integer.
•There are no bits allocated to represent the base, we assume that the
base is implied for now, that is the base is 2.
•Since a 7-bit 2’s complement number can represent values in the range
-64 to 63, the range of numbers that can be represented is:
0.0000001 x 2-64 < = | x | <= 0.9999999 x 263
•In decimal representation this range is:
0.5421 x 10-20 < = | x | <= 9.2237 x 1018
A sample representation
Sign Exponent Fractional mantissa
bit
1 7 24
•24-bit mantissa with an implied binary point to the immediate left
•7-bit exponent in 2’s complement form, and implied base is 2.
Normalization
If the number is to be represented using only 7 significant mantissa digits,
the representation ignoring rounding is:
Consider the number: x = 0.0004056781 x 1012
x = 0.0004056 x 1012
If the number is shifted so that as many significant digits are brought into
7 available slots:
x = 0.4056781 x 109 = 0.0004056 x 1012
Exponent of x was decreased by 1 for every left shift of x.
A number which is brought into a form so that all of the available mantissa
digits are optimally used (this is different from all occupied which may
not hold), is called a normalized number.
Same methodology holds in the case of binary mantissas
0001101000(10110) x 28 = 1101000101(10) x 25
Normalization (contd..)
•A floating point number is in normalized form if the most significant
1 in the mantissa is in the most significant bit of the mantissa.
•All normalized floating point numbers in this system will be of the form:
0.1xxxxx.......xx
Range of numbers representable in this system, if every number must be
normalized is:
0.5 x 2-64 <= | x | < 1 x 263
Normalization, overflow and underflow
The procedure for normalizing a floating point number is:
Do (until MSB of mantissa = = 1)
Shift the mantissa left (or right)
Decrement (increment) the exponent by 1
end do
Applying the normalization procedure to: .000111001110....0010 x 2-62
gives: .111001110........ x 2-65
But we cannot represent an exponent of –65, in trying to normalize the
number we have underflowed our representation.
Applying the normalization procedure to: 1.00111000............x 263
gives: 0.100111..............x 264
This overflows the representation.
Changing the implied base
So far we have assumed an implied base of 2, that is our floating point
numbers are of the form:
x = m 2e
If we choose an implied base of 16, then:
x = m 16e
Then:
y = (m.16) .16e-1 (m.24) .16e-1 = m . 16e = x
•Thus, every four left shifts of a binary mantissa results in a decrease of 1
in a base 16 exponent.
•Normalization in this case means shifting the mantissa until there is a 1 in
the first four bits of the mantissa.
Excess notation
•Rather than representing an exponent in 2’s complement form, it turns out to be
more beneficial to represent the exponent in excess notation.
•If 7 bits are allocated to the exponent, exponents can be represented in the range of
-64 to +63, that is:
-64 <= e <= 63
Exponent can also be represented using the following coding called as excess-64:
E’ = Etrue + 64
In general, excess-p coding is represented as:
E’ = Etrue + p
True exponent of -64 is represented as 0
0 is represented as 64
63 is represented as 127
This enables efficient comparison of the relative sizes of two floating point numbers.
IEEE notation
IEEE Floating Point notation is the standard representation in use. There are two
representations:
- Single precision.
- Double precision.
Both have an implied base of 2.
Single precision:
- 32 bits (23-bit mantissa, 8-bit exponent in excess-127 representation)
Double precision:
- 64 bits (52-bit mantissa, 11-bit exponent in excess-1023 representation)
Fractional mantissa, with an implied binary point at immediate left.
Sign Exponent Mantissa
1 8 or 11 23 or 52
Peculiarities of IEEE notation
•Floating point numbers have to be represented in a normalized form to
maximize the use of available mantissa digits.
•In a base-2 representation, this implies that the MSB of the mantissa is
always equal to 1.
•If every number is normalized, then the MSB of the mantissa is always 1.
We can do away without storing the MSB.
•IEEE notation assumes that all numbers are normalized so that the MSB
of the mantissa is a 1 and does not store this bit.
•So the real MSB of a number in the IEEE notation is either a 0 or a 1.
•The values of the numbers represented in the IEEE single precision
notation are of the form:
(+,-) 1.M x 2(E - 127)
•The hidden 1 forms the integer part of the mantissa.
•Note that excess-127 and excess-1023 (not excess-128 or excess-1024) are used
to represent the exponent.
Exponent field
In the IEEE representation, the exponent is in excess-127 (excess-1023)
notation.
The actual exponents represented are:
-126 <= E <= 127 and -1022 <= E <= 1023
not
-127 <= E <= 128 and -1023 <= E <= 1024
This is because the IEEE uses the exponents -127 and 128 (and -1023 and
1024), that is the actual values 0 and 255 to represent special conditions:
- Exact zero
- Infinity
Floating point arithmetic
Addition:
3.1415 x 108 + 1.19 x 106 = 3.1415 x 108 + 0.0119 x 108 = 3.1534 x 108
Multiplication:
3.1415 x 108 x 1.19 x 106 = (3.1415 x 1.19 ) x 10(8+6)
Division:
3.1415 x 108 / 1.19 x 106 = (3.1415 / 1.19 ) x 10(8-6)
Biased exponent problem:
If a true exponent e is represented in excess-p notation, that is as e+p.
Then consider what happens under multiplication:
a. 10(x + p) * b. 10(y + p) = (a.b). 10(x + p + y +p) = (a.b). 10(x +y + 2p)
Representing the result in excess-p notation implies that the exponent
should be x+y+p. Instead it is x+y+2p.
Biases should be handled in floating point arithmetic.
Floating point arithmetic: ADD/SUB
rule
 Choose the number with the smaller exponent.
 Shift its mantissa right until the exponents of both the
numbers are equal.
 Add or subtract the mantissas.
 Determine the sign of the result.
 Normalize the result if necessary and truncate/round
to the number of mantissa bits.
Note: This does not consider the possibility of overflow/underflow.
Floating point arithmetic: MUL rule
 Add the exponents.
 Subtract the bias.
 Multiply the mantissas and determine the sign of the
result.
 Normalize the result (if necessary).
 Truncate/round the mantissa of the result.
Floating point arithmetic: DIV rule
 Subtract the exponents
 Add the bias.
 Divide the mantissas and determine the sign of the
result.
 Normalize the result if necessary.
 Truncate/round the mantissa of the result.
Note: Multiplication and division does not require alignment of the
mantissas the way addition and subtraction does.
Guard bits
While adding two floating point numbers with 24-bit mantissas, we shift
the mantissa of the number with the smaller exponent to the right until
the two exponents are equalized.
This implies that mantissa bits may be lost during the right shift (that is,
bits of precision may be shifted out of the mantissa being shifted).
To prevent this, floating point operations are implemented by keeping
guard bits, that is, extra bits of precision at the least significant end
of the mantissa.
The arithmetic on the mantissas is performed with these extra bits of
precision.
After an arithmetic operation, the guarded mantissas are:
- Normalized (if necessary)
- Converted back by a process called truncation/rounding to a 24-bit
mantissa.
Truncation/rounding
 Straight chopping:
 The guard bits (excess bits of precision) are dropped.
 Von Neumann rounding:
 If the guard bits are all 0, they are dropped.
 However, if any bit of the guard bit is a 1, then the LSB of the retained bit is
set to 1.
 Rounding:
 If there is a 1 in the MSB of the guard bit then a 1 is added to the LSB of the
retained bits.
Rounding
 Rounding is evidently the most accurate truncation
method.
 However,
 Rounding requires an addition operation.
 Rounding may require a renormalization, if the addition operation de-
normalizes the truncated number.
 IEEE uses the rounding method.
0.111111100000 rounds to 0.111111 + 0.000001
=1.000000 which must be renormalized to 0.100000

More Related Content

Similar to COA-unit-2-Arithmetic.ppt

UNIT - II.pptx
UNIT - II.pptxUNIT - II.pptx
UNIT - II.pptxamudhak10
 
DPCO-Unit 2-Combinational Circuit.pdf
DPCO-Unit 2-Combinational Circuit.pdfDPCO-Unit 2-Combinational Circuit.pdf
DPCO-Unit 2-Combinational Circuit.pdfssuser08e250
 
Logic System Design KTU Chapter-4.ppt
Logic System Design KTU Chapter-4.pptLogic System Design KTU Chapter-4.ppt
Logic System Design KTU Chapter-4.pptAlbin562191
 
Unit 3 Arithmetic building blocks and memory Design (1).pdf
Unit 3 Arithmetic building blocks and  memory Design (1).pdfUnit 3 Arithmetic building blocks and  memory Design (1).pdf
Unit 3 Arithmetic building blocks and memory Design (1).pdfShreyasMahesh
 
Lab 9 D-Flip Flops: Shift Register and Sequence Counter
Lab 9 D-Flip Flops: Shift Register and Sequence CounterLab 9 D-Flip Flops: Shift Register and Sequence Counter
Lab 9 D-Flip Flops: Shift Register and Sequence CounterKatrina Little
 
Chapter 07 Digital Alrithmetic and Arithmetic Circuits
Chapter 07 Digital Alrithmetic and Arithmetic CircuitsChapter 07 Digital Alrithmetic and Arithmetic Circuits
Chapter 07 Digital Alrithmetic and Arithmetic CircuitsSSE_AndyLi
 
Comparison among Different Adders
Comparison among Different Adders Comparison among Different Adders
Comparison among Different Adders iosrjce
 
Admissions in india 2015
Admissions in india 2015Admissions in india 2015
Admissions in india 2015Edhole.com
 
combinational-circuit.pptx it tis creative study of digital electronics for ...
combinational-circuit.pptx it tis creative study of  digital electronics for ...combinational-circuit.pptx it tis creative study of  digital electronics for ...
combinational-circuit.pptx it tis creative study of digital electronics for ...RishabhSingh308993
 
combinational-circuit (1).ppt
combinational-circuit (1).pptcombinational-circuit (1).ppt
combinational-circuit (1).pptThanmayiKumar
 
combinational circuits dispositivos .ppt
combinational circuits dispositivos .pptcombinational circuits dispositivos .ppt
combinational circuits dispositivos .pptFilibertoMoralesGarc
 
Latches time borrowing and stealing sequential
Latches time borrowing and stealing sequentialLatches time borrowing and stealing sequential
Latches time borrowing and stealing sequentialShaziyaBibi1
 
Mba admission in india
Mba admission in indiaMba admission in india
Mba admission in indiaEdhole.com
 

Similar to COA-unit-2-Arithmetic.ppt (20)

UNIT - II.pptx
UNIT - II.pptxUNIT - II.pptx
UNIT - II.pptx
 
Chapter-04.pdf
Chapter-04.pdfChapter-04.pdf
Chapter-04.pdf
 
DPCO-Unit 2-Combinational Circuit.pdf
DPCO-Unit 2-Combinational Circuit.pdfDPCO-Unit 2-Combinational Circuit.pdf
DPCO-Unit 2-Combinational Circuit.pdf
 
Logic System Design KTU Chapter-4.ppt
Logic System Design KTU Chapter-4.pptLogic System Design KTU Chapter-4.ppt
Logic System Design KTU Chapter-4.ppt
 
Unit 3 Arithmetic building blocks and memory Design (1).pdf
Unit 3 Arithmetic building blocks and  memory Design (1).pdfUnit 3 Arithmetic building blocks and  memory Design (1).pdf
Unit 3 Arithmetic building blocks and memory Design (1).pdf
 
Lab 9 D-Flip Flops: Shift Register and Sequence Counter
Lab 9 D-Flip Flops: Shift Register and Sequence CounterLab 9 D-Flip Flops: Shift Register and Sequence Counter
Lab 9 D-Flip Flops: Shift Register and Sequence Counter
 
Chapter 07 Digital Alrithmetic and Arithmetic Circuits
Chapter 07 Digital Alrithmetic and Arithmetic CircuitsChapter 07 Digital Alrithmetic and Arithmetic Circuits
Chapter 07 Digital Alrithmetic and Arithmetic Circuits
 
Mcsl 17 ALP lab manual
Mcsl 17 ALP lab manualMcsl 17 ALP lab manual
Mcsl 17 ALP lab manual
 
Comparison among Different Adders
Comparison among Different Adders Comparison among Different Adders
Comparison among Different Adders
 
3306565.ppt
3306565.ppt3306565.ppt
3306565.ppt
 
Admissions in india 2015
Admissions in india 2015Admissions in india 2015
Admissions in india 2015
 
combinational-circuit.pptx it tis creative study of digital electronics for ...
combinational-circuit.pptx it tis creative study of  digital electronics for ...combinational-circuit.pptx it tis creative study of  digital electronics for ...
combinational-circuit.pptx it tis creative study of digital electronics for ...
 
QC-UNIT 2.ppt
QC-UNIT 2.pptQC-UNIT 2.ppt
QC-UNIT 2.ppt
 
combinational-circuit (1).ppt
combinational-circuit (1).pptcombinational-circuit (1).ppt
combinational-circuit (1).ppt
 
combinational-circuit (1).ppt
combinational-circuit (1).pptcombinational-circuit (1).ppt
combinational-circuit (1).ppt
 
combinational circuits dispositivos .ppt
combinational circuits dispositivos .pptcombinational circuits dispositivos .ppt
combinational circuits dispositivos .ppt
 
Unit 2.pptx
Unit 2.pptxUnit 2.pptx
Unit 2.pptx
 
Latches time borrowing and stealing sequential
Latches time borrowing and stealing sequentialLatches time borrowing and stealing sequential
Latches time borrowing and stealing sequential
 
Mba admission in india
Mba admission in indiaMba admission in india
Mba admission in india
 
Digital Logic circuit
Digital Logic circuitDigital Logic circuit
Digital Logic circuit
 

More from Ruhul Amin

COA-Unit5-ppt2.pptx
COA-Unit5-ppt2.pptxCOA-Unit5-ppt2.pptx
COA-Unit5-ppt2.pptxRuhul Amin
 
coa-Unit5-ppt1 (1).pptx
coa-Unit5-ppt1 (1).pptxcoa-Unit5-ppt1 (1).pptx
coa-Unit5-ppt1 (1).pptxRuhul Amin
 
COA-Unit3-ppt.pptx
COA-Unit3-ppt.pptxCOA-Unit3-ppt.pptx
COA-Unit3-ppt.pptxRuhul Amin
 
COA-Unit4-PPT.pptx
COA-Unit4-PPT.pptxCOA-Unit4-PPT.pptx
COA-Unit4-PPT.pptxRuhul Amin
 
COA-Unit2-flynns.pptx
COA-Unit2-flynns.pptxCOA-Unit2-flynns.pptx
COA-Unit2-flynns.pptxRuhul Amin
 
risc_and_cisc.ppt
risc_and_cisc.pptrisc_and_cisc.ppt
risc_and_cisc.pptRuhul Amin
 
COA-Unit-1-Basics.ppt
COA-Unit-1-Basics.pptCOA-Unit-1-Basics.ppt
COA-Unit-1-Basics.pptRuhul Amin
 

More from Ruhul Amin (7)

COA-Unit5-ppt2.pptx
COA-Unit5-ppt2.pptxCOA-Unit5-ppt2.pptx
COA-Unit5-ppt2.pptx
 
coa-Unit5-ppt1 (1).pptx
coa-Unit5-ppt1 (1).pptxcoa-Unit5-ppt1 (1).pptx
coa-Unit5-ppt1 (1).pptx
 
COA-Unit3-ppt.pptx
COA-Unit3-ppt.pptxCOA-Unit3-ppt.pptx
COA-Unit3-ppt.pptx
 
COA-Unit4-PPT.pptx
COA-Unit4-PPT.pptxCOA-Unit4-PPT.pptx
COA-Unit4-PPT.pptx
 
COA-Unit2-flynns.pptx
COA-Unit2-flynns.pptxCOA-Unit2-flynns.pptx
COA-Unit2-flynns.pptx
 
risc_and_cisc.ppt
risc_and_cisc.pptrisc_and_cisc.ppt
risc_and_cisc.ppt
 
COA-Unit-1-Basics.ppt
COA-Unit-1-Basics.pptCOA-Unit-1-Basics.ppt
COA-Unit-1-Basics.ppt
 

Recently uploaded

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksLearn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksMagic Marks
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...HenryBriggs2
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdfKamal Acharya
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayEpec Engineered Technologies
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdfKamal Acharya
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Servicemeghakumariji156
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 

Recently uploaded (20)

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksLearn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 

COA-unit-2-Arithmetic.ppt

  • 2. Addition/subtraction of signed numbers si = ci +1 = 13 7 + Y 1 0 0 0 1 0 1 1 0 0 1 1 0 1 1 0 0 1 1 0 1 0 0 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 Example: 1 0 = = 0 0 1 1 1 1 1 0 0 1 1 1 1 0 Legend for stage i xi yi Carry-in ci Sumsi Carry-outci+1 X Z + 6 0 + xi yi si Carry-out ci+1 Carry-in ci xi yi ci xi yi ci xi yi ci xi yi ci xi yi ci   = + + + yi ci xi ci xi yi + + At the ith stage: Input: ci is the carry-in Output: si is the sum ci+1 carry-out to (i+1)st state
  • 3. Addition logic for a single stage Full adder (FA) ci ci 1 + s i Sum Carry yi xi c i yi xi c i yi x i xi ci yi si c i 1 + Full Adder (FA): Symbol for the complete circuit for a single stage of addition.
  • 4. n-bit adder •Cascade n full adder (FA) blocks to form a n-bit adder. •Carries propagate or ripple through this cascade, n-bit ripple carry adder. FA c0 y1 x1 s1 FA c1 y0 x0 s0 FA cn 1 - yn 1 - xn 1 - cn sn 1 - Most significant bit (MSB) position Least significant bit (LSB) position Carry-in c0 into the LSB position provides a convenient way to perform subtraction.
  • 5. K n-bit adder K n-bit numbers can be added by cascading k n-bit adders. n-bit c 0 yn xn s n cn y0 xn 1 - s 0 ckn s k 1 - ( )n x0 yn 1 - y2n 1 - x2n 1 - ykn 1 - s n 1 - s 2n 1 - s kn 1 - xkn 1 - adder n-bit adder n-bit adder Each n-bit adder forms a block, so this is cascading of blocks. Carries ripple or propagate through blocks, Blocked Ripple Carry Adder
  • 6. n-bit subtractor FA 1 y1 x1 s1 FA c1 y0 x0 s0 FA cn 1 - yn 1 - xn 1 - cn sn 1 - Most significant bit (MSB) position Least significant bit (LSB) position •Recall X – Y is equivalent to adding 2’s complement of Y to X. •2’s complement is equivalent to 1’s complement + 1. •X – Y = X + Y + 1 •2’s complement of positive and negative numbers is computed similarly.
  • 7. n-bit adder/subtractor (contd..) Add/Sub control n-bit adder x n 1 - x 1 x 0 c n s n 1 - s 1 s 0 c 0 y n 1 - y 1 y 0 •Add/sub control = 0, addition. •Add/sub control = 1, subtraction.
  • 8. Detecting overflows  Overflows can only occur when the sign of the two operands is the same.  Overflow occurs if the sign of the result is different from the sign of the operands.  Recall that the MSB represents the sign.  xn-1, yn-1, sn-1 represent the sign of operand x, operand y and result s respectively.  Circuit to detect overflow can be implemented by the following logic expressions: 1 1 1 1 1 1         n n n n n n s y x s y x Overflow 1    n n c c Overflow
  • 9. Computing the add time Consider 0th stage: x0 y0 c0 c1 s0 FA •c1 is available after 2 gate delays. •s1 is available after 1 gate delay. c i yi xi c i yi x i xi ci yi si c i 1 + Sum Carry
  • 10. Computing the add time (contd..) x0 y0 s2 FA x0 y0 x0 y0 s1 FA c2 s0 FA c1 c3 c0 x0 y0 s3 FA c4 Cascade of 4 Full Adders, or a 4-bit adder •s0 available after 1 gate delays, c1 available after 2 gate delays. •s1 available after 3 gate delays, c2 available after 4 gate delays. •s2 available after 5 gate delays, c3 available after 6 gate delays. •s3 available after 7 gate delays, c4 available after 8 gate delays. For an n-bit adder, sn-1 is available after 2n-1 gate delays cn is available after 2n gate delays.
  • 11. Fast addition Recall the equations: i i i i i i i i i i i c y c x y x c c y x s       1 Second equation can be written as: i i i i i i c y x y x c ) ( 1     We can write: i i i i i i i i i i y x P and y x G where c P G c      1 •Gi is called generate function and Pi is called propagate function •Gi and Pi are computed only from xi and yi and not ci, thus they can be computed in one gate delay after X and Y are applied to the inputs of an n-bit adder.
  • 12. Carry lookahead ci 1  Gi  Pici ci  Gi1  Pi1ci1  ci1  Gi  Pi(Gi1  Pi 1ci 1) continuing  ci1  Gi  Pi(Gi1  Pi 1(Gi 2  Pi 2ci 2 )) until ci1  Gi  PiGi 1  Pi Pi1Gi 2  ..  PiPi1..P1G0  Pi Pi 1...P0c0 •All carries can be obtained 3 gate delays after X, Y and c0 are applied. -One gate delay for Pi and Gi -Two gate delays in the AND-OR circuit for ci+1 •All sums can be obtained 1 gate delay after the carries are computed. •Independent of n, n-bit addition requires only 4 gate delays. •This is called Carry Lookahead adder.
  • 13. Carry-lookahead adder Carry-lookahead logic B cell B cell B cell B cell s 3 P3 G3 c 3 P2 G2 c 2 s 2 G 1 c 1 P 1 s 1 G 0 c 0 P 0 s 0 . c4 x 1 y 1 x 3 y 3 x 2 y 2 x 0 y 0 G i c i . . . P i s i x i y i B cell 4-bit carry-lookahead adder B-cell for a single stage
  • 14. Carry lookahead adder (contd..)  Performing n-bit addition in 4 gate delays independent of n is good only theoretically because of fan-in constraints.  Last AND gate and OR gate require a fan-in of (n+1) for a n- bit adder.  For a 4-bit adder (n=4) fan-in of 5 is required.  Practical limit for most gates.  In order to add operands longer than 4 bits, we can cascade 4-bit Carry-Lookahead adders. Cascade of Carry-Lookahead adders is called Blocked Carry-Lookahead adder. ci1  Gi  Pi Gi1  Pi Pi1 Gi2  ..  Pi Pi1 ..P 1 G0  Pi Pi1 ...P0 c0
  • 16. Blocked Carry-Lookahead adder c4  G3  P 3G2  P 3P2G1  P 3P 2 P 1G0  P 3P 2 P 1P0c0 Carry-out from a 4-bit block can be given as: Rewrite this as: P 0 I  P 3P 2 P 1P 0 G0 I  G3  P 3G2  P 3P 2G1  P 3P2P 1G0 Subscript I denotes the blocked carry lookahead and identifies the block. Cascade 4 4-bit adders, c16 can be expressed as: c16  G3 I  P 3 I G2 I  P 3 I P 2 I G1 I  P3 I P 2 I P 1 0 G0 I  P 3 I P 2 I P 1 0 P 0 0 c0
  • 17. Blocked Carry-Lookahead adder Carry-lookahead logic 4-bit adder 4-bit adder 4-bit adder 4-bit adder s15-12 P3 I G3 I c12 P2 I G2 I c8 s11-8 G1 I c4 P1 I s7-4 G0 I c0 P0 I s3-0 c16 x15-12 y15-12 x11-8 y11-8 x7-4 y7-4 x3-0 y3-0 . After xi, yi and c0 are applied as inputs: - Gi and Pi for each stage are available after 1 gate delay. - PI is available after 2 and GI after 3 gate delays. - All carries are available after 5 gate delays. - c16 is available after 5 gate delays. - s15 which depends on c12 is available after 8 (5+3)gate delays (Recall that for a 4-bit carry lookahead adder, the last sum bit is available 3 gate delays after all inputs are available)
  • 18.
  • 19. Multiplication of unsigned numbers Product of 2 n-bit numbers is at most a 2n-bit number. Unsigned multiplication can be viewed as addition of shifted versions of the multiplicand.
  • 20. Multiplication of unsigned numbers (contd..)  We added the partial products at end.  Alternative would be to add the partial products at each stage.  Rules to implement multiplication are:  If the ith bit of the multiplier is 1, shift the multiplicand and add the shifted multiplicand to the current value of the partial product.  Hand over the partial product to the next stage  Value of the partial product at the start stage is 0.
  • 21. Multiplication of unsigned numbers ith multiplier bit carry in carry out jth multiplicand bit ith multiplier bit Bit of incoming partial product (PPi) Bit of outgoing partial product (PP(i+1)) FA Typical multiplication cell
  • 22. Combinatorial array multiplier Multiplicand m3 m2 m1 m0 0 0 0 0 q3 q2 q1 q0 0 p2 p1 p0 0 0 0 p3 p4 p5 p6 p7 PP1 PP2 PP3 (PP0) , Product is: p7,p6,..p0 Multiplicand is shifted by displacing it through an array of adders. Combinatorial array multiplier
  • 23. Combinatorial array multiplier (contd..)  Combinatorial array multipliers are:  Extremely inefficient.  Have a high gate count for multiplying numbers of practical size such as 32- bit or 64-bit numbers.  Perform only one function, namely, unsigned integer product.  Improve gate efficiency by using a mixture of combinatorial array techniques and sequential techniques requiring less combinational logic.
  • 24. Sequential multiplication  Recall the rule for generating partial products:  If the ith bit of the multiplier is 1, add the appropriately shifted multiplicand to the current partial product.  Multiplicand has been shifted left when added to the partial product.  However, adding a left-shifted multiplicand to an unshifted partial product is equivalent to adding an unshifted multiplicand to a right-shifted partial product.
  • 25. Sequential Circuit Multiplier q n 1 - m n 1 - n-bit Adder Multiplicand M Control sequencer Multiplier Q 0 C Shift right Register A (initially 0) Add/Noadd control a n 1 - a 0 q 0 m 0 0 MUX
  • 26. Sequential multiplication (contd..) 1 1 1 1 1 0 1 1 1 1 1 1 1 1 1 0 1 1 1 0 1 1 0 1 1 1 0 1 Initial configuration Add M 1 1 0 1 C First cycle Second cycle Third cycle Fourth cycle No add Shift Shift Add Shift Shift Add 1 1 1 1 0 0 0 1 0 0 0 1 0 0 0 0 0 0 1 1 0 1 1 0 1 0 0 1 1 1 0 0 1 0 1 0 0 0 0 0 1 1 0 0 0 1 0 0 1 1 0 1 1 Q A Product
  • 27.
  • 28. Signed Multiplication  Considering 2’s-complement signed operands, what will happen to (- 13)(+11) if following the same method of unsigned multiplication? Sign extension of negative multiplicand. 1 0 1 1 1 1 1 1 0 0 1 1 1 1 0 1 1 0 1 0 1 0 0 0 1 1 1 0 1 1 0 0 0 0 0 0 1 1 0 0 1 1 1 0 0 0 0 0 0 0 0 1 1 0 0 1 1 1 1 1 13 - ( ) 143 - ( ) 11 + ( ) Sign extension is shown in blue
  • 29. Signed Multiplication  For a negative multiplier, a straightforward solution is to form the 2’s-complement of both the multiplier and the multiplicand and proceed as in the case of a positive multiplier.  This is possible because complementation of both operands does not change the value or the sign of the product.  A technique that works equally well for both negative and positive multipliers – Booth algorithm.
  • 30. Booth Algorithm  Consider in a multiplication, the multiplier is positive 0011110, how many appropriately shifted versions of the multiplicand are added in a standard procedure? 0 0 0 1 0 1 1 0 1 0 0 0 0 0 0 0 1 0 0 1 1 0 1 0 1 0 1 1 0 1 0 1 0 1 1 0 1 0 1 0 1 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 1 0 1 0 0 0 0 1 + 1 + 1 + 1 +
  • 31. Booth Algorithm  Since 0011110 = 0100000 – 0000010, if we use the expression to the right, what will happen? 0 1 0 1 0 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 0 1 0 0 1 0 0 0 0 0 0 1 0 1 1 0 1 0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 0 1 0 0 0 1 2's complement of the multiplicand 0 0 0 0 1 + 1 - 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  • 32. Booth Algorithm  In general, in the Booth scheme, -1 times the shifted multiplicand is selected when moving from 0 to 1, and +1 times the shifted multiplicand is selected when moving from 1 to 0, as the multiplier is scanned from right to left. Booth recoding of a multiplier. 0 0 1 1 0 1 0 1 1 1 0 0 1 1 0 1 0 0 0 0 0 0 0 0 0 0 1 + 1 - 1 - 1 + 1 - 1 + 1 - 1 + 1 - 1 +
  • 33. Booth Algorithm Booth multiplication with a negative multiplier. 0 1 0 1 1 1 1 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 1 1 0 1 1 0 0 1 1 1 0 0 0 0 0 0 0 1 0 0 0 1 1 1 1 1 1 1 0 1 1 0 1 1 1 0 1 0 6 - ( ) 13 + ( ) X 78 - ( ) +1 1 - 1 -
  • 34. Booth Algorithm Multiplier Bit i Bit i 1 - Version of multiplicand selected by bit i 0 1 0 0 0 1 1 1 0 M 1 + M 1  M 0 M Booth multiplier recoding table. X X X X
  • 35. Booth Algorithm  Best case – a long string of 1’s (skipping over 1s)  Worst case – 0’s and 1’s are alternating 1 0 1 1 1 0 0 0 0 1 1 1 1 1 0 0 0 0 0 0 1 1 1 1 0 1 1 0 1 0 0 0 1 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + 1 + Worst-case multiplier Ordinary multiplier Good multiplier
  • 37.
  • 38. Bit-Pair Recoding of Multipliers  Bit-pair recoding halves the maximum number of summands (versions of the multiplicand). 1 + 1  (a) Example of bit-pair recoding derived from Booth recoding 0 0 0 0 1 1 0 1 0 Implied 0 to right of LSB 1 0 Sign extension 1 2 1   
  • 39. Bit-Pair Recoding of Multipliers i 1 + i 1 (b) Table of multiplicand selection decisions selected at position i Multiplicand Multiplier bit-pair i 0 0 1 1 1 0 1 0 1 1 1 1 0 0 0 1 1 0 0 1 0 0 1 Multiplier bit on the right 0 0 X M 1 + 1 1 + 0 1 2 2 +     X M X M X M X M X M X M X M
  • 40. Bit-Pair Recoding of Multipliers 40 1 - 0 0 0 0 1 1 1 1 1 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 0 1 1 0 1 0 1 0 1 0 0 1 1 1 1 1 1 1 1 1 0 0 1 1 0 0 0 0 0 0 1 1 1 0 1 1 0 0 1 0 0 1 0 0 1 0 1 0 0 0 0 1 0 0 1 1 0 0 0 1 0 0 1 1 0 1 1 1 1 - 6 - ( ) 13 + ( ) 1 + 78 - ( ) 1 - 2 - ´ Figure 6.15. Multiplication requiring only n/2 summands.
  • 41. Carry-Save Addition of Summands  CSA speeds up the addition process. 41 P7 P6 P5 P4 P3 P2 P1 P0
  • 42. Carry-Save Addition of Summands(Cont.,) P3 P2 P1 P0 P5 P4 P7 P6
  • 43. Carry-Save Addition of Summands(Cont.,)  Consider the addition of many summands, we can:  Group the summands in threes and perform carry-save addition on each of these groups in parallel to generate a set of S and C vectors in one full-adder delay  Group all of the S and C vectors into threes, and perform carry-save addition on them, generating a further set of S and C vectors in one more full-adder delay  Continue with this process until there are only two vectors remaining  They can be added in a RCA or CLA to produce the desired product
  • 44. Carry-Save Addition of Summands Figure 6.17. A multiplication example used to illustrate carry-save addition as shown in Figure 6.18. 1 0 0 1 1 1 1 0 0 1 1 1 1 0 0 1 1 1 1 1 1 1 1 1 1 0 0 1 1 1 M Q A B C D E F (2,835) X (45) (63) 1 0 0 1 1 1 1 0 0 1 1 1 1 0 0 1 1 1 0 0 0 1 1 1 1 1 1 0 0 0 Product
  • 45. Figure 6.18. The multiplication example from Figure 6.17 performed using carry-save addition. 0 0 0 0 0 1 0 1 0 1 0 1 0 0 1 0 0 0 0 1 1 1 1 + 1 0 0 0 0 1 1 1 1 0 0 1 0 1 1 1 0 1 0 1 0 1 1 0 1 1 0 0 0 0 0 1 1 0 1 0 0 0 0 1 0 0 0 1 0 1 1 1 0 1 1 1 0 0 0 1 1 0 0 0 1 1 1 1 0 0 0 0 1 1 0 1 1 0 1 1 0 0 1 0 0 1 1 0 0 1 1 1 1 0 0 1 1 1 1 0 0 1 1 1 0 0 1 1 0 1 1 0 1 1 0 0 1 0 0 1 1 0 0 1 1 1 1 0 0 1 1 1 1 0 0 1 1 1 1 1 1 1 1 1 1 0 0 1 1 1 M Q A B C S 1 C 1 D E F S 2 C 2 S1 C 1 S2 S 3 C3 C2 S4 C 4 Product x
  • 46.
  • 47. Manual Division Longhand division examples. 1101 1 13 14 26 21 274 100010010 10101 1101 1 1110 1101 10000 13 1101
  • 48. Longhand Division Steps  Position the divisor appropriately with respect to the dividend and performs a subtraction.  If the remainder is zero or positive, a quotient bit of 1 is determined, the remainder is extended by another bit of the dividend, the divisor is repositioned, and another subtraction is performed.  If the remainder is negative, a quotient bit of 0 is determined, the dividend is restored by adding back the divisor, and the divisor is repositioned for another subtraction.
  • 49. Circuit Arrangement Figure 6.21. Circuit arrangement for binary division. qn-1 Divisor M Control Sequencer Dividend Q Shift left N+1 bit adder q0 Add/Subtract Quotient Setting A m0 0 mn-1 a0 an an-1
  • 50. Restoring Division  Shift A and Q left one binary position  Subtract M from A, and place the answer back in A  If the sign of A is 1, set q0 to 0 and add M back to A (restore A); otherwise, set q0 to 1  Repeat these steps n times
  • 51. Examples 1 0 1 1 1 Figure 6.22. A restoring-division example. 1 1 1 1 1 0 1 1 1 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 1 1 1 1 1 0 1 0 0 0 1 Subtract Shift Restore 1 0 0 0 0 1 0 0 0 0 1 1 Initially Subtract Shift 1 0 1 1 1 1 0 0 0 0 1 1 0 0 0 0 0 0 0 0 Subtract Shift Restore 1 0 1 1 1 0 1 0 0 0 1 0 0 0 0 1 1 Quotient Remainder Shift 1 0 1 1 1 1 0 0 0 0 Subtract Second cycle First cycle Third cycle Fourth cycle 0 0 0 0 0 0 1 0 1 1 0 0 0 0 1 1 1 0 0 0 0 1 1 1 1 1 Restore q0 Set q0 Set q0 Set q0 Set
  • 52. Nonrestoring Division  Avoid the need for restoring A after an unsuccessful subtraction.  Any idea?  Step 1: (Repeat n times) If the sign of A is 0, shift A and Q left one bit position and subtract M from A; otherwise, shift A and Q left and add M to A. Now, if the sign of A is 0, set q0 to 1; otherwise, set q0 to 0.  Step2: If the sign of A is 1, add M to A
  • 53. Examples A nonrestoring-division example. Add Restore remainder Remainder 0 0 0 0 1 1 1 1 1 1 0 0 0 1 1 1 Quotient 0 0 1 0 1 1 1 1 1 0 0 0 0 1 1 1 1 1 Shift 0 0 0 1 1 0 0 0 0 1 1 1 1 Add 0 0 0 1 1 0 0 0 0 1 0 0 0 1 1 1 0 1 Shift Subtract Initially 0 0 0 0 0 1 0 0 0 1 1 1 0 0 0 0 0 1 1 1 0 0 0 0 0 1 1 0 0 0 Shift Add 0 0 1 0 0 0 0 1 1 1 1 0 1 Shift Subtract 0 0 0 1 1 0 0 0 0 Fourth cycle Third cycle Second cycle First cycle q 0 Set q 0 Set q 0 Set q 0 Set
  • 54.
  • 55. Fractions If b is a binary vector, then we have seen that it can be interpreted as an unsigned integer by: V(b) = b31.231 + b30.230 + bn-3.229 + .... + b1.21 + b0.20 This vector has an implicit binary point to its immediate right: b31b30b29....................b1b0. implicit binary point Suppose if the binary vector is interpreted with the implicit binary point is just left of the sign bit: implicit binary point .b31b30b29....................b1b0 The value of b is then given by: V(b) = b31.2-1 + b30.2-2 + b29.2-3 + .... + b1.2-31 + b0.2-32
  • 56. Range of fractions The value of the unsigned binary fraction is: V(b) = b31.2-1 + b30.2-2 + b29.2-3 + .... + b1.2-31 + b0.2-32 The range of the numbers represented in this format is: In general for a n-bit binary fraction (a number with an assumed binary point at the immediate left of the vector), then the range of values is: 9999999998 . 0 2 1 ) ( 0 32      b V n b V     2 1 ) ( 0
  • 57. Scientific notation •Previous representations have a fixed point. Either the point is to the immediate right or it is to the immediate left. This is called Fixed point representation. •Fixed point representation suffers from a drawback that the representation can only represent a finite range (and quite small) range of numbers. A more convenient representation is the scientific representation, where the numbers are represented in the form: x  m1.m2m3m4  be Components of these numbers are: Mantissa (m), implied base (b), and exponent (e)
  • 58. Significant digits A number such as the following is said to have 7 significant digits x  0.m1m2m3m4m5m6m7  be Fractions in the range 0.0 to 0.9999999 need about 24 bits of precision (in binary). For example the binary fraction with 24 1’s: 111111111111111111111111 = 0.9999999404 Not every real number between 0 and 0.9999999404 can be represented by a 24-bit fractional number. The smallest non-zero number that can be represented is: 000000000000000000000001 = 5.96046 x 10-8 Every other non-zero number is constructed in increments of this value.
  • 59. Sign and exponent digits •In a 32-bit number, suppose we allocate 24 bits to represent a fractional mantissa. •Assume that the mantissa is represented in sign and magnitude format, and we have allocated one bit to represent the sign. •We allocate 7 bits to represent the exponent, and assume that the exponent is represented as a 2’s complement integer. •There are no bits allocated to represent the base, we assume that the base is implied for now, that is the base is 2. •Since a 7-bit 2’s complement number can represent values in the range -64 to 63, the range of numbers that can be represented is: 0.0000001 x 2-64 < = | x | <= 0.9999999 x 263 •In decimal representation this range is: 0.5421 x 10-20 < = | x | <= 9.2237 x 1018
  • 60. A sample representation Sign Exponent Fractional mantissa bit 1 7 24 •24-bit mantissa with an implied binary point to the immediate left •7-bit exponent in 2’s complement form, and implied base is 2.
  • 61. Normalization If the number is to be represented using only 7 significant mantissa digits, the representation ignoring rounding is: Consider the number: x = 0.0004056781 x 1012 x = 0.0004056 x 1012 If the number is shifted so that as many significant digits are brought into 7 available slots: x = 0.4056781 x 109 = 0.0004056 x 1012 Exponent of x was decreased by 1 for every left shift of x. A number which is brought into a form so that all of the available mantissa digits are optimally used (this is different from all occupied which may not hold), is called a normalized number. Same methodology holds in the case of binary mantissas 0001101000(10110) x 28 = 1101000101(10) x 25
  • 62. Normalization (contd..) •A floating point number is in normalized form if the most significant 1 in the mantissa is in the most significant bit of the mantissa. •All normalized floating point numbers in this system will be of the form: 0.1xxxxx.......xx Range of numbers representable in this system, if every number must be normalized is: 0.5 x 2-64 <= | x | < 1 x 263
  • 63. Normalization, overflow and underflow The procedure for normalizing a floating point number is: Do (until MSB of mantissa = = 1) Shift the mantissa left (or right) Decrement (increment) the exponent by 1 end do Applying the normalization procedure to: .000111001110....0010 x 2-62 gives: .111001110........ x 2-65 But we cannot represent an exponent of –65, in trying to normalize the number we have underflowed our representation. Applying the normalization procedure to: 1.00111000............x 263 gives: 0.100111..............x 264 This overflows the representation.
  • 64. Changing the implied base So far we have assumed an implied base of 2, that is our floating point numbers are of the form: x = m 2e If we choose an implied base of 16, then: x = m 16e Then: y = (m.16) .16e-1 (m.24) .16e-1 = m . 16e = x •Thus, every four left shifts of a binary mantissa results in a decrease of 1 in a base 16 exponent. •Normalization in this case means shifting the mantissa until there is a 1 in the first four bits of the mantissa.
  • 65. Excess notation •Rather than representing an exponent in 2’s complement form, it turns out to be more beneficial to represent the exponent in excess notation. •If 7 bits are allocated to the exponent, exponents can be represented in the range of -64 to +63, that is: -64 <= e <= 63 Exponent can also be represented using the following coding called as excess-64: E’ = Etrue + 64 In general, excess-p coding is represented as: E’ = Etrue + p True exponent of -64 is represented as 0 0 is represented as 64 63 is represented as 127 This enables efficient comparison of the relative sizes of two floating point numbers.
  • 66. IEEE notation IEEE Floating Point notation is the standard representation in use. There are two representations: - Single precision. - Double precision. Both have an implied base of 2. Single precision: - 32 bits (23-bit mantissa, 8-bit exponent in excess-127 representation) Double precision: - 64 bits (52-bit mantissa, 11-bit exponent in excess-1023 representation) Fractional mantissa, with an implied binary point at immediate left. Sign Exponent Mantissa 1 8 or 11 23 or 52
  • 67. Peculiarities of IEEE notation •Floating point numbers have to be represented in a normalized form to maximize the use of available mantissa digits. •In a base-2 representation, this implies that the MSB of the mantissa is always equal to 1. •If every number is normalized, then the MSB of the mantissa is always 1. We can do away without storing the MSB. •IEEE notation assumes that all numbers are normalized so that the MSB of the mantissa is a 1 and does not store this bit. •So the real MSB of a number in the IEEE notation is either a 0 or a 1. •The values of the numbers represented in the IEEE single precision notation are of the form: (+,-) 1.M x 2(E - 127) •The hidden 1 forms the integer part of the mantissa. •Note that excess-127 and excess-1023 (not excess-128 or excess-1024) are used to represent the exponent.
  • 68. Exponent field In the IEEE representation, the exponent is in excess-127 (excess-1023) notation. The actual exponents represented are: -126 <= E <= 127 and -1022 <= E <= 1023 not -127 <= E <= 128 and -1023 <= E <= 1024 This is because the IEEE uses the exponents -127 and 128 (and -1023 and 1024), that is the actual values 0 and 255 to represent special conditions: - Exact zero - Infinity
  • 69. Floating point arithmetic Addition: 3.1415 x 108 + 1.19 x 106 = 3.1415 x 108 + 0.0119 x 108 = 3.1534 x 108 Multiplication: 3.1415 x 108 x 1.19 x 106 = (3.1415 x 1.19 ) x 10(8+6) Division: 3.1415 x 108 / 1.19 x 106 = (3.1415 / 1.19 ) x 10(8-6) Biased exponent problem: If a true exponent e is represented in excess-p notation, that is as e+p. Then consider what happens under multiplication: a. 10(x + p) * b. 10(y + p) = (a.b). 10(x + p + y +p) = (a.b). 10(x +y + 2p) Representing the result in excess-p notation implies that the exponent should be x+y+p. Instead it is x+y+2p. Biases should be handled in floating point arithmetic.
  • 70. Floating point arithmetic: ADD/SUB rule  Choose the number with the smaller exponent.  Shift its mantissa right until the exponents of both the numbers are equal.  Add or subtract the mantissas.  Determine the sign of the result.  Normalize the result if necessary and truncate/round to the number of mantissa bits. Note: This does not consider the possibility of overflow/underflow.
  • 71. Floating point arithmetic: MUL rule  Add the exponents.  Subtract the bias.  Multiply the mantissas and determine the sign of the result.  Normalize the result (if necessary).  Truncate/round the mantissa of the result.
  • 72. Floating point arithmetic: DIV rule  Subtract the exponents  Add the bias.  Divide the mantissas and determine the sign of the result.  Normalize the result if necessary.  Truncate/round the mantissa of the result. Note: Multiplication and division does not require alignment of the mantissas the way addition and subtraction does.
  • 73. Guard bits While adding two floating point numbers with 24-bit mantissas, we shift the mantissa of the number with the smaller exponent to the right until the two exponents are equalized. This implies that mantissa bits may be lost during the right shift (that is, bits of precision may be shifted out of the mantissa being shifted). To prevent this, floating point operations are implemented by keeping guard bits, that is, extra bits of precision at the least significant end of the mantissa. The arithmetic on the mantissas is performed with these extra bits of precision. After an arithmetic operation, the guarded mantissas are: - Normalized (if necessary) - Converted back by a process called truncation/rounding to a 24-bit mantissa.
  • 74. Truncation/rounding  Straight chopping:  The guard bits (excess bits of precision) are dropped.  Von Neumann rounding:  If the guard bits are all 0, they are dropped.  However, if any bit of the guard bit is a 1, then the LSB of the retained bit is set to 1.  Rounding:  If there is a 1 in the MSB of the guard bit then a 1 is added to the LSB of the retained bits.
  • 75. Rounding  Rounding is evidently the most accurate truncation method.  However,  Rounding requires an addition operation.  Rounding may require a renormalization, if the addition operation de- normalizes the truncated number.  IEEE uses the rounding method. 0.111111100000 rounds to 0.111111 + 0.000001 =1.000000 which must be renormalized to 0.100000