3-3-Arithmetic Coding

The document discusses arithmetic coding, an entropy encoding technique. It begins with an introduction to arithmetic coding and how it recursively partitions the range [0,1) based on symbol probabilities to map a sequence to a unique value within the range. Examples are provided to demonstrate how encoding and decoding work by successively updating the range based on input symbols. The document notes limitations of early implementations and outlines techniques like scaling and incremental coding to allow for incremental output and avoid precision issues.

Uploaded by

MuhamadAndi


Multimedia Systems

Arithmetic Coding

CMPT365 Multimedia Systems 1

Outline – Part I

 Introduction
 Basic Encoding and Decoding
 Scaling and Incremental Coding
 Integer Implementation
 Adaptive Arithmetic Coding
 Binary Arithmetic Coding
 Applications
 JBIG, H.264, JPEG 2000


Limitations of Huffman Code
 Need a probability distribution

 Hard to adapt to changing statistics

 Minimum codeword length is 1 bit

 Serious penalty for high-probability symbols

 Example: Binary source, P(0)=0.9

• Entropy: -0.9*log2(0.9) - 0.1*log2(0.1) = 0.469 bit
• Huffman code: 0, 1 → Avg. code length: 1 bit
• Joint coding is not practical for a large alphabet.

 Arithmetic coding:
 Can resolve all of these problems.
 Code a sequence of symbols without having to generate codes for all
sequences of that length.
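
The gap between entropy and the Huffman code length in the binary example can be checked with a few lines of Python. This is only an illustrative sketch; the function name `entropy` is an assumption, not part of the slides.

```python
import math

def entropy(probs):
    """Shannon entropy in bits per symbol for a list of probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

h = entropy([0.9, 0.1])
huffman_avg_len = 1.0  # codewords "0" and "1", one bit each
print(f"entropy = {h:.3f} bit, Huffman = {huffman_avg_len} bit")
print(f"redundancy = {huffman_avg_len - h:.3f} bit/symbol")
```

The Huffman code spends more than twice the entropy here, which is the penalty for high-probability symbols mentioned above.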


Introduction
 Recall table look-up decoding of Huffman code
   N: alphabet size
   L: Max codeword length
   Divide [0, 2^L] into N intervals
   One interval for one symbol
   Interval size is roughly proportional to symbol prob.
 Arithmetic coding applies this idea recursively
   Normalizes the range [0, 2^L] to [0, 1].
   Map a sequence to a unique tag in [0, 1).
[Figure: Huffman codewords 1, 00, 010, 011 and their look-up table intervals starting at 000, 010, 011, 100; sequences abcd….. and dcba….. mapped to points between 0 and 1.]

Arithmetic Coding
 Disjoint and complete partition of the range [0, 1)
   Symbols a, b, c: [0, 0.8), [0.8, 0.82), [0.82, 1)
   Each interval corresponds to one symbol
   Interval size is proportional to symbol probability
 The first symbol restricts the tag position to be in one of the intervals
 The reduced interval is partitioned recursively as more symbols are processed.
 Observation: once the tag falls into an interval, it never gets out of it

Some Questions to think about:

 Why is compression achieved this way?
 How to implement it efficiently?
 How to decode the sequence?
 Why is it better than Huffman code?

Example:
Symbol  Prob.  Interval
1       0.8    [0, 0.8)
2       0.02   [0.8, 0.82)
3       0.18   [0.82, 1.0)

 Map to real line range [0, 1)
 Order does not matter
   Decoder needs to use the same order
 Disjoint but complete partition:
   1: [0, 0.8): 0, 0.799999…9
   2: [0.8, 0.82): 0.8, 0.819999…9
   3: [0.82, 1): 0.82, 0.999999…9
   (Think about the impact on integer implementation)

Encoding
 Input sequence: “1321”

Each row shows the current interval and its partition into symbols 1, 2, 3:

Input    Range    LOW     1|2 boundary  2|3 boundary  HIGH
(start)  1        0       0.8           0.82          1.0
1        0.8      0       0.64          0.656         0.8
3        0.144    0.656   0.7712        0.77408       0.8
2        0.00288  0.7712  0.773504      0.7735616     0.77408

The last input 1 selects the first sub-interval.
Final range: [0.7712, 0.773504): Encode 0.7712
Difficulties:
1. Shrinking of interval requires high precision for long sequence.
2. No output is generated until the entire sequence has been processed.

Cumulative Distribution Function (CDF)

 For continuous distribution:
$$F_X(x) = P(X \le x) = \int_{-\infty}^{x} p(t)\,dt$$
 For discrete distribution:
$$F_X(i) = P(X \le i) = \sum_{k=1}^{i} P(X = k)$$
 Example:

X                          1    2    3    4
Probability Mass Function  0.2  0.2  0.4  0.2
CDF                        0.2  0.4  0.8  1.0

 Properties:
   Non-decreasing
   Piece-wise constant
   Each segment is closed at the lower end.

Encoder Pseudo Code
 Keep track of LOW, HIGH, RANGE
   Any two are sufficient, e.g., LOW and RANGE.

LOW = 0.0; HIGH = 1.0;
while (not EOF) {
    n = ReadSymbol();
    RANGE = HIGH - LOW;
    HIGH = LOW + RANGE * CDF(n);      // uses the old LOW
    LOW = LOW + RANGE * CDF(n-1);
}
output LOW;

Input    HIGH                                LOW                             RANGE
Initial  1.0                                 0.0                             1.0
1        0.0 + 1.0 * 0.8 = 0.8               0.0 + 1.0 * 0 = 0.0             0.8
3        0.0 + 0.8 * 1 = 0.8                 0.0 + 0.8 * 0.82 = 0.656        0.144
2        0.656 + 0.144 * 0.82 = 0.77408      0.656 + 0.144 * 0.8 = 0.7712    0.00288
1        0.7712 + 0.00288 * 0.8 = 0.773504   0.7712 + 0.00288 * 0 = 0.7712   0.002304
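
The encoder loop can be tried directly in Python. The sketch below is an illustration, not the course's reference code: it uses `fractions.Fraction` (an assumption made here to avoid floating-point rounding) and a hypothetical `encode` helper, with the CDF values of the three-symbol example.

```python
from fractions import Fraction

# CDF[n] = P(symbol <= n), with CDF[0] = 0; values from the example table.
CDF = [Fraction(0), Fraction(8, 10), Fraction(82, 100), Fraction(1)]

def encode(symbols):
    """Return the final [low, high) interval for a sequence of symbols 1..3."""
    low, high = Fraction(0), Fraction(1)
    for n in symbols:
        rng = high - low
        high = low + rng * CDF[n]       # both updates use the old low
        low = low + rng * CDF[n - 1]
    return low, high

low, high = encode([1, 3, 2, 1])
print(float(low), float(high))
```

For the input 1321 this reproduces the final range [0.7712, 0.773504) of the table.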

Decoding
 Receive 0.7712

Each row shows the current interval, its partition, and the sub-interval that contains 0.7712:

LOW     1|2 boundary  2|3 boundary  HIGH     Decision
0       0.8           0.82          1.0      Decode 1
0       0.64          0.656         0.8      Decode 3
0.656   0.7712        0.77408       0.8      Decode 2
0.7712  0.773504      0.7735616     0.77408  Decode 1

Drawback: need to recalculate all thresholds each time.

Simplified Decoding
 Normalize RANGE to [0, 1) each time: x ← (x - low) / range
 No need to recalculate the thresholds: x is always compared with 0, 0.8, 0.82, 1.0.

Receive 0.7712
x = 0.7712                        Decode 1
x = (0.7712 - 0) / 0.8 = 0.964    Decode 3
x = (0.964 - 0.82) / 0.18 = 0.8   Decode 2
x = (0.8 - 0.8) / 0.02 = 0        Decode 1

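Decoder Pseudo Code

x = Encoded_number;
// A test such as (x ≠ LOW) cannot tell "132" from "1321": x already reaches 0
// after the 2. The decoder therefore needs the number of symbols
// (NumSymbols, assumed known here) or a dedicated end-of-sequence symbol.
for (i = 0; i < NumSymbols; i++) {
    n = DecodeOneSymbol(x);      // find n with CDF(n-1) <= x < CDF(n)
    output symbol n;
    x = (x - CDF(n-1)) / (CDF(n) - CDF(n-1));
}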
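
A small Python sketch of the simplified decoder follows. It assumes the decoder is told how many symbols to decode (the `count` parameter is an assumption of this sketch) and again uses exact fractions instead of floats.

```python
from fractions import Fraction

CDF = [Fraction(0), Fraction(8, 10), Fraction(82, 100), Fraction(1)]

def decode(tag, count):
    """Decode `count` symbols from a tag in [0, 1) by repeated renormalization."""
    x = Fraction(tag)
    out = []
    for _ in range(count):
        n = 1
        while x >= CDF[n]:          # find n with CDF[n-1] <= x < CDF[n]
            n += 1
        out.append(n)
        x = (x - CDF[n - 1]) / (CDF[n] - CDF[n - 1])
    return out

print(decode("0.7712", 4))
```

With `count` set to 3 the same tag decodes to 1, 3, 2, which shows why the tag alone does not determine the sequence length.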


Scaling and Incremental Coding
 Problems of Previous examples:
 Need high precision
 No output is generated until the entire sequence is encoded

 Key Observation:
 As the RANGE reduces, many MSB’s of LOW and HIGH become identical:

• Example: Binary form of 0.7712 and 0.773504: 0.1100010..., 0.1100011...
 We can output identical MSBs and re-scale the rest:
• → Incremental encoding
 This also allows us to achieve infinite precision with finite-precision integers.
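
The key observation can be illustrated by extracting the binary digits that LOW and HIGH share. The helper `common_msbs` below is a hypothetical name used only for this sketch.

```python
from fractions import Fraction

def common_msbs(low, high, max_bits=32):
    """Binary digits after the point that low and high have in common."""
    low, high = Fraction(low), Fraction(high)
    bits = []
    for _ in range(max_bits):
        low, high = low * 2, high * 2
        b_low, b_high = int(low), int(high)   # next binary digit of each value
        if b_low != b_high:
            break
        bits.append(str(b_low))
        low, high = low - b_low, high - b_high
    return "".join(bits)

print(common_msbs("0.7712", "0.773504"))
```

For the final interval of the example, the first six bits agree, so they could have been sent before the end of the sequence.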

E1 and E2 Scaling
 E1: [LOW, HIGH) in [0, 0.5)
   LOW: 0.0xxxxxxx (binary),
   HIGH: 0.0xxxxxxx.
   Output 0, then shift left by 1 bit
   [0, 0.5) → [0, 1): E1(x) = 2x
 E2: [LOW, HIGH) in [0.5, 1)
   LOW: 0.1xxxxxxx,
   HIGH: 0.1xxxxxxx.
   Output 1, subtract 0.5, shift left by 1 bit
   [0.5, 1) → [0, 1): E2(x) = 2(x - 0.5)

Encoding with E1 and E2
Symbol  Prob.
1       0.8
2       0.02
3       0.18

Step               LOW     HIGH      Output
Input 1            0       0.8
Input 3            0.656   0.8
  E2: 2(x – 0.5)   0.312   0.6       1
Input 2            0.5424  0.54816
  E2               0.0848  0.09632   1
  E1: 2x           0.1696  0.19264   0
  E1               0.3392  0.38528   0
  E1               0.6784  0.77056   0
  E2               0.3568  0.54112   1
Input 1            0.3568  0.504256
  Encode any value in the tag, e.g., 0.5   1
All outputs: 1100011

To verify

 LOW = 0.5424 (0.10001010... in binary),
 HIGH = 0.54816 (0.10001100... in binary).
 So we can send out 10001 (0.53125)
   Equivalent to E2 E1 E1 E1 E2
 After left shift by 5 bits:
   LOW = (0.5424 – 0.53125) x 32 = 0.3568
   HIGH = (0.54816 – 0.53125) x 32 = 0.54112
   Same as the result on the previous slide.

Comparison with Huffman
Symbol  Prob.
1       0.8
2       0.02
3       0.18

 Note: Complete all possible scaling before encoding the next symbol
 Input Symbol 1 does not cause any output
 Input Symbol 3 generates 1 bit
 Input Symbol 2 generates 5 bits
 Symbols with larger probabilities generate fewer bits.
   Sometimes no bit is generated at all
   Advantage over Huffman coding
 Large probabilities are desired in arithmetic coding
   Can use a context-adaptive method to create larger probabilities and to improve the compression ratio.

Incremental Decoding
Input 1100011
 Decode 1: Need ≥ 5 bits (verify)
 Read 6 bits: Tag: 110001 (0.765625)

Step       LOW     HIGH      Tag
Decode 1   0       0.8       110001 (0.765625)
Decode 3   0.656   0.8       110001 (0.765625)
  E2       0.312   0.6       100011 (0.546875)
Decode 2   0.5424  0.54816   100011 (0.546875)
  E2       0.0848  0.09632   000110 (0.09375)
  E1       0.1696  0.19264   001100 (0.1875)
  E1       0.3392  0.38528   011000 (0.375)
  E1       0.6784  0.77056   110000 (0.75)
  E2       0.3568  0.54112   100000 (0.5)
Decode 1   0.3568  0.504256  100000 (0.5)

 Summary: Complete all possible scaling before further decoding. Adjust LOW, HIGH and Tag together.

Summary – Part I
 Introduction
 Encoding and Decoding
 Scaling and Incremental Coding
 E1, E2

 Next:
 Integer Implementation
• E3 scaling
 Adaptive Arithmetic Coding
 Binary Arithmetic Coding
 Applications
• JBIG, H.264, JPEG 2000


Outline – Part II

 Review
 Integer Implementation
 Integer representation
 E3 Scaling
 Minimum word length

 Binary Arithmetic Coding

 Adaptive Arithmetic Coding



Encoding Pseudo Code with E1, E2
(For floating-point implementation)

Symbol  Prob.  CDF
1       0.8    0.8
2       0.02   0.82
3       0.18   1

EncodeSymbol(n) {
    //Update variables
    RANGE = HIGH - LOW;
    HIGH = LOW + RANGE * CDF(n);
    LOW = LOW + RANGE * CDF(n-1);

    //keep scaling before encoding next symbol
    while LOW, HIGH in [0, 0.5) or [0.5, 1) {
        send 0 for E1 and 1 for E2
        scale LOW, HIGH
    }
}
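
The pseudo code above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: exact fractions stand in for the floating-point variables, and `encode_e1e2` is a name chosen for this sketch. It returns the bits produced by scaling plus the final interval; the terminating bit is left to the caller.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
CDF = [Fraction(0), Fraction(8, 10), Fraction(82, 100), Fraction(1)]

def encode_e1e2(symbols):
    """Return (bits emitted by scaling, final low, final high)."""
    low, high, bits = Fraction(0), Fraction(1), []
    for n in symbols:
        rng = high - low
        high = low + rng * CDF[n]
        low = low + rng * CDF[n - 1]
        # keep scaling before encoding the next symbol
        while high <= HALF or low >= HALF:
            if high <= HALF:                 # E1: interval inside [0, 0.5)
                bits.append("0")
                low, high = 2 * low, 2 * high
            else:                            # E2: interval inside [0.5, 1)
                bits.append("1")
                low, high = 2 * (low - HALF), 2 * (high - HALF)
    return "".join(bits), low, high

bits, low, high = encode_e1e2([1, 3, 2, 1])
print(bits, float(low), float(high))
```

For 1321 the scaling steps emit 110001 and leave the interval [0.3568, 0.504256); sending one more 1 (the value 0.5) gives the complete output 1100011 of the worked example.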

Decoding Pseudo Code with E1, E2
(For floating-point implementation)

Symbol  Prob.  CDF
1       0.8    0.8
2       0.02   0.82
3       0.18   1

DecodeSymbol(Tag) {
    RANGE = HIGH - LOW;
    n = 1;
    while ( (Tag - LOW) / RANGE >= CDF(n) ) {
        n++;
    }
    HIGH = LOW + RANGE * CDF(n);
    LOW = LOW + RANGE * CDF(n-1);

    //keep scaling before decoding next symbol
    while LOW, HIGH in [0, 0.5) or [0.5, 1) {
        scale LOW, HIGH by E1 or E2 rule
        Left shift Tag and read one more bit to LSB
    }
    return n;
}
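
A matching decoder sketch in Python is given below. The 6-bit tag window follows the worked example; the symbol count parameter, the zero padding once the input runs out, and the name `decode_e1e2` are assumptions of this sketch.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
CDF = [Fraction(0), Fraction(8, 10), Fraction(82, 100), Fraction(1)]

def decode_e1e2(bits, count, tag_bits=6):
    """Decode `count` symbols; the tag is a sliding window of tag_bits bits."""
    stream = iter(bits[tag_bits:])
    tag = Fraction(int(bits[:tag_bits].ljust(tag_bits, "0"), 2), 2 ** tag_bits)
    lsb = Fraction(1, 2 ** tag_bits)
    low, high, out = Fraction(0), Fraction(1), []
    for _ in range(count):
        rng = high - low
        n = 1
        while (tag - low) / rng >= CDF[n]:
            n += 1
        out.append(n)
        high = low + rng * CDF[n]
        low = low + rng * CDF[n - 1]
        while high <= HALF or low >= HALF:
            shift = 0 if high <= HALF else HALF      # E1 or E2
            low, high = 2 * (low - shift), 2 * (high - shift)
            # shift the tag left and read one more bit (0 once input runs out)
            tag = 2 * (tag - shift) + int(next(stream, "0")) * lsb
    return out

print(decode_e1e2("1100011", 4))
```

LOW, HIGH and the tag go through the same scaling, so the decoder stays in step with the encoder.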

Outline

 Review
 Integer Implementation
 Integer representation
 E3 Scaling
 Complete Algorithm
 Minimum word length

 Binary Arithmetic Coding

 Adaptive Arithmetic Coding
 Applications
 JBIG, H.264, JPEG 2000


Integer Implementation
 Old formulas:
HIGH ← LOW + RANGE * CDF(n);
LOW ← LOW + RANGE * CDF(n-1);
 Integer approximation of CDF( ):
   The number of occurrences of each symbol is usually collected by a counter.
   Allow adaptive arithmetic coding

k  P(k)  n_k  Cum(k)
0  -     -    0
1  0.8   40   40
2  0.02  1    41
3  0.18  9    50

$$\text{CDF:}\quad F_X(i) = \frac{\mathrm{Cum}(i)}{N} = \frac{\sum_{k=1}^{i} n_k}{N}$$

[Figure: the range 0..N = 50 split at 40 and 41 into the intervals of symbols 1, 2, 3.]

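Integer Implementation
 Floating-point formulas:
HIGH ← LOW + RANGE * CDF(n);
LOW ← LOW + RANGE * CDF(n-1);
 Integer formulas:
$$\mathrm{RANGE} \leftarrow \mathrm{HIGH} - \mathrm{LOW} + 1$$
$$\mathrm{HIGH} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n) / N \rfloor - 1$$
$$\mathrm{LOW} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n-1) / N \rfloor$$
 Why + 1 in RANGE and – 1 in HIGH?
   HIGH should be less than the LOW of the next interval
   The best integer value is HIGH = (next LOW) – 1
   [HIGH, HIGH + 1) still belongs to the current interval, although we could not represent it explicitly.
[Figure: adjacent intervals n-1 and n; HIGH1 of the first lies just below LOW2 of the next.]
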
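Example

k  P(k)  n_k  Cum(k)
0  -     -    0
1  0.8   40   40
2  0.02  1    41
3  0.18  9    50

$$\mathrm{RANGE} \leftarrow \mathrm{HIGH} - \mathrm{LOW} + 1$$
$$\mathrm{HIGH} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n) / N \rfloor - 1$$
$$\mathrm{LOW} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n-1) / N \rfloor$$

 For 8-bit integers, initial LOW = 0, HIGH = 255 → RANGE = 256

If n = 1: $\mathrm{HIGH} = 0 + \lfloor 256 \cdot 40 / 50 \rfloor - 1 = 203$, $\mathrm{LOW} = 0 + \lfloor 256 \cdot 0 / 50 \rfloor = 0$
If n = 2: $\mathrm{HIGH} = 0 + \lfloor 256 \cdot 41 / 50 \rfloor - 1 = 208$, $\mathrm{LOW} = 0 + \lfloor 256 \cdot 40 / 50 \rfloor = 204$
If n = 3: $\mathrm{HIGH} = 0 + \lfloor 256 \cdot 50 / 50 \rfloor - 1 = 255$, $\mathrm{LOW} = 0 + \lfloor 256 \cdot 41 / 50 \rfloor = 209$

Resulting intervals for symbols 1, 2, 3: [0, 203], [204, 208], [209, 255]
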
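
The three cases can be reproduced with integer arithmetic in Python. The function `update_interval` and the list layout of `CUM` are assumptions of this sketch; the update itself follows the integer formulas above.

```python
def update_interval(low, high, n, cum):
    """One integer coding step; cum[k] = Cum(k), cum[0] = 0, cum[-1] = N."""
    total = cum[-1]
    rng = high - low + 1
    new_high = low + rng * cum[n] // total - 1   # floor, then subtract 1
    new_low = low + rng * cum[n - 1] // total
    return new_low, new_high

CUM = [0, 40, 41, 50]
for n in (1, 2, 3):
    print(n, update_interval(0, 255, n, CUM))
```

The printed intervals are adjacent and do not overlap, which is the purpose of the +1 in RANGE and the -1 in HIGH.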
E1 Scaling for Integer
 E1 Scaling: [0, 0.5) → [0, 1), E1(x) = 2x.
   LOW = 0xxxxxxx, HIGH = 0xxxxxxx
   Output the MSB value 0, then shift left by 1 bit
 Important trick: Shift in 1 to HIGH and 0 to LOW
   HIGH: 0xxxxxxx → xxxxxxx1   HIGH = (HIGH << 1) + 1;
   LOW:  0xxxxxxx → xxxxxxx0   LOW = LOW << 1;
 Always assume HIGH ends with an infinite number of 1’s:
   So that it approximates the LOW of the next interval.
   HIGH  0.0xxxxxxx111...
   LOW   0.0xxxxxxx000...
 This also ensures the RANGE is doubled after scaling:
   HIGH – LOW + 1 → (2 x HIGH + 1) – (2 x LOW) + 1 = 2(HIGH – LOW + 1)

E2 Scaling for Integer
 E2 Scaling: [0.5, 1) → [0, 1), E2(x) = 2(x - 0.5)
   LOW = 1xxxxxxx, HIGH = 1xxxxxxx
   Output the MSB, then shift left by 1 bit (mul by 2); the MSB is shifted out of the word
 Same trick: Shift in 1 to HIGH and 0 to LOW
   HIGH: 1xxxxxxx → xxxxxxx1   HIGH = (HIGH << 1) + 1;
   LOW:  1xxxxxxx → xxxxxxx0   LOW = LOW << 1;
   HIGH  0.1xxxxxxx111...
   LOW   0.1xxxxxxx000...

Integer Encoding
[0, 0.8) → LOW = 0, HIGH = 203.
[0.8, 0.82) → LOW = 204, HIGH = 208.
Can we represent an interval in [203, 204)? (Sequence 1333333……)

Step      LOW  HIGH  Output
Input 1   0    203
Input 3   167  203
  E2      78   151   1

Before E2: LOW: 167 (10100111)  HIGH: 203 (11001011)
After E2 (shift in a 1 to HIGH and a 0 to LOW):
LOW: 1(01001110) 78 (8-bit)  HIGH: 1(10010111) 151 (8-bit)
In 8.1 format (8 bits for integer, 1 bit for fractional):
LOW: (10100111.0) 167  HIGH: (11001011.1) 203.5
By shifting in a 1 to HIGH, we can cover the range [203, 203.5].
The entire range [203, 204) can be covered by always shifting in 1 to HIGH.

Outline

 Review
 Integer Implementation
 Integer representation
 E3 Scaling
 Complete Algorithm
 Minimum word length

 Binary Arithmetic Coding

 Adaptive Arithmetic Coding


E3 Scaling: [0.25, 0.75) → [0, 1)
 If RANGE straddles 1/2, E1 and E2 cannot be applied, but the range can be quite small
   Example: LOW=0.4999, HIGH=0.5001
   Binary: LOW=0.01111…., HIGH=0.1000…
   We may not have enough bits to represent the interval.
 E3 Scaling:
   [0.25, 0.75) → [0, 1): E3(x) = 2(x - 0.25)

Integer Implementation of E3
 Same trick: Shift in 1 to HIGH and 0 to LOW
   HIGH = ((HIGH – QUARTER) << 1) + 1;
   LOW = (LOW - QUARTER) << 1;
 QUARTER = 2^(m - 2) for m-bit integer. (64 for m = 8 bits)

Step                  LOW       HIGH
Start                 01xxxxxx  10xxxxxx
Subtract 01000000     00xxxxxx  01xxxxxx
× 2                   0xxxxxx0  1xxxxxx0
+ 1 (HIGH only)       0xxxxxx0  1xxxxxx1

 Result: LOW 01xxxxxx → 0xxxxxx0, HIGH 10xxxxxx → 1xxxxxx1

Another way to implement E3 (Sayood book pp. 99):
 Left shift old LOW and HIGH, complement new MSB.

Signaling of E3
 What should we send when E3 is used?
   Recall: we send 1 if E2 is used, send 0 if E1 is used
 Important relationships: (www.copro.org/download/ac_en.pdf)
   E1 ° (E3)^n = (E2)^n ° E1
   E2 ° (E3)^n = (E1)^n ° E2
   Notation: Ei ° (Ej)^n means: apply n Ej scalings, followed by an Ei scaling.
 What do they mean?
   A series of E3 followed by an E1 is equivalent to an E1 followed by a series of E2.
   A series of E3 followed by an E2 is equivalent to an E2 followed by a series of E1.

Example
 Previous example without E3:

Step      LOW     HIGH     Output
Input 1   0       0.8
Input 3   0.656   0.8
  E2      0.312   0.6      1
Input 2   0.5424  0.54816
  E2      0.0848  0.09632  1
  E1      0.1696  0.19264  0

 With E3:

Step      LOW     HIGH
(start)   0.312   0.6
  E3: (x-0.25)x2   0.124   0.7
Input 2   0.5848  0.59632
  E2      0.1696  0.19264

 The range after E2°E3 is the same as that after E1°E2

Another View of the Equivalence
 Scaling of a range in [0.5, 0.75) with E1°E2: apply E2, then E1.
 Equivalent scaling of the range in [0.5, 0.75) with E2°E3: apply E3, then E2.
[Figure: number lines with ticks at 0, 0.25, 0.5, 0.75, 1 showing the range after each scaling step.]

A Simple Proof of E2°E3 = E1°E2
 Given an original range r:
   After applying E2: [0.5, 1) → [0, 1), the range becomes
   r1 = (r – 0.5) x 2
   After applying E1: [0, 0.5) → [0, 1), the range becomes
   r2 = r1 x 2 = ((r – 0.5) x 2) x 2
 Given the same range r:
   After applying E3: [0.25, 0.75) → [0, 1), the range becomes
   r3 = (r – 0.25) x 2
   After applying E2, the range becomes
   r4 = (r3 – 0.5) x 2 = ((r – 0.25) x 2 – 0.5) x 2
      = (r – 0.5) x 2 x 2
      = r2

For formal proof: www.copro.org/download/ac_en.pdf
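
The equivalences can also be checked numerically. The sketch below treats each scaling as a function on exact fractions; the helper `apply` is a hypothetical name that applies scalings from left to right.

```python
from fractions import Fraction

def e1(x): return 2 * x
def e2(x): return 2 * (x - Fraction(1, 2))
def e3(x): return 2 * (x - Fraction(1, 4))

def apply(x, *scalings):
    """Apply scalings left to right, e.g. apply(x, e3, e2) computes E2(E3(x))."""
    for f in scalings:
        x = f(x)
    return x

for x in (Fraction(312, 1000), Fraction(5424, 10000), Fraction(6, 10)):
    assert apply(x, e3, e2) == apply(x, e2, e1)            # E2°E3 = E1°E2
    assert apply(x, e3, e1) == apply(x, e1, e2)            # E1°E3 = E2°E1
    assert apply(x, e3, e3, e2) == apply(x, e2, e1, e1)    # same with n = 2
print("all equivalences hold")
```

Because every scaling is an affine map, the identities hold for any x, not only for the sample points used here.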
Encoding Operation with E3
 Without E3:

Step      LOW     HIGH     Output
(start)   0.312   0.6
Input 2   0.5424  0.54816
  E2      0.0848  0.09632  1
  E1      0.1696  0.19264  0

 With E3:

Step      LOW     HIGH     Output
(start)   0.312   0.6
  E3      0.124   0.7      (no output here)
Input 2   0.5848  0.59632
  E2      0.1696  0.19264  1, then output 0 here!

 Don’t send anything when E3 is used, but send a 0 after E2:
   The bit stream is identical to that of the old method
   Subsequent encoding is also the same because of the same final interval

Decoding for E3
Input 1100011
Read 6 bits: Tag: 110001 (0.765625)
 Without E3:

Step       LOW     HIGH     Tag
Decode 1   0       0.8      110001 (0.765625)
Decode 3   0.656   0.8      110001 (0.765625)
  E2       0.312   0.6      100011 (0.546875)
Decode 2   0.5424  0.54816  100011 (0.546875)
  E2       0.0848  0.09632  000110 (0.09375)
  E1       0.1696  0.19264  001100 (0.1875)

 With E3: Apply E3 whenever it is possible, nothing else is needed

Step       LOW     HIGH     Tag
(start)    0.312   0.6      100011 (0.546875)
  E3       0.124   0.7      100110 (0.59375)
Decode 2   0.5848  0.59632  100110 (0.59375)
  E2       0.1696  0.19264  001100 (0.1875)

Same status as the old method: low, high, range, tag.

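Summary of Different Scalings
(positions relative to 0, 0.25, 0.5, 0.75, 1.0)
 [LOW, HIGH) inside [0, 0.5): Need E1 scaling
 [LOW, HIGH) inside [0.5, 1): Need E2 scaling
 [LOW, HIGH) inside [0.25, 0.75) and straddling 0.5: Need E3 scaling
 Otherwise: No scaling is required. Ready to encode/decode the next symbol.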


Encoding Pseudo Code with E1, E2, E3
(For integer implementation)
EncodeSymbol(n) {
    //Update variables (divisions are rounded off to integer)
    RANGE = HIGH - LOW + 1;
    HIGH = LOW + RANGE * Cum(n) / N - 1;
    LOW = LOW + RANGE * Cum(n-1) / N;

    //Scaling before encoding next symbol
    EncoderScaling(); //see next slide
}

Encoding Pseudo Code with E1, E2, E3
EncoderScaling( ) {
    while (E1, E2 or E3 is possible) {
        if (E3 is possible) {
            HIGH = ((HIGH - QUARTER) << 1) + 1;
            LOW = (LOW - QUARTER) << 1;
            Scale3++;                  //Save number of E3, but send nothing
        }
        if (E1 or E2 is possible) {
            Let b=0 for E1 and b=1 for E2
            send b
            HIGH = (HIGH << 1) + 1;
            LOW = (LOW << 1);
            while (Scale3 > 0) {       //send info about E3 now
                send complement of b   //E2 ° (E3)^n = (E1)^n ° E2
                Scale3--;              //Send one bit for each E3
            }
        }
    }
}
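
A runnable Python sketch of the integer encoder is shown below, using 8-bit words and the Cum table of the running example. The masking to 8 bits, the nested `send` helper, and the termination step (send the MSB of LOW, flush pending E3 bits, then the remaining bits of LOW) are assumptions of this sketch; the termination is chosen to match the worked integer example.

```python
M_BITS = 8
MASK = (1 << M_BITS) - 1                                 # 255
HALF, QUARTER = 1 << (M_BITS - 1), 1 << (M_BITS - 2)     # 128, 64
CUM = [0, 40, 41, 50]             # Cum(k) from the example; N = CUM[-1]

def encode_int(symbols):
    low, high, scale3, bits = 0, MASK, 0, []

    def send(b):
        nonlocal scale3
        bits.append(str(b))
        bits.extend(str(1 - b) * scale3)   # pending E3 bits: complement of b
        scale3 = 0

    for n in symbols:
        rng = high - low + 1
        high = low + rng * CUM[n] // CUM[-1] - 1
        low = low + rng * CUM[n - 1] // CUM[-1]
        while True:
            if high < HALF:                                  # E1
                send(0)
            elif low >= HALF:                                # E2
                send(1)
            elif low >= QUARTER and high < HALF + QUARTER:   # E3
                scale3 += 1
                low, high = low - QUARTER, high - QUARTER
            else:
                break
            low = (low << 1) & MASK            # shift in 0, drop the old MSB
            high = ((high << 1) & MASK) | 1    # shift in 1, drop the old MSB
    # Termination: MSB of LOW, pending E3 bits, remaining M_BITS - 1 bits of LOW.
    send(low >> (M_BITS - 1))
    bits.append(format(low & (HALF - 1), f"0{M_BITS - 1}b"))
    return "".join(bits)

print(encode_int([1, 3, 2, 1]))
```

For the input 1321 the sketch yields the 16 bits 11000100 10000000.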

Decoding Pseudo Code with E1, E2, E3
(For integer implementation)
Intervals: [0, 203], [204, 208], [209, 255]

DecodeSymbol(Tag) {
    RANGE = HIGH - LOW + 1;
    n = 1;
    //Compare Tag with HIGH of each interval (rounded off to integer)
    while (Tag > LOW + RANGE * Cum(n) / N - 1) {
        n++;
    }
    HIGH = LOW + RANGE * Cum(n) / N - 1;
    LOW = LOW + RANGE * Cum(n-1) / N;

    //keep scaling before decoding next symbol
    DecoderScaling(Tag); //next slide
    return n;
}

Decoding Pseudo Code with E1, E2, E3
DecoderScaling(Tag) {
    while (E1, E2 or E3 is possible) {
        if (E1 or E2 is possible) {
            LOW = LOW << 1;
            HIGH = (HIGH << 1) + 1;
            Tag = Tag << 1;
            Tag = Tag | ReadBits(1);
        }
        if (E3 is possible) {
            LOW = (LOW - QUARTER) << 1;
            HIGH = ((HIGH - QUARTER) << 1) + 1;
            Tag = (Tag - QUARTER) << 1;
            Tag = Tag | ReadBits(1);
        }
    }
}
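
The decoder side can be sketched the same way. The symbol count parameter, the explicit 8-bit masking and the zero padding after the last input bit are assumptions of this sketch, not part of the pseudo code.

```python
M_BITS = 8
MASK = (1 << M_BITS) - 1
HALF, QUARTER = 1 << (M_BITS - 1), 1 << (M_BITS - 2)
CUM = [0, 40, 41, 50]             # Cum(k) from the example; N = CUM[-1]

def decode_int(bits, count):
    stream = iter(bits[M_BITS:])
    tag = int(bits[:M_BITS], 2)               # read the first M_BITS bits
    low, high, out = 0, MASK, []
    for _ in range(count):
        rng = high - low + 1
        n = 1
        while tag > low + rng * CUM[n] // CUM[-1] - 1:
            n += 1
        out.append(n)
        high = low + rng * CUM[n] // CUM[-1] - 1
        low = low + rng * CUM[n - 1] // CUM[-1]
        while True:
            if high < HALF or low >= HALF:                   # E1 or E2
                pass
            elif low >= QUARTER and high < HALF + QUARTER:   # E3
                low, high, tag = low - QUARTER, high - QUARTER, tag - QUARTER
            else:
                break
            low = (low << 1) & MASK
            high = ((high << 1) & MASK) | 1
            tag = ((tag << 1) & MASK) | int(next(stream, "0"))
    return out

print(decode_int("1100010010000000", 4))
```

LOW, HIGH and Tag receive identical scaling operations, so the decoder needs no information about E3 beyond the bit stream itself.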

Integer Encoding with E1, E2, E3

Step      LOW  HIGH  Output  Scale3
Input 1   0    203
Input 3   167  203
  E2      78   151   1
  E3      28   175           1
Input 2   146  148
  E2      36   41    1, 0    0
  E1      72   83    0
  E1      144  167   0
  E2      32   79    1
  E1      64   159   0
  E3      0    191           1
Input 1   0    152

End of input: Output 0, then Output 1 for the pending E3 (Scale3 = 0), then output 7 more 0’s
Final output: 11000100 10000000

Integer Decoding with E1, E2, E3
Input: 11000100 10000000
Read 8 bits: 11000100 (196)

Step       LOW  HIGH  Tag
Decode 1   0    203   11000100 (196)
Decode 3   167  203   11000100 (196)
  E2       78   151   10001001 (137)
  E3       28   175   10010010 (146)
Decode 2   146  148   10010010 (146)
  E2       36   41    00100100 (36)
Decode 1
Tag = LOW: stop.


How to decide the word length m?
 Need to guarantee non-zero interval for all symbols in the worst case:

k  P(k)  n_k  Cum(k)
0  -     -    0
1  0.8   40   40
2  0.02  1    41
3  0.18  9    50

$$\mathrm{HIGH} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n) / N \rfloor - 1$$
$$\mathrm{LOW} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n-1) / N \rfloor$$

 Need
$$\lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n) / N \rfloor > \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n-1) / N \rfloor$$
 even when Cum(n) – Cum(n-1) = 1. Otherwise HIGH < LOW.
 RANGE cannot be too small at any time. Intuitive!

How to decide the word length m?
 When do we have the smallest RANGE without triggering a scaling?
   M = 2^m, with reference points 0, M/4, M/2, 3/4M, M
   When the interval is slightly larger than [M/4, M/2] or [M/2, 3/4M]
   None of E1, E2, and E3 can be applied
 Condition: 1/4 (2^m) > N
 Example: N = 50 → min m = 8 (1/4M = 64)
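
The condition can be turned into a one-loop helper. `min_word_length` is a hypothetical name for this sketch; it returns the smallest m that satisfies 2^m / 4 > N.

```python
def min_word_length(total_count):
    """Smallest m with 2**m / 4 > total_count, i.e. QUARTER > N."""
    m = 2
    while (1 << (m - 2)) <= total_count:
        m += 1
    return m

print(min_word_length(50))
```

For N = 50 a 7-bit word gives a quarter of only 32, so 8 bits are the minimum.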

Outline

 Review
 Integer Implementation
 Binary Arithmetic Coding
 Adaptive Arithmetic Coding


Binary Arithmetic Coding
 Arithmetic coding is slow in general:
   To decode a symbol, we need a series of decisions and multiplications:
   while (Tag > LOW + RANGE * Cum(n) / N - 1) {
       n++;
   }
 The complexity is greatly reduced if we have only two symbols: 0 and 1.
 Only two intervals in [0, 1): [0, x) for symbol 0, [x, 1) for symbol 1

Encoding of Binary Arithmetic Coding
$$\mathrm{HIGH} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n) / N \rfloor - 1$$
$$\mathrm{LOW} \leftarrow \mathrm{LOW} + \lfloor \mathrm{RANGE} \cdot \mathrm{Cum}(n-1) / N \rfloor$$
Prob(0) = 0.6. Sequence: 0110

Input    LOW    HIGH
Initial  0      1
0        0      0.6
1        0.36   0.6
1        0.504  0.6
0        0.504  0.5616

Only need to update LOW or HIGH for each symbol.

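
The binary case reduces to moving a single boundary per symbol, which the following Python sketch illustrates. Exact fractions and the name `encode_binary` are assumptions of this sketch; it works on the real interval [0, 1) as in the table rather than on integers.

```python
from fractions import Fraction

def encode_binary(bits, p0):
    """Binary arithmetic coding: symbol 0 keeps the lower part, 1 the upper."""
    low, high = Fraction(0), Fraction(1)
    for b in bits:
        split = low + (high - low) * p0
        if b == "0":
            high = split      # only HIGH changes
        else:
            low = split       # only LOW changes
    return low, high

low, high = encode_binary("0110", Fraction(6, 10))
print(float(low), float(high))
```

For the sequence 0110 with Prob(0) = 0.6 the result is the interval [0.504, 0.5616) of the table.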
Decoding of Binary Arithmetic Coding
[Figure: position of Tag on the interval 0 to 1, split at 0.6.]

Only one decision to make. The general search loop

while (Tag > LOW + RANGE * Cum(n) / N - 1) {
    n++;
}

becomes a single comparison:

if (Tag > LOW + RANGE * Cum(Symbol 0) / N - 1) {
    n = 1;
} else {
    n = 0;
}

Applications of Binary Arithmetic Coding

 Increasingly popular:
   JBIG, JBIG2, JPEG2000, H.264
 Convert non-binary signals into binary first
   H.264: Golomb-Rice Code
   Bit-plane coding
 Various simplifications to avoid multiplication:
   H.264: Table look-up for RANGE * Cum(n) / N
   JBIG: Eliminate multiplication by assuming the RANGE is close to 1. Scale if RANGE too small.


Adaptive Arithmetic Coding

 Observation: The partition of [0, 1) can be

different from symbol to symbol
 The bit stream can be decoded perfectly as long
as both encoder and decoder are synchronized
(use the same partition).
 General approach:
 Starting from a pre-defined probability distribution
 Update probability after each symbol

 This is very difficult for Huffman coding:

 Has to redesign the codebook when prob changes

Example
 Binary sequence: 01111
 Initial counters for 0’s and 1’s: C(0)=C(1)=1 → P(0)=P(1)=0.5

Encoded so far  C(0)  C(1)  P(0)  P(1)  LOW     Split   HIGH
(none)          1     1     1/2   1/2   0       0.5     1
0               2     1     2/3   1/3   0       0.3333  0.5
01              2     2     1/2   1/2   0.3333  0.4167  0.5
011             2     3     2/5   3/5   0.4167  0.45    0.5
0111            2     4     1/3   2/3   0.45    0.4667  0.5

 The last symbol 1 selects [0.4667, 0.5): Encode 0.4667.

Decoding
 Input 0.4667.
 Initial counters for 0’s and 1’s: C(0)=C(1)=1 → P(0)=P(1)=0.5

Decoded so far  C(0)  C(1)  P(0)  P(1)  LOW     Split   HIGH  Decision
(none)          1     1     1/2   1/2   0       0.5     1     Decode 0
0               2     1     2/3   1/3   0       0.3333  0.5   Decode 1
01              2     2     1/2   1/2   0.3333  0.4167  0.5   Decode 1
011             2     3     2/5   3/5   0.4167  0.45    0.5   Decode 1
0111            2     4     1/3   2/3   0.45    0.4667  0.5   Decode 1

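
The adaptive example can be reproduced with the sketch below. The function names, the use of exact fractions, and the explicit symbol count passed to the decoder are assumptions of this sketch; the counter update rule follows the example.

```python
from fractions import Fraction

def adaptive_encode(bits):
    """Adaptive binary arithmetic coding with counters C(0) = C(1) = 1."""
    counts = [1, 1]
    low, high = Fraction(0), Fraction(1)
    for b in map(int, bits):
        p0 = Fraction(counts[0], sum(counts))
        split = low + (high - low) * p0
        if b == 0:
            high = split
        else:
            low = split
        counts[b] += 1          # update the model after coding the symbol
    return low, high

def adaptive_decode(tag, count):
    """Mirror of the encoder: same counters, same updates, same order."""
    counts = [1, 1]
    low, high, out = Fraction(0), Fraction(1), []
    for _ in range(count):
        split = low + (high - low) * Fraction(counts[0], sum(counts))
        b = 0 if tag < split else 1
        if b == 0:
            high = split
        else:
            low = split
        counts[b] += 1
        out.append(b)
    return out

print(adaptive_encode("01111"), adaptive_decode(Fraction(4667, 10000), 5))
```

Encoder and decoder stay synchronized because both update the counters only after a symbol has been coded.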

Context-adaptive Arithmetic Coding
 In many cases, a sample has strong correlation with its near neighbors.
 Idea:
   Collect conditional probability distribution of a symbol for given neighboring symbols (context):
   P(x(n) | x(n-1), x(n-2), … x(n-k))
   Use this conditional probability to encode the next symbol
   More skewed probability distribution can be obtained (desired by arithmetic coding)
[Figure: 1-D context template on the sequence abcbcab; 2-D context template on the rows abcbcab and cbabcba.]

Example
 Binary sequence: 0, 1 (e.g., 0110101)
 Neighborhood (template) size: 3
   2^3 = 8 possible combinations (contexts) of 3 neighbors (x(n-1), x(n-2), x(n-3)).
 Collect frequencies of 0’s and 1’s under each context

Context    C(0)  C(1)
(0, 0, 0)  9     2
(0, 0, 1)  3     6
(0, 1, 0)  10    10
(0, 1, 1)  …     …
(1, 0, 0)  …     …
(1, 0, 1)  …     …
(1, 1, 0)  …     …
(1, 1, 1)  …     …

 Each symbol is coded with the probability distribution associated with its context.

Encoding Pseudo Code
InitProbModel(ProbModel);
While (not EOF) {
n = ReadSymbol( );

//Determine the neighboring combination (context)

context = GetContext();

//Encode with the corresponding probability model

EncodeSymbol(ProbModel[context], n);

//update probability counters for this context

UpdateProb(ProbModel[context], n);
}


Decoding Pseudo Code
InitProbModel(ProbModel);
While (not EOF) {
//Determine the neighboring combination (context)
context = GetContext();

//Decode with the corresponding probability model

n = DecodeSymbol(ProbModel[context]);

//update probability counters for this context

UpdateProb(ProbModel[context], n);
}
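
To see why context modeling helps, the sketch below estimates the ideal code length of a binary sequence when each symbol is coded with adaptive counters selected by its k previous symbols. It measures -log2 of the model probability instead of running an arithmetic coder; the function name, the all-zero start context and the initial counts of 1 are assumptions of this sketch.

```python
from collections import defaultdict
import math

def context_code_length(bits, k=3):
    """Ideal code length (bits) with one adaptive counter pair per context."""
    counts = defaultdict(lambda: [1, 1])      # C(0) = C(1) = 1 for each context
    history = [0] * k                         # assumed all-zero start context
    total = 0.0
    for b in map(int, bits):
        c = counts[tuple(history)]            # GetContext + model look-up
        total += -math.log2(c[b] / (c[0] + c[1]))   # cost of coding b
        c[b] += 1                                    # UpdateProb
        history = [b] + history[:-1]                 # (x(n-1), ..., x(n-k))
    return total

print(context_code_length("01" * 50, k=1))
```

For a strictly alternating sequence of 100 symbols a single previous symbol as context already brings the cost far below 100 bits, because each context sees a highly skewed distribution.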


Performance of Arithmetic Coding
 For a sequence of length m:
   H(X) <= Average Bit Rate <= H(X) + 2/m
   Can approach the entropy very quickly.
 Huffman coding (coding blocks of m symbols jointly):
   H(X) <= Average Bit Rate <= H(X) + 1/m
   Impractical: need to generate codewords for all sequences of length m.

Summary – Part II

 Arithmetic coding:
 Partition the range [0, 1) recursively according to symbol
probabilities.
 Incremental Encoding and Decoding
 E1, E2, E3 scaling

 Binary Arithmetic Coding

 Context Adaptive Arithmetic Coding
 Next: Quantization (lossy compression)

CMPT365 Multimedia Systems 71

Assignment 3
No ratings yet
Assignment 3
5 pages
DRAM Technology
No ratings yet
DRAM Technology
16 pages
Computer Network Imp Questions
100% (1)
Computer Network Imp Questions
4 pages
Chapter 11 Cryptographic Hash Functions
No ratings yet
Chapter 11 Cryptographic Hash Functions
38 pages
Module 3a - Itext & Image Compression
No ratings yet
Module 3a - Itext & Image Compression
54 pages
Digital System Design Using Verilog
No ratings yet
Digital System Design Using Verilog
3 pages
3 Memory Interface
No ratings yet
3 Memory Interface
14 pages
Chapter Two (Part 2) Data Link Layer Protocols (Q & A)
No ratings yet
Chapter Two (Part 2) Data Link Layer Protocols (Q & A)
14 pages
B. To Reduce The Size of Data To Save Space
100% (1)
B. To Reduce The Size of Data To Save Space
25 pages
5CS3-01: Information Theory & Coding: Unit-3 Linear Block Code
No ratings yet
5CS3-01: Information Theory & Coding: Unit-3 Linear Block Code
75 pages
Practice Questions CNNs Solns
No ratings yet
Practice Questions CNNs Solns
11 pages
Error-Free Compression: Variable Length Coding
No ratings yet
Error-Free Compression: Variable Length Coding
13 pages
MCQ On Unit 1 2 PDF
0% (1)
MCQ On Unit 1 2 PDF
22 pages
Class:-BE Sub:-Wsns: MCQ Question Bank
No ratings yet
Class:-BE Sub:-Wsns: MCQ Question Bank
13 pages
Cryptography Hash Functions
No ratings yet
Cryptography Hash Functions
5 pages
Memory Hierarchy and Cache Quiz Answers
No ratings yet
Memory Hierarchy and Cache Quiz Answers
3 pages
02 - Digital Image Processing
No ratings yet
02 - Digital Image Processing
38 pages
Mock Endsem Question Paper Image ProcessingElective V INSEM SEM
No ratings yet
Mock Endsem Question Paper Image ProcessingElective V INSEM SEM
3 pages
Unit 3 - Cyclic Code MCQ
No ratings yet
Unit 3 - Cyclic Code MCQ
6 pages
IoT-Enabling-Technologies
No ratings yet
IoT-Enabling-Technologies
17 pages
Chapter 3 Multimedia Compression
0% (1)
Chapter 3 Multimedia Compression
61 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
3 pages
Support Vector Machine (SVM) : Basic Terminologies
100% (1)
Support Vector Machine (SVM) : Basic Terminologies
2 pages
Midterm Solution - COSC 3213 - Computer Networks 1
No ratings yet
Midterm Solution - COSC 3213 - Computer Networks 1
13 pages
DSPAA Jan 2019 QP Solution
No ratings yet
DSPAA Jan 2019 QP Solution
34 pages
R20-Atcd-Q.p - Model Paper.
100% (1)
R20-Atcd-Q.p - Model Paper.
3 pages
Advanced Computer Architecture Question Paper
No ratings yet