Multimedia Data Compression

The document discusses multimedia data compression, focusing on lossless and lossy compression techniques. It explains entropy coding, Huffman coding, adaptive coding, and dictionary-based coding (LZW), detailing their algorithms and properties. The document emphasizes the importance of efficient coding methods for reducing data size in multimedia applications.

Uploaded by

Dani Abera

COMPUTER GRAPHICS & MULTIMEDIA

Chapter TWO
Multimedia Data Compression

2.1 LOSSLESS AND LOSSY COMPRESSION
• Compression: the process of coding that will effectively reduce the
total number of bits needed to represent certain information.

Fig 2.1 A general data compression scheme

• We call the output of the encoder codes or codewords.
• The intermediate medium could either be data storage or a
communication/computer network.
• If the compression and decompression processes induce no
information loss, the compression scheme is lossless; otherwise, it is
lossy.
• The compression ratio is defined as

$$\text{compression ratio} = \frac{B_0}{B_1}$$

where $B_0$ is the number of bits before compression and $B_1$ is the
number of bits after compression.
• In general, we would desire any codec (encoder/decoder scheme)
to have a compression ratio much larger than 1.0.
• The higher the compression ratio, the better the lossless
compression scheme, as long as it is computationally feasible.
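As a quick illustration of the ratio, here is a minimal sketch. The function name `compression_ratio` is an assumption made for this example; the figures 120 and 75 bits are the sizes quoted in the Huffman example later in this chapter.

```python
def compression_ratio(bits_before, bits_after):
    """Return B0 / B1, the number of bits before over the number after."""
    if bits_after <= 0:
        raise ValueError("bits_after must be positive")
    return bits_before / bits_after

# 120 bits before compression, 75 bits after.
ratio = compression_ratio(120, 75)
print(ratio)  # 1.6
```

A ratio above 1.0 means the encoded form is smaller than the original.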
2.2 ENTROPY CODING
• The entropy η of an information source with alphabet S = {s1, s2, . . . , sn} is:

$$\eta = H(S) = \sum_{i=1}^{n} p_i \log_2 \frac{1}{p_i} = -\sum_{i=1}^{n} p_i \log_2 p_i$$

• $p_i$ – probability that symbol $s_i$ will occur in S.
• $\log_2 \frac{1}{p_i}$ – indicates the amount of information contained in $s_i$, which
corresponds to the number of bits needed to encode $s_i$.
• The definition of entropy is aimed at identifying often-occurring
symbols in the datastream as good candidates for short codewords in
the compressed bitstream.
• We use a variable-length coding scheme for entropy coding:
frequently occurring symbols are given short codes that are quickly
transmitted, while infrequently occurring ones are given longer codes.
• For example, E occurs frequently in English, so we should give it a
shorter code than Q, say.
• If we use $\bar{l}$ to denote the average length (measured in bits) of the
codewords produced by the encoder, the Shannon Coding Theorem
states that the entropy is the best we can do (under certain
conditions):

$$\eta \le \bar{l}$$

• Coding schemes aim to get as close as possible to this theoretical
lower bound.
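The entropy and the average codeword length can be compared numerically. The sketch below is illustrative only: the probabilities and code lengths are hypothetical values chosen for this example, not taken from the slides, and the function names are assumptions.

```python
import math

def entropy(probs):
    """Entropy in bits per symbol: sum of p_i * log2(1 / p_i)."""
    return sum(p * math.log2(1.0 / p) for p in probs if p > 0)

def average_length(probs, lengths):
    """Average codeword length: sum of p_i * (length of codeword i)."""
    return sum(p * n for p, n in zip(probs, lengths))

# Hypothetical four-symbol source and a matching variable-length code.
probs = [0.5, 0.25, 0.125, 0.125]
lengths = [1, 2, 3, 3]

eta = entropy(probs)
avg = average_length(probs, lengths)
print(eta, avg)
```

For this particular source every probability is a power of 1/2, so the average length equals the entropy (1.75 bits per symbol); in general the average length can only be equal to or larger than the entropy.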
2.3 HUFFMAN CODING
• Huffman coding is an efficient method of compressing data without
losing information.
• Huffman coding provides an efficient, unambiguous code by analyzing
the frequencies with which symbols appear in a message.
• Symbols that appear more often are encoded as shorter bit
strings, while symbols that are used less often are encoded as
longer strings.
• Huffman coding has two major parts:
1) Build a Huffman Tree from input characters.
2) Traverse the Huffman Tree and assign codes to characters.
ALGORITHM
1. Initialization: put all symbols on the list sorted according to
their frequency counts.
2. Repeat until the list has only one symbol left.
a) From the list, pick two symbols with the lowest frequency counts. Form a
Huffman subtree that has these two symbols as child nodes and create a
parent node for them.
b) Assign the sum of the children’s frequency counts to the parent and insert it
into the list, such that the order is maintained.
c) Delete the children from the list.
3. Assign a codeword for each leaf based on the path from the
root.
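The algorithm above can be sketched in a few lines. This is a minimal sketch rather than a definitive implementation: it uses a binary heap instead of a sorted list (the effect is the same, since the two lowest counts are always picked), assumes symbols are plain strings, and the frequency counts in the usage lines are hypothetical.

```python
import heapq
from itertools import count

def huffman_codes(freqs):
    """Return {symbol: bit string} for a dict of symbol frequency counts."""
    tiebreak = count()  # unique integers so nodes themselves are never compared
    # Heap entry: (count, tiebreak, node). A node is a symbol (str) or a
    # (left, right) tuple representing an internal parent node.
    heap = [(f, next(tiebreak), sym) for sym, f in freqs.items()]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {heap[0][2]: "0"}  # single-symbol source still needs one bit
    # Step 2: merge the two lowest counts until one node (the root) is left.
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, next(tiebreak), (left, right)))
    # Step 3: the codeword of each leaf is the path from the root
    # (0 for a left edge, 1 for a right edge).
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:
            codes[node] = prefix
    walk(heap[0][2], "")
    return codes

# Hypothetical frequency counts for a 15-character message.
freqs = {"A": 5, "B": 1, "C": 6, "D": 3}
codes = huffman_codes(freqs)
total_bits = sum(freqs[s] * len(codes[s]) for s in freqs)
print(codes, total_bits)
```

The exact bit patterns depend on how ties are broken, but the total encoded length is the same for any valid Huffman tree built from the same counts.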
PROPERTIES OF HUFFMAN CODING
1. Unique Prefix Property: No Huffman code is a prefix of any other
Huffman code - precludes any ambiguity in decoding.
2. Optimality: minimum redundancy code - proved optimal for a given
data model (i.e., a given, accurate, probability distribution):
a) The two least frequent symbols will have the same length for their Huffman
codes, differing only at the last bit.
b) Symbols that occur more frequently will have shorter Huffman codes than
symbols that occur less frequently.
c) The average code length $\bar{l}$ for an information source S is strictly less than η + 1:
$$\bar{l} < \eta + 1$$
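Properties 1 and 2(c) can be checked numerically for a concrete code. In the sketch below the code table and frequency counts are hypothetical values chosen for illustration, and the helper names are assumptions.

```python
import math

def is_prefix_free(codes):
    """True if no codeword is a prefix of another codeword."""
    words = sorted(codes.values())
    # In sorted order, a codeword that is a prefix of any other codeword
    # is also a prefix of its immediate successor, so adjacent pairs suffice.
    return all(not b.startswith(a) for a, b in zip(words, words[1:]))

def entropy(counts):
    """Entropy in bits per symbol, with p_i estimated from the counts."""
    total = sum(counts.values())
    return sum((c / total) * math.log2(total / c) for c in counts.values())

def average_length(counts, codes):
    """Average codeword length in bits per symbol."""
    total = sum(counts.values())
    return sum(counts[s] * len(codes[s]) for s in counts) / total

# Hypothetical counts and a Huffman code for them.
counts = {"A": 5, "B": 1, "C": 6, "D": 3}
codes = {"A": "11", "B": "100", "C": "0", "D": "101"}

eta = entropy(counts)
avg = average_length(counts, codes)
print(is_prefix_free(codes), eta <= avg < eta + 1)
```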
EXAMPLE:
• Suppose the string below is to be sent over a network.

• Each character occupies 8 bits. There are a total of 15 characters in the
above string. Thus, a total of 8 * 15 = 120 bits are required to send this
string.
• Using the Huffman Coding technique, we can compress the string to a
smaller size.
• Huffman coding first creates a tree using the frequencies of the
characters and then generates a code for each character.
• Once the data is encoded, it has to be decoded. Decoding is done using
the same tree.
Huffman coding is done with the help of the following steps.
1. Calculate the frequency of each character in the string.
2. Sort the characters in increasing order of the frequency. These are
stored in a priority queue Q.
3. Make each unique character a leaf node.
4. Create an empty node z. Assign the minimum frequency to the
left child of z and assign the second minimum frequency to the
right child of z. Set the value of z as the sum of the above two
minimum frequencies.
5. Remove these two minimum frequencies from Q and add the sum
into the list of frequencies (* denotes the internal nodes in the figure
above).
6. Insert node z into the tree.
7. Repeat steps 3 to 5 for all the characters.
8. For each non-leaf node, assign 0 to the left edge and 1 to the
right edge.
• To send the above string over a network, we have to send the
tree as well as the compressed code. The total size is given by
the table below.

• Without encoding, the total size of the string was 120 bits. After
encoding the size is reduced to 32+15+28 = 75 bits.
Decoding the code
• To decode the code, we take the code and traverse the tree from the
root, following one edge per bit, until we reach the character at a leaf.
• Suppose 101 is to be decoded; we traverse from the root as in the figure
below.
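Decoding can be sketched as follows. Because the tree figure is not reproduced here, the code table below is an assumed prefix code used only for illustration, and the function name `huffman_decode` is hypothetical. Matching bits against the table is equivalent to walking the tree from the root: a codeword is recognized exactly when the walk reaches a leaf.

```python
def huffman_decode(bits, codes):
    """Decode a string of '0'/'1' characters using a prefix code table."""
    inverse = {code: sym for sym, code in codes.items()}
    out = []
    current = ""  # bits read since the last decoded symbol (path from the root)
    for b in bits:
        current += b
        if current in inverse:   # reached a leaf
            out.append(inverse[current])
            current = ""         # restart at the root
    if current:
        raise ValueError("trailing bits do not form a complete codeword")
    return "".join(out)

# Assumed prefix code (hypothetical; not taken from the missing figure).
codes = {"A": "11", "B": "100", "C": "0", "D": "101"}
print(huffman_decode("101", codes))        # D
print(huffman_decode("100110101", codes))  # BACD
```

The unique prefix property guarantees that the first table match is the only possible one, so no look-ahead is needed.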
2.4 ADAPTIVE CODING

• The Huffman algorithm requires prior statistical knowledge
about the information source, and such information is often
not available.
• This is particularly true in multimedia applications, where future
data is unknown before its arrival, as for example in live (or
streaming) audio and video.
• Even when the statistics are available, the transmission of the
symbol table could represent heavy overhead.
• The solution is to use adaptive compression algorithms, in which
statistics are gathered and updated dynamically as the datastream
arrives. The probabilities are no longer based on prior knowledge but
on the actual data received so far.
• The new coding methods are “adaptive” because, as the probability
distribution of the received symbols changes, symbols will be given
new (longer or shorter) codes.
• This is especially desirable for multimedia data, when the content
(the music or the color of the scene) and hence the statistics can
change rapidly.
ADAPTIVE HUFFMAN CODING
Procedures:
• Initial_code assigns symbols with some initially agreed-
upon codes, without any prior knowledge of the frequency
counts for them. For example, some conventional codes such as
ASCII may be used for coding character symbols.
• update_tree is a procedure for constructing an adaptive
Huffman tree. It basically does two things: it increments the
frequency counts for the symbols (including any new ones),
and updates the configuration of the tree.
EXAMPLE
Adaptive Huffman Coding for Symbol String AADCCDD

Initial code assignment for AADCCDD using adaptive Huffman coding

• Let us assume that the initial code assignment for both the encoder
and decoder simply follows the ASCII order for the 26 symbols in an
alphabet, A through Z, as the table above shows.
• To improve the implementation of the algorithm, we adopt an
additional rule: if any character/symbol is to be sent the first time, it
must be preceded by a special symbol, NEW. The initial code for NEW is
0. The count for NEW is always kept as 0 (the count is never
increased); hence it is always denoted as NEW:(0)
• It is important to emphasize that the code for a particular symbol
often changes during the adaptive Huffman coding process. The more
frequent the symbol up to the moment, the shorter the code.
• For example, after AADCCDD, when the character D overtakes A as
the most frequent symbol, its code changes from 101 to 0. This is of
course fundamental for the adaptive algorithm—codes are reassigned
dynamically according to the new probability distribution of the
symbols.
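The idea that both sides update the same statistics can be illustrated with a simplified sketch. Note that this is not the incremental update_tree procedure described above and it does not use the NEW symbol: as an assumption made for brevity, the encoder and decoder both start every symbol of an agreed alphabet at count 1 and rebuild a Huffman code from the current counts before each symbol. All function names are hypothetical.

```python
import heapq
from itertools import count

def build_codes(counts):
    """Huffman code for the current counts (needs at least two symbols)."""
    tiebreak = count()
    heap = [(c, next(tiebreak), s) for s, c in sorted(counts.items())]
    heapq.heapify(heap)
    while len(heap) > 1:
        c1, _, left = heapq.heappop(heap)
        c2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (c1 + c2, next(tiebreak), (left, right)))
    codes, stack = {}, [(heap[0][2], "")]
    while stack:
        node, prefix = stack.pop()
        if isinstance(node, tuple):        # internal node: descend both edges
            stack.append((node[0], prefix + "0"))
            stack.append((node[1], prefix + "1"))
        else:                              # leaf: record the path as the code
            codes[node] = prefix
    return codes

def adaptive_encode(text, alphabet):
    counts = {s: 1 for s in alphabet}
    bits = []
    for sym in text:
        bits.append(build_codes(counts)[sym])  # code from statistics so far
        counts[sym] += 1                       # then update the statistics
    return "".join(bits)

def adaptive_decode(bits, n_symbols, alphabet):
    counts = {s: 1 for s in alphabet}
    out, pos = [], 0
    for _ in range(n_symbols):
        inverse = {c: s for s, c in build_codes(counts).items()}
        end = pos + 1
        while bits[pos:end] not in inverse:
            if end > len(bits):
                raise ValueError("bitstream ended inside a codeword")
            end += 1
        sym = inverse[bits[pos:end]]
        out.append(sym)
        counts[sym] += 1                       # same update as the encoder
        pos = end
    return "".join(out)

bits = adaptive_encode("AADCCDD", "ACD")
print(adaptive_decode(bits, 7, "ACD"))  # AADCCDD
```

No code table is transmitted: the decoder reconstructs each table from the symbols it has already decoded. Rebuilding the whole code at every step is far slower than an incremental tree update; it is used here only to keep the sketch short.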
2.5 DICTIONARY-BASED CODING (LZW)
• LZW (Lempel-Ziv-Welch) employs an adaptive, dictionary-based
compression technique. Unlike variable-length coding, in which the
lengths of the codewords differ, LZW uses fixed-length codewords
to represent variable-length strings of symbols/characters that
commonly occur together, such as words in English text.
• The LZW encoder and decoder build up the same dictionary
dynamically while receiving the data.
• LZW places longer and longer repeated entries into a dictionary, and
then emits the code for an element, rather than the string itself, if the
element has already been placed in the dictionary.
ALGORITHM:
EXAMPLE
LZW Compression for String ABABBABCABABBA
• Let us start with a very simple dictionary (also referred to as a string
table), initially containing only three characters, with codes as
follows:

• Now if the input string is ABABBABCABABBA, the LZW compression
algorithm works as follows:
• The output codes are 1 2 4 5 2 3 4 6 1. Instead of 14 characters, only 9
codes need to be sent. If we assume each character or code is
transmitted as a byte, that is quite a saving (the compression ratio
would be 14/9 = 1.56).
• LZW is an adaptive algorithm, in which the encoder and decoder
independently build their own string tables. Hence, there is no
overhead involving transmitting the string table.
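The example can be reproduced with a short sketch. The initial string table used here (A = 1, B = 2, C = 3) is an assumption, chosen because it is consistent with the output codes quoted above; the function names are hypothetical, and every input character is assumed to be present in the initial table.

```python
def lzw_compress(text, initial):
    """Encode text as a list of codes, growing the string table as we go."""
    table = dict(initial)               # string -> code
    next_code = max(table.values()) + 1
    out, s = [], ""
    for c in text:
        if s + c in table:
            s += c                      # keep extending the current match
        else:
            out.append(table[s])        # emit code for the longest match
            table[s + c] = next_code    # add the new, longer string
            next_code += 1
            s = c
    if s:
        out.append(table[s])
    return out

def lzw_decompress(codes, initial):
    """Rebuild the text, constructing the same string table independently."""
    if not codes:
        return ""
    table = {code: string for string, code in initial.items()}
    next_code = max(table) + 1
    prev = table[codes[0]]
    out = [prev]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        elif code == next_code:         # code refers to the entry being built
            entry = prev + prev[0]
        else:
            raise ValueError("invalid LZW code")
        out.append(entry)
        table[next_code] = prev + entry[0]
        next_code += 1
        prev = entry
    return "".join(out)

initial = {"A": 1, "B": 2, "C": 3}      # assumed initial string table
codes = lzw_compress("ABABBABCABABBA", initial)
print(codes)                            # [1, 2, 4, 5, 2, 3, 4, 6, 1]
print(lzw_decompress(codes, initial))   # ABABBABCABABBA
```

Only the code sequence is passed to the decoder; it recovers every multi-character entry from the codes it has already seen, which is why no string table needs to be transmitted.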
THANK YOU FOR YOUR ATTENTION
