
 

Huffman Coding 
 

Introduction 
Huffman Coding is one approach to text compression. Text compression means reducing the space required to store a particular text.

Huffman Coding is a lossless data compression algorithm, i.e. it compresses data without losing any information in the process. It is especially useful when some characters occur much more frequently than others.

Working of the Huffman Algorithm:


Suppose the given string is:

BCAADDDCCACACAC (the string used in the Python code later in these notes)

Here, each character of the string takes 8 bits of memory. Since there are 15 characters in the string, the total memory consumption is 15 * 8 = 120 bits. Let's try to compress it using the Huffman Algorithm.

First of all, Huffman Coding builds a tree from the frequencies of the characters in the string and then assigns each character a unique code, so that the original data can be retrieved back from these codes.
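
For instance, the frequency-counting step and the 120-bit baseline for the example string can be reproduced in a few lines of Python. This is only an illustrative sketch using collections.Counter, separate from the full program given later in these notes:

from collections import Counter

string = 'BCAADDDCCACACAC'      # the example string used in these notes

freq = Counter(string)          # character -> frequency
print(freq)                     # e.g. Counter({'C': 6, 'A': 5, 'D': 3, 'B': 1})

# Each character normally takes 8 bits, so the uncompressed size is:
print(len(string) * 8, 'bits')  # 15 * 8 = 120 bits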

Follow the steps below: 

 

 

1. Begin by calculating the frequency of each character in the given string.

2. Sort the characters in ascending order of frequency and store them in a priority queue, say Q.

3. Treat each character as a separate leaf node.

4. Make an empty internal node, say z. Mark the node with the minimum frequency as the left child of z and the node with the second-minimum frequency as the right child. The value of z is the sum of these two frequencies.

Here, "." denotes an internal node.

5. Now, remove the two nodes with the lowest frequencies from the priority queue Q and add their sum (the node z) back into it.

6. Insert the node z into the tree.

7. Repeat steps 3 to 5 for all the characters, i.e. until only one node (the root of the tree) remains in the priority queue.

8. For each non-leaf node, assign 0 to the left edge and 1 to the right edge; the code of a character is the sequence of edge labels on the path from the root to its leaf. (A heapq-based sketch of this construction follows the list.)
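
The construction described in steps 1 to 8 can be sketched with Python's heapq module acting as the priority queue Q. This is only a minimal sketch, not the program given later in these notes (that version re-sorts a plain list instead of using a heap), and the (frequency, tie_breaker, tree) tuple layout is an assumption made so that heap comparisons stay well defined:

import heapq
from collections import Counter

string = 'BCAADDDCCACACAC'

# Steps 1-3: one leaf entry per character; a leaf is just the character itself,
# an internal node is a (left, right) pair.
heap = [(f, i, ch) for i, (ch, f) in enumerate(Counter(string).items())]
heapq.heapify(heap)
counter = len(heap)   # tie-breaker so equal frequencies never compare the trees

# Steps 4-7: repeatedly merge the two lowest-frequency nodes into a new node z.
while len(heap) > 1:
    f1, _, left = heapq.heappop(heap)    # minimum frequency
    f2, _, right = heapq.heappop(heap)   # second-minimum frequency
    heapq.heappush(heap, (f1 + f2, counter, (left, right)))
    counter += 1

root = heap[0][2]

# Step 8: label left edges with 0 and right edges with 1.
def codes(tree, prefix=''):
    if isinstance(tree, str):        # leaf: the character gets the accumulated code
        return {tree: prefix}
    left, right = tree
    table = codes(left, prefix + '0')
    table.update(codes(right, prefix + '1'))
    return table

print(codes(root))   # e.g. {'C': '0', 'B': '100', 'D': '101', 'A': '11'}

With the example frequencies, this reproduces the codes shown in the size table below (A = 11, B = 100, C = 0, D = 101).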

 

 

The size table is given below:

Character        Frequency    Code    Size
A                5            11      5*2 = 10
B                1            100     1*3 = 3
C                6            0       6*1 = 6
D                3            101     3*3 = 9
4*8 = 32 bits    15 bits              28 bits

In the last row, 4*8 = 32 bits is the space for the 4 distinct characters, 15 bits is how the frequencies are counted here, and 28 bits is the size of the encoded text.

Size before encoding: 120 bits

Size after encoding: 32 + 15 + 28 = 75 bits
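
As a quick check, the figures above can be recomputed from the code table; the following few lines just verify that arithmetic:

codes = {'A': '11', 'B': '100', 'C': '0', 'D': '101'}   # code table from above
freqs = {'A': 5, 'B': 1, 'C': 6, 'D': 3}

encoded_bits = sum(freqs[ch] * len(codes[ch]) for ch in codes)   # 10 + 3 + 6 + 9
char_bits = len(codes) * 8                    # 4 distinct characters, 8 bits each
freq_bits = sum(freqs.values())               # counted as 15 bits in these notes

print(encoded_bits)                           # 28
print(char_bits + freq_bits + encoded_bits)   # 32 + 15 + 28 = 75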

To decode a code, simply traverse the tree starting from the root, going left on 0 and right on 1, until a leaf (a character) is reached. Suppose we want to decode 101: starting at the root, 1 takes us right, 0 takes us left, and the final 1 takes us right, landing on the leaf D.
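
A minimal decoder along these lines might look as follows. The code table comes from the table above; rebuilding the decoding tree as nested dictionaries (rather than the NodeTree class used in the program below) is purely an illustrative choice:

codes = {'A': '11', 'B': '100', 'C': '0', 'D': '101'}   # code table from above

# Rebuild the decoding tree: an internal node is a dict with keys '0' and '1',
# a leaf is just the character.
root = {}
for ch, code in codes.items():
    node = root
    for bit in code[:-1]:
        node = node.setdefault(bit, {})
    node[code[-1]] = ch

def decode(bits):
    out, node = [], root
    for bit in bits:
        node = node[bit]              # 0 goes left, 1 goes right
        if isinstance(node, str):     # reached a leaf: emit the character
            out.append(node)
            node = root               # restart at the root for the next code
    return ''.join(out)

print(decode('101'))    # D
print(decode('1000'))   # BC  (100 -> B, then 0 -> C)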

Time complexity: 
During encoding, inserting each node into the priority queue takes O(log n) time, so for all n characters the total time is O(n log n).

Similarly, extracting an element from the priority queue takes O(log n) time, so across the whole input the achieved time complexity is again O(n log n).

Python Code: 
Go through the given Python code for a deeper understanding:

# Huffman Coding in python

string = 'BCAADDDCCACACAC'  # String similar to the above-taken example

# Creating tree nodes
class NodeTree(object):

    def __init__(self, left=None, right=None):
        self.left = left
        self.right = right

    def children(self):  # Return children of a node
        return (self.left, self.right)

    def nodes(self):
        return (self.left, self.right)

    def __str__(self):
        return '%s_%s' % (self.left, self.right)


# Main function implementing huffman coding
def huffman_code_tree(node, left=True, binString=''):
    if type(node) is str:  # Leaf node: map the character to its code
        return {node: binString}
    (l, r) = node.children()
    d = dict()
    d.update(huffman_code_tree(l, True, binString + '0'))   # 0 for the left edge
    d.update(huffman_code_tree(r, False, binString + '1'))  # 1 for the right edge
    return d


# Calculating frequency
freq = {}
for c in string:
    if c in freq:
        freq[c] += 1
    else:
        freq[c] = 1

freq = sorted(freq.items(), key=lambda x: x[1], reverse=True)

nodes = freq

# Build the tree by repeatedly merging the two lowest-frequency nodes
while len(nodes) > 1:
    (key1, c1) = nodes[-1]
    (key2, c2) = nodes[-2]
    nodes = nodes[:-2]
    node = NodeTree(key1, key2)
    nodes.append((node, c1 + c2))

    nodes = sorted(nodes, key=lambda x: x[1], reverse=True)

huffmanCode = huffman_code_tree(nodes[0][0])

print(' Char | Huffman code ')
print('----------------------')
for (char, frequency) in freq:
    print(' %-4r |%12s' % (char, huffmanCode[char]))
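
Building on the program above, the generated code table can also be used to encode the example string itself and confirm the 28-bit figure from the size table. The lines below are an add-on sketch meant to be appended after the program, where huffmanCode and string are already defined:

encoded = ''.join(huffmanCode[c] for c in string)   # replace each character by its code
print(encoded)                                      # the 28-bit encoded text
print(len(encoded), 'bits, down from', len(string) * 8, 'bits')   # 28 vs 120 bits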

Applications of Huffman Coding: 


● Huffman codes are used for transmitting fax and text.
● They are used by conventional compression formats like PKZIP, GZIP, etc.

 
 

 
