Polytechnic University, Dept. of Electrical and Computer Engineering
EE3414 Multimedia Communication System I
Spring 2006, Yao Wang
___________________________________________________________________________________
Homework 4 (Speech and Audio Coding) Solution
1. Consider a source with 4 symbols {a,b,c,d}. The probabilities of the 4 symbols are
P(a)=0.4, P(b)=0.1, P(c)=0.2, P(d)=0.3.
a. Design a Huffman codebook for these symbols. Determine the average bit rate and compare it
to the entropy of this source.
b. Code the sequence {aacddacbda} using the codebook you designed. Write the resulting binary
bitstream. Calculate the average bit rate.
Solution: (a) Huffman code design:
Symbol   Prob.   Codeword   Length
"a"      0.4     "1"        1
"d"      0.3     "01"       2
"c"      0.2     "001"      3
"b"      0.1     "000"      3
(Huffman tree: "b" (0.1) and "c" (0.2) are merged into a node of prob. 0.3, which is merged with "d" (0.3)
into a node of prob. 0.6, which is merged with "a" (0.4) at the root; labeling the two branches of each
merge with "1" and "0" gives the codewords above.)
Average codeword length: l = 0.4*1 + 0.3*2 + (0.2+0.1)*3 = 1.9 bits/symbol.
Entropy: H = -sum_k p_k log2 p_k = 1.85 bits/symbol. Note that H < l < H+1.
Note that for this problem, you could have assigned “0” to “a” and “1” to the sum of “d,c,b”
which has a prob. of 0.6. Either solution is correct.
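For illustration, the codebook construction can be sketched in a few lines of Python using heapq. The 0/1 labels it assigns may differ from the table above (as noted, either labeling is correct), but the codeword lengths, average length, and entropy come out the same.

```python
import heapq
from math import log2

def huffman_code(probs):
    """Build a Huffman codebook for a dict {symbol: probability}."""
    # Each heap entry: (probability, tie-breaker index, {symbol: partial codeword})
    heap = [(p, i, {s: ""}) for i, (s, p) in enumerate(probs.items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        p0, _, c0 = heapq.heappop(heap)   # lowest-probability node
        p1, _, c1 = heapq.heappop(heap)   # next-lowest node
        merged = {s: "0" + w for s, w in c0.items()}
        merged.update({s: "1" + w for s, w in c1.items()})
        heapq.heappush(heap, (p0 + p1, count, merged))
        count += 1
    return heap[0][2]

probs = {"a": 0.4, "b": 0.1, "c": 0.2, "d": 0.3}
code = huffman_code(probs)
avg_len = sum(probs[s] * len(w) for s, w in code.items())
entropy = -sum(p * log2(p) for p in probs.values())
print(code)                          # codeword lengths match the table; the 0/1 labels may be complemented
print(avg_len, round(entropy, 2))    # 1.9 1.85
```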
(b) Using the codebook above, the sequence {aacddacbda} is coded into {1100101011001000011}.
The total number of bits is 19. The total number of symbols is 10. So the average bit rate is
19/10 = 1.9 bits/symbol. (Note that, in general, the bit rate for a particular sequence may not be the same
as that calculated from the probabilities. But in this example, the sequence was chosen so that the
symbol frequencies match the given probabilities exactly, so the two average bit rates are the
same.)
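The encoding in part (b) can be reproduced with the codebook designed above (written out explicitly here so the bitstream matches the solution):

```python
codebook = {"a": "1", "d": "01", "c": "001", "b": "000"}
sequence = "aacddacbda"
bitstream = "".join(codebook[s] for s in sequence)
print(bitstream)                        # 1100101011001000011
print(len(bitstream) / len(sequence))   # 1.9 bits/symbol
```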
2. Consider a predictive coding system using delta modulation. The predictor predicts the current sample
using the previously reconstructed sample. The prediction error is quantized according to
Q(e) = +2, if e >= 0
Q(e) = -2, if e < 0
For the sample sequence {3,4,5,3,1,…}
Show the predicted value, the prediction error, the quantized prediction error, and the reconstructed
value, for each sample, starting from the first sample. Assume that the encoder and decoder will use the
value of 2 as the prediction value for the first sample.
Assuming “1” represents e>=0 and “0” represents e<0, also write the binary representation of the coded stream.
What is the bit rate of the coded stream? (in terms of bit/sample)
Solution:
Sample   Original   Predicted value         Prediction error       Quantized   Reconstructed value
index    value      (= previous             (= original value      error       (= predicted value
                    reconstructed value)    - predicted value)                 + quantized error)
1        3          2                        1                     2           4
2        4          4                        0                     2           6
3        5          6                       -1                    -2           4
4        3          4                       -1                    -2           2
5        1          2                       -1                    -2           0
The coded stream is “11000”. The bit rate is 1 bit/sample.
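The table and bitstream can be reproduced with a short sketch of the delta-modulation loop (step size 2 and initial prediction 2, as given in the problem):

```python
def dm_encode(samples, init_pred=2, step=2):
    """Delta-modulation encoder: 1-bit quantizer Q(e) = +step if e >= 0, else -step.
    Returns the bit string and the reconstructed values (which also serve as the predictions)."""
    bits, recon = [], []
    pred = init_pred                       # both encoder and decoder start from 2
    for x in samples:
        e = x - pred                       # prediction error
        q = step if e >= 0 else -step      # quantized error
        bits.append("1" if e >= 0 else "0")
        pred = pred + q                    # reconstructed value = prediction for the next sample
        recon.append(pred)
    return "".join(bits), recon

bits, recon = dm_encode([3, 4, 5, 3, 1])
print(bits)    # 11000
print(recon)   # [4, 6, 4, 2, 0]
```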
3. Explain why predictive coding (DPCM) can reduce the average bit rate compared to coding
each sample directly (PCM).
If the predictor is well designed, the predicted value is close to the original value most of
the time. Therefore, the prediction error is usually small, with occasional large values. Using
entropy coding, such errors can be coded with fewer bits than the original sample values.
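A small illustration with a made-up, highly correlated signal: the first-order prediction error (previous sample as the predictor) takes far fewer, and smaller, distinct values than the samples themselves, so its empirical entropy is much lower.

```python
import numpy as np

# Illustrative only: a hypothetical, highly correlated signal.
rng = np.random.default_rng(0)
x = np.cumsum(rng.integers(-2, 3, size=10000))   # adjacent samples are close to each other
e = np.diff(x)                                   # error when predicting each sample by the previous one

def empirical_entropy(values):
    _, counts = np.unique(values, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

print(empirical_entropy(x))   # many distinct, widely spread values -> large entropy
print(empirical_entropy(e))   # only the 5 values {-2,...,2}       -> at most log2(5) ~ 2.3 bits
```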
4. Explain the principle of ADPCM and the two different types of adaptation (forward and backward),
and discuss their pros and cons.
With non-adaptive DPCM, the linear predictor is fixed and is determined by minimizing the
mean square error between the original sample and the predicted sample. Essentially, the linear
predictor coefficients are determined from the correlation coefficients of adjacent samples. When
the underlying signal changes its statistics in time (i.e., how correlated adjacent samples are), ideally
the predictor coefficients should also change. This is the idea behind adaptive DPCM, or ADPCM.
The forward adaptation method looks ahead at a group of N samples, computes the correlation
coefficients between adjacent samples, and, based on the resulting correlation statistics, determines
the optimal linear predictor to be used for these N samples. The encoder needs to send not only the
prediction errors, but also the predictor used for every N samples. With backward adaptation, the
predictor for the new samples is determined based on the previously coded samples. Because the
decoder can perform the same computation to derive the predictor for the new samples, the encoder
does not need to send the predictor coefficients. But the predictor determined from the past
samples is not as good as the predictor determined using the current samples, so backward
adaptation yields larger prediction errors in general. Backward adaptation, however, does not
need to buffer the next N samples, so it has lower delay.
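A minimal first-order sketch contrasting the two adaptation modes. Quantization of the errors and of the coefficients is ignored, and, for brevity, backward adaptation here computes its coefficient from the original past samples rather than the reconstructed ones a real coder would use.

```python
import numpy as np

def optimal_coeff(block):
    """MSE-optimal coefficient of the first-order predictor xhat[n] = a*x[n-1]: a = R(1)/R(0)."""
    r0 = float(np.dot(block, block))
    r1 = float(np.dot(block[1:], block[:-1]))
    return r1 / r0 if r0 > 0 else 0.0

def forward_adaptive_errors(x, N):
    """Forward adaptation: the coefficient is computed from the CURRENT block of N samples,
    so it must be sent as side information and the encoder must buffer (delay) N samples."""
    errors, coeffs = [], []
    for start in range(0, len(x) - N + 1, N):
        blk = x[start:start + N]
        a = optimal_coeff(blk)
        coeffs.append(a)                          # side information, sent once per block
        errors.extend(blk[1:] - a * blk[:-1])
    return np.array(errors), coeffs

def backward_adaptive_errors(x, N):
    """Backward adaptation: the coefficient comes from the PREVIOUS block, which the decoder
    also has, so nothing extra is sent and there is no look-ahead delay; the price is a
    stale coefficient and therefore somewhat larger prediction errors."""
    errors, a = [], 0.0
    for start in range(0, len(x) - N + 1, N):
        blk = x[start:start + N]
        errors.extend(blk[1:] - a * blk[:-1])     # coefficient from the previous block
        a = optimal_coeff(blk)                    # updated for the next block
    return np.array(errors)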
5. Explain the main difference between waveform-based coders, vocoders, and hybrid coders for
speech coding, in terms of techniques used and the bit-rate/quality range of each.
Waveform-based coders try to reproduce the speech sample values as closely as possible given the
target bit rate. They achieve compression by making use of the fact that adjacent samples have
similar values, and they employ ADPCM-type techniques. Vocoders are targeted at applications
requiring very low bit rates where the speech does not have to sound very natural, as long as it is
intelligible. Vocoders make use of the fact that the human vocal tract can be modeled by a linear
filter whose coefficients change with the shape of the vocal tract (depending on the sound
being produced). Given a set of speech samples, a vocoder deduces the filter model and the
excitation signal driving the filter and sends these model parameters. The decoder synthesizes the
speech samples from these model parameters. Hybrid coders work in between, targeting
applications requiring intermediate bit rate and quality. This is achieved by allowing a larger range
of excitation signals. They also allow more sophisticated adaptation of the filter. Sometimes they
also allow specifying the errors between the original samples and those produced by the filter model.
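An illustrative sketch of the vocoder's filter model: estimating an all-pole vocal-tract filter by the autocorrelation (Yule-Walker) method and resynthesizing from an excitation. This is only a sketch; real coders add windowing, the Levinson-Durbin recursion, coefficient quantization, and pitch/voicing analysis.

```python
import numpy as np

def lpc_coefficients(frame, order=10):
    """Estimate an order-p all-pole vocal-tract filter from one frame of speech samples."""
    r = np.array([np.dot(frame[:len(frame) - k], frame[k:]) for k in range(order + 1)])
    # Normal (Yule-Walker) equations R a = r[1:], with R[i][j] = r(|i-j|)
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:order + 1])
    return a          # predictor: xhat[n] = sum_k a[k] * x[n-1-k]

def synthesize(a, excitation):
    """Drive the all-pole filter with an excitation signal
    (a periodic pulse train for voiced sounds, white noise for unvoiced ones)."""
    out = np.zeros(len(excitation))
    for n in range(len(excitation)):
        past = sum(a[k] * out[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        out[n] = excitation[n] + past
    return out
```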