M4 Video

The document provides an overview of video compression, covering both analog and digital video concepts, including motion compensation and various MPEG standards. It outlines the principles of spatial and temporal redundancy in video frames, detailing techniques such as intra and inter frame coding, subsampling, differencing, and motion compensation. The course outcomes emphasize understanding and applying data compression techniques and standards in video and audio contexts.

Basics of Video Compression

Neena Raj N. R.

Department of Computer Science and Engineering

Mar Baselios College of Engineering and Technology, Nalanchira
Syllabus

Module 3: Video Compression

Basics of Video Compression: Analog Video and Digital Video,
Motion Compensation, MPEG-1 standard and Video Syntax, MPEG-1
Pel Reconstruction, MPEG-4 standard, Functionalities for MPEG-4

Neena Raj N. R. CS1U43D DCT 2 / 52

Course Outcomes

CO1 (Understand): Describe the fundamental principles of data
compression.
CO2 (Apply): Make use of statistical and dictionary based
compression techniques for various applications.
CO3 (Apply): Illustrate various image compression standards.
CO4 (Understand): Summarize video compression mechanisms to
reduce the redundancy in video.
CO5 (Understand): Use the fundamental properties of digital audio
to compress audio data.

Video

Video is an electronic medium for the recording, copying, playback,
broadcasting, and display of moving visual media.

Analog Video

An analog video camera converts the image it “sees” through its lens
to an electric voltage (a signal) that varies with time according to the
intensity and color of the light emitted from the different image parts.
Such a signal is called analog, since it is analogous (proportional) to
the light intensity.

Fig. (a) CRT Operation. (b) Persistence.

Fig. (c) Odd Scan Lines. (d) Even Scan Lines.

The signal instructs the hardware to turn the beam off, move it to the
top-left corner of the screen, turn it on, and sweep a horizontal line
on the screen.
While the beam is swept horizontally along the top scan line, the
analog signal is used to adjust the beam’s intensity according to the
image parts being displayed.
At the end of the first scan line, the signal instructs the television
hardware to turn the beam off, move it back and slightly down, to the
start of the third (not the second) scan line, turn it on, and sweep
that line.

Moving the beam to the start of the next scan line is known as a
retrace.
The time it takes to retrace is the horizontal blanking time.
This way, one field of the picture is created on the screen line by line,
using just the odd-numbered scan lines.
At the end of the last line, the signal contains instructions for a frame
retrace (also called a vertical retrace).
This turns the beam off and moves it to the start of the next field
(the second scan line) to scan the field of even-numbered scan lines.
The time it takes to do the vertical retrace is the vertical blanking
time.

The picture is therefore created in two fields that together make a
frame.
The picture is said to be interlaced.
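
The split of a frame into two fields can be sketched in a few lines of Python. This is only an illustration: the frame is modeled as a list of scan lines, and the function names split_fields and weave are hypothetical, not part of any standard.

```python
def split_fields(frame):
    """Split a frame (a list of scan lines) into its two interlaced fields."""
    # Scan lines are numbered from 1, so list indices 0, 2, 4, ... hold
    # the odd-numbered lines and indices 1, 3, 5, ... the even-numbered ones.
    odd_field = frame[0::2]
    even_field = frame[1::2]
    return odd_field, even_field


def weave(odd_field, even_field):
    """Interleave the two fields back into one full frame."""
    frame = []
    for i in range(len(odd_field)):
        frame.append(odd_field[i])
        if i < len(even_field):
            frame.append(even_field[i])
    return frame


frame = ["line 1", "line 2", "line 3", "line 4", "line 5"]
odd, even = split_fields(frame)
print(odd)   # the field scanned first: lines 1, 3, 5
print(even)  # the field scanned second: lines 2, 4
```

Weaving the two fields together again gives back the original frame, which is what the display does over two successive scans.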

Composite Video
The common television receiver found in many homes receives from
the transmitter a composite signal, where the luminance and
chrominance components are multiplexed.
This type of signal was designed in the early 1950s, when color was
added to television transmissions.
The basic black-and-white signal becomes the luminance (Y)
component, and two chrominance components C1 and C2 are added.
Those can be U and V, Cb and Cr, I and Q, or any other
chrominance components.
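
As a rough illustration of how a luminance component and two chrominance components can be derived from RGB, the sketch below uses the ITU-R BT.601 luma weights. The slides do not say which weights are meant, so the coefficients and the function name rgb_to_ycbcr are assumptions made for this example.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert R, G, B values in [0, 1] to (Y, Cb, Cr).

    Assumes the ITU-R BT.601 luma weights; other systems use other weights.
    """
    # Luminance: a weighted sum of the three primaries.
    y = 0.299 * r + 0.587 * g + 0.114 * b
    # Chrominance: scaled color differences, each in the range [-0.5, 0.5].
    cb = 0.5 * (b - y) / (1.0 - 0.114)
    cr = 0.5 * (r - y) / (1.0 - 0.299)
    return y, cb, cr


print(rgb_to_ycbcr(1.0, 1.0, 1.0))  # white: full luminance, almost no chrominance
print(rgb_to_ycbcr(1.0, 0.0, 0.0))  # pure red: Cr at its maximum
```

A gray pixel (equal R, G, and B) has essentially zero chrominance, which is why a black-and-white receiver can use the Y component alone.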

Fig. Main components of a transmitter and a receiver using a composite signal

The main point is that only one signal is needed.

If the signal is sent on the air, only one frequency is needed. If it is
sent on a cable, only one cable is used.

Composite video is cheap but has problems such as cross-luminance
and cross-chrominance artifacts in the displayed image.

Component Video
Component video is an analog video signal that has been split into
two or more component channels.
In popular use, it refers to a type of component analog video (CAV)
information that is transmitted or stored as three separate signals.
Component video can be contrasted with composite video in which all
the video information is combined into a single signal that is used in
analog television.
Like composite, component cables do not carry audio and are often
paired with audio cables.

Component video requires more bandwidth and good synchronization of
the three components.

Fig. Main components of a transmitter and a receiver using a component signal

Digital Video

Digital video is the case where the original image is generated, in the
camera, in the form of pixels.
An analog image seems to have infinite resolution, whereas a digital
image has a fixed, finite resolution that cannot be increased without
loss of image quality.
Digital video nevertheless has the following advantages:
1. It can be easily edited. This makes it possible to produce special
effects.
2. It can be stored on any digital medium, such as hard disks, removable
cartridges, CD-ROMs, or DVDs.
3. It can be compressed. This allows for more storage (when video is
stored on a digital medium) and also for fast transmission.

Digital video is, in principle, a sequence of images, called frames,
displayed at a certain frame rate (so many frames per second, or fps)
to create the illusion of animation.
This rate, as well as the image size and pixel depth, depends heavily
on the application.
Surveillance cameras, for example, use the very low frame rate of five
fps, while HDTV displays 25 fps.
Most video applications also involve sound. It is part of the overall
video data and has to be compressed with the video image.
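
To see why frame rate, image size, and pixel depth matter for compression, the uncompressed bit rate can be computed as their product. The sketch below is illustrative only: the two frame rates come from the text above, while the resolutions and pixel depths are assumed values chosen for the example.

```python
def raw_bit_rate(width, height, bits_per_pixel, fps):
    """Return the uncompressed video bit rate in bits per second."""
    return width * height * bits_per_pixel * fps


# Hypothetical surveillance camera: 320x240 pixels, 8 bits per pixel, 5 fps.
surveillance = raw_bit_rate(320, 240, 8, 5)
# Hypothetical HDTV picture: 1920x1080 pixels, 24 bits per pixel, 25 fps.
hdtv = raw_bit_rate(1920, 1080, 24, 25)

print(surveillance)  # 3072000 bits per second
print(hdtv)          # 1244160000 bits per second
```

Under these assumptions the HDTV stream needs more than a gigabit per second before compression, and that figure does not yet include the sound.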

A few video applications do not include sound. Three
common examples are: (1) a surveillance camera, (2) an old, silent
movie being restored and converted from film to video, and (3) a
video presentation taken underwater.

A complete piece of video is sometimes called a presentation.

It consists of a number of acts, where each act is broken down into
several scenes.
A scene is made of several shots or sequences of action, each a
succession of frames, where there is a small change in scene and
camera position between consecutive frames.
The hierarchy is thus

piece → act → scene → sequence → frame

Video Compression

Video compression is based on two principles.

The first is the spatial redundancy that exists in each frame.
The second is the fact that most of the time, a video frame is very
similar to its immediate neighbors. This is called temporal
redundancy.
A typical technique for video compression should therefore start by
encoding the first frame using a still image compression method.
It should then encode each successive frame by identifying the
differences between the frame and its predecessor, and encoding these
differences.
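
The idea of coding the first frame as is and every later frame as a difference from its predecessor can be sketched as follows. This is a minimal, lossless illustration, assuming each frame is a flat list of pixel values; a real encoder would compress the first frame with a still image method and would also compress the differences. The function names are hypothetical.

```python
def encode_sequence(frames):
    """Return the first frame plus one list of differences per later frame."""
    first = list(frames[0])
    diffs = []
    for prev, cur in zip(frames, frames[1:]):
        # Each difference is the current pixel minus the same pixel in the
        # preceding frame; similar frames give mostly zeros.
        diffs.append([c - p for p, c in zip(prev, cur)])
    return first, diffs


def decode_sequence(first, diffs):
    """Rebuild all frames by adding each difference list to the previous frame."""
    frames = [list(first)]
    for diff in diffs:
        prev = frames[-1]
        frames.append([p + d for p, d in zip(prev, diff)])
    return frames


frames = [[10, 10, 10, 10], [10, 12, 10, 10], [10, 12, 11, 10]]
first, diffs = encode_sequence(frames)
print(diffs)  # [[0, 2, 0, 0], [0, 0, 1, 0]]
```

Because consecutive frames are similar, the difference lists are dominated by zeros, which is exactly the temporal redundancy that a later compression stage can exploit.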

If the frame is very different from its predecessor (as happens with the
first frame of a shot), it should be coded independently of any other
frame.
In the video compression literature, a frame that is coded using its
predecessor is called an inter frame (or just inter), while a frame that
is coded independently is called an intra frame (or just intra).
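
One simple way to decide between the two kinds of frame is to measure how much of the frame has changed. The sketch below is an assumption made for illustration, not a rule from any standard: it counts the pixels that differ from the predecessor and picks intra coding when the changed fraction exceeds a hypothetical threshold.

```python
def choose_frame_type(prev, cur, threshold=0.5):
    """Return "intra" or "inter" for frame cur, given its predecessor prev.

    Frames are non-empty flat lists of pixel values. The threshold is an
    arbitrary illustrative value.
    """
    if prev is None:
        # The first frame has no predecessor, so it must be coded on its own.
        return "intra"
    changed = sum(1 for p, c in zip(prev, cur) if p != c)
    if changed / len(cur) > threshold:
        return "intra"
    return "inter"


print(choose_frame_type(None, [1, 2, 3, 4]))          # intra
print(choose_frame_type([1, 2, 3, 4], [1, 2, 3, 5]))  # inter
print(choose_frame_type([1, 2, 3, 4], [9, 8, 7, 4]))  # intra
```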

Video compression is normally lossy.

Encoding a frame Fi in terms of its predecessor Fi−1 introduces some
distortions.
As a result, encoding frame Fi+1 in terms of Fi increases the
distortion.
Even in lossless video compression, a frame may lose some bits, for
example during transmission.
If a frame Fi has lost some bits, then all the frames following it, up to
the next intra frame, are decoded improperly, and the errors may
accumulate.
This is why intra frames should be used from time to time inside a
sequence, not just at its beginning.
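
The effect of a damaged frame, and the way a periodic intra frame stops the damage from spreading, can be simulated with a small sketch. It assumes frames are flat lists of pixel values, inter frames store plain differences from the preceding frame, and an intra frame is inserted every intra_period frames; all names and values are hypothetical.

```python
def encode(frames, intra_period):
    """Code every intra_period-th frame as intra (I), the rest as differences (P)."""
    coded = []
    for i, frame in enumerate(frames):
        if i % intra_period == 0:
            coded.append(("I", list(frame)))
        else:
            prev = frames[i - 1]
            coded.append(("P", [c - p for p, c in zip(prev, frame)]))
    return coded


def decode(coded):
    """Rebuild the frames; each P frame is added to the previously decoded frame."""
    frames = []
    for kind, data in coded:
        if kind == "I":
            frames.append(list(data))
        else:
            frames.append([p + d for p, d in zip(frames[-1], data)])
    return frames


frames = [[i, 2 * i] for i in range(8)]
coded = encode(frames, intra_period=4)

# Damage the coded data of frame 1 to imitate lost bits.
kind, data = coded[1]
coded[1] = (kind, [data[0] + 1] + data[1:])

decoded = decode(coded)
bad = [i for i, (a, b) in enumerate(zip(frames, decoded)) if a != b]
print(bad)  # [1, 2, 3]
```

Frames 1 to 3 are decoded incorrectly because each depends on the one before it, while frame 4 is an intra frame and is decoded correctly, as are the frames after it.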

An intra frame is labeled I, and an inter frame is labeled P (for
predictive).
An inter frame can be coded based on one of its predecessors and also
on one of its successors.
A frame that is encoded based on both past and future frames is
labeled B (for bidirectional).
We usually don’t mind if the encoder is slow, but the decoder has to
be fast.
A typical case is video recorded on a hard disk or on a DVD, to be
played back.
The encoder can take minutes or hours to encode the data.

The decoder, however, has to play it back at the correct frame rate
(so many frames per second), so it has to be fast.
This is why a typical video decoder works in parallel.
It has several decoding circuits working simultaneously on several
frames.
An I frame is decoded independently of any other frame.
A P frame is decoded using the preceding I or P frame.
A B frame is decoded using the preceding and following I or P frames.
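
These three decoding rules imply that frames cannot always be decoded in the order in which they are displayed, because a B frame needs a later I or P frame first. The sketch below works this out for a sequence of frame types given in display order. It assumes the sequence starts with an I frame and does not end with a B frame, and that each frame uses the nearest I or P frames as references; the function names are hypothetical.

```python
def reference_frames(types):
    """Map each frame index to the indices of the frames it is decoded from.

    types is a string of frame types in display order, e.g. "IBBPBBP".
    """
    anchors = [i for i, t in enumerate(types) if t in "IP"]
    refs = {}
    for i, t in enumerate(types):
        before = [a for a in anchors if a < i]
        after = [a for a in anchors if a > i]
        if t == "I":
            refs[i] = []                      # decoded independently
        elif t == "P":
            refs[i] = [before[-1]]            # preceding I or P frame
        else:
            refs[i] = [before[-1], after[0]]  # preceding and following I or P
    return refs


def decode_order(types):
    """Return an order in which every frame comes after its references."""
    refs = reference_frames(types)
    done, order = set(), []
    while len(order) < len(types):
        for i in range(len(types)):
            if i not in done and all(r in done for r in refs[i]):
                done.add(i)
                order.append(i)
                break
    return order


print(reference_frames("IBBPBBP"))
print(decode_order("IBBPBBP"))  # [0, 3, 1, 2, 6, 4, 5]
```

In this example the P frame at position 3 is decoded before the two B frames that are displayed ahead of it, since both of them depend on it.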
