IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 3, MARCH 2001 301
Overview of Fine Granularity Scalability in
MPEG-4 Video Standard
Weiping Li, Fellow, IEEE
Abstract—Streaming Video Profile is the subject of an Amend-
ment of MPEG-4, and is developed in response to the growing need
on a video-coding standard for streaming video over the Internet.
It provides the capability to distribute single-layered frame-based
video over a wide range of bit rates with high coding efficiency. It
also provides fine granularity scalability (FGS), and its combina-
tion with temporal scalability, to address a variety of challenging
problems in deliveringvideo over the Internet. This paper provides
an overview of the FGS video coding technique in this Amendment
of the MPEG-4.
Index Terms—Bitplane coding, Internet, media, MPEG, scala-
bility, standards, streaming, video.
I. INTRODUCTION
T
HE OBJECTIVE of video coding has traditionally been
to optimize video quality at a given bit rate. Due to the
network video applications, such as Internet streaming video,
the objective is somewhat changed. This change is necessary
because network video has introduced a new system configura-
tion, as illustrated in Fig. 1, and the network channel capacity
varies over a wide range depending on the type of connections
and the network traffic at any given time.
In a traditional communication system, the encoder com-
presses the input video signal into a bit rate that is less than,
and close to, the channel capacity, and the decoder reconstructs
the video signal using all the bits received from the channel.
In such a model, two basic assumptions are made. The first
is that the encoder knows the channel capacity. The second is
that the decoder is able to decode all the bits received from
the channel fast enough to reconstruct the video. These two
basic assumptions are challenged in Internet streaming video
applications. First of all, due to the video server used between
the encoder and the channel, as shown in Fig. 1, plus the
varying channel capacity, the encoder no longer knows the
channel capacity and does not know at which bit rate the video
quality should be optimized. Secondly, more and more appli-
cations use a software video client/decoder that has to share
the computational resources with other operations on the user
terminal. The video decoder may not be able to decode all the
bits received from the channel fast enough for reconstruction
of the video signal. Therefore, the objective of video coding
for Internet streaming video is changed to optimizing the video
quality over a given bit rate range instead of at a given bit
Manuscript received June 15, 2000; revised December 7, 2000. This paper
was recommended by Guest Editors M. R. Civanlar, A. Luthra, S. Wenger, and
W. Zhu.
The author is with WebCast Technologies, Inc., Mountain View, CA 94041
USA (e-mail: li@webcasttech.com).
Publisher Item Identifier S 1051-8215(01)02236-4.
Fig. 1. System configuration for Internet Streaming Video.
Fig. 2. Illustration of video coding performance.
rate. The bitstream should be partially decodable at any bit rate
within the bit rate range to reconstruct a video signal with the
optimized quality at that bit rate. Fig. 2 illustrates this point.
The horizontal axis in Fig. 2 indicates the channel bit rate,
while the vertical axis indicates the video quality received by
a user. The distortion-rate curve indicates the upper bound in
quality for any coding technique at any given bit rate. The three
staircase curves indicate the performance of an optimal nonscal-
able coding technique. Once a given bit rate is chosen—either
low, medium, or high—the nonscalable coding technique tries to
achieve the optimal quality indicated by having the upper corner
of the staircase curve very close to the distortion-rate curve. If
the channel bit rate happens to be at the video-coding bit rate,
the received video quality is the best. However, if the channel
bit rate is lower than the video coding bit rate, a so-called “dig-
ital cutoff” phenomenon happens and the received video quality
becomes very poor. On the other hand, if the channel bit rate is
higher than the video-coding bit rate, the received video quality
does not become any better. Scalable video coding has been
an interesting topic. In MPEG-2 and MPEG-4, several layered
1051–8215/01$10.00 © 2001 IEEE