没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
www.vcodex.com H.264 / MPEG-4 Part 10 : Overview
© Iain E G Richardson 07/10/02 Page 1 of 1
H.264 / MPEG-4 Part 10 White Paper
Overview of H.264
1. Introduction
Broadcast television and home entertainment have been revolutionised by the advent of digital TV and
DVD-video. These applications and many more were made possible by the standardisation of video
compression technology. The next standard in the MPEG series, MPEG4, is enabling a new generation
of internet-based video applications whilst the ITU-T H.263 standard for video compression is now
widely used in videoconferencing systems.
MPEG4 (Visual) and H.263 are standards that are based on video compression (“video coding”)
technology from circa. 1995. The groups responsible for these standards, the Motion Picture Experts
Group and the Video Coding Experts Group (MPEG and VCEG) are in the final stages of developing
a new standard that promises to significantly outperform MPEG4 and H.263, providing better
compression of video images together with a range of features supporting high-quality, low-bitrate
streaming video. The history of the new standard, “Advanced Video Coding” (AVC), goes back at
least 7 years.
After finalising the original H.263 standard for videotelephony in 1995, the ITU-T Video Coding
Experts Group (VCEG) started work on two further development areas: a “short-term” effort to add
extra features to H.263 (resulting in Version 2 of the standard) and a “long-term” effort to develop a
new standard for low bitrate visual communications. The long-term effort led to the draft “H.26L”
standard, offering significantly better video compression efficiency than previous ITU-T standards. In
2001, the ISO Motion Picture Experts Group (MPEG) recognised the potential benefits of H.26L and
the Joint Video Team (JVT) was formed, including experts from MPEG and VCEG. JVT’s main task
is to develop the draft H.26L “model” into a full International Standard. In fact, the outcome will be
two identical) standards: ISO MPEG4 Part 10 of MPEG4 and ITU-T H.264. The “official” title of the
new standard is Advanced Video Coding (AVC); however, it is widely known by its old working title,
H.26L and by its ITU document number, H.264 [1].
2. H.264 CODEC
In common with earlier standards (such as MPEG1, MPEG2 and MPEG4), the H.264 draft standard
does not explicitly define a CODEC (enCOder / DECoder pair). Rather, the standard defines the
syntax of an encoded video bitstream together with the method of decoding this bitstream. In practice,
however, a compliant encoder and decoder are likely to include the functional elements shown in
Figure 2-1 and Figure 2-2. Whilst the functions shown in these Figures are likely to be necessary for
compliance, there is scope for considerable variation in the structure of the CODEC. The basic
functional elements (prediction, transform, quantization, entropy encoding) are little different from
previous standards (MPEG1, MPEG2, MPEG4, H.261, H.263); the important changes in H.264 occur
in the details of each functional element.
The Encoder (Figure 2-1) includes two dataflow paths, a “forward” path (left to right, shown in blue)
and a “reconstruction” path (right to left, shown in magenta). The dataflow path in the Decoder
(Figure 2-2) is shown from right to left to illustrate the similarities between Encoder and Decoder.
www.vcodex.com H.264 / MPEG-4 Part 10 : Overview
© Iain E G Richardson 07/10/02 Page 2 of 2
F
n
(current)
F'
n-1
(reference)
MC
Intra
prediction
ME
Filter
Inter
Intra
T
T
-1
Q
Q
-1
Reorder
Entropy
encode
NAL
D
n
P
uF'
n
+
-
+
+
X
F'
n
(reconstructed)
D'
n
(1 or 2 previously
encoded frames)
Choose
Intra
prediction
Figure 2-1 AVC Encoder
F'
n
(reconstructed)
MC
Filter
Inter
Intra
T
-1
Q
-1
Reorder
Entropy
decode
NAL
P
+
+
XD'
n
uF'
n
F'
n-1
(reference)
Intra
prediction
(1 or 2 previously
encoded frames)
Figure 2-2 AVC Decoder
2.1 Encoder (forward path)
An input frame F
n
is presented for encoding. The frame is processed in units of a macroblock
(corresponding to 16x16 pixels in the original image). Each macroblock is encoded in intra or inter
mode. In either case, a prediction macroblock P is formed based on a reconstructed frame. In Intra
mode, P is formed from samples in the current frame n that have previously encoded, decoded and
reconstructed (uF’
n
in the Figures; note that the unfiltered samples are used to form P). In Inter mode,
P is formed by motion-compensated prediction from one or more reference frame(s). In the Figures,
the reference frame is shown as the previous encoded frame F’
n-1
; however, the predicton for each
macroblock may be formed from one or two past or future frames (in time order) that have already
been encoded and reconstructed.
The prediction P is subtracted from the current macroblock to produce a residual or difference
macroblock D
n
. This is transformed (using a block transform) and quantized to give X, a set of
quantized transform coefficients. These coefficients are re-ordered and entropy encoded. The entropy-
encoded coefficients, together with side information required to decode the macroblock (such as the
macroblock prediction mode, quantizer step size, motion vector information describing how the
macroblock was motion-compensated, etc) form the compressed bitstream. This is passed to a
Network Abstraction Layer (NAL) for transmission or storage.
2.2 Encoder (reconstruction path)
www.vcodex.com H.264 / MPEG-4 Part 10 : Overview
© Iain E G Richardson 07/10/02 Page 3 of 3
The quantized macroblock coefficients X are decoded in order to reconstruct a frame for encoding of
further macroblocks. The coefficients X are re-scaled (Q
-1
) and inverse transformed (T
-1
) to produce a
difference macroblock D
n
’. This is not identical to the original difference macroblock D
n
; the
quantization process introduces losses and so D
n
’ is a distorted version of D
n
.
The prediction macroblock P is added to D
n
’ to create a reconstructed macroblock uF’
n
(a distorted
version of the original macroblock). A filter is applied to reduce the effects of blocking distortion and
reconstructed reference frame is created from a series of macroblocks F’
n
.
2.3 Decoder
The decoder receives a compressed bitstream from the NAL. The data elements are entropy decoded
and reordered to produce a set of quantized coefficients X. These are rescaled and inverse transformed
to give D
n
’ (this identical to the D
n
’ shown in the Encoder). Using the header information decoded
from the bitstream, the decoder creates a prediction macroblock P, identical to the original prediction
P formed in the encoder. P is added to D
n
’ to produce uF’
n
which this is filtered to create the decoded
macroblock F’
n
.
It should be clear from the Figures and from the discussion above that the purpose of the
reconstruction path in the encoder is to ensure that both encoder and decoder use identical reference
frames to create the prediction P. If this is not the case, then the predictions P in encoder and decoder
will not be identical, leading to an increasing error or “drift” between the encoder and decoder.
3. References
1 ITU-T Rec. H.264 / ISO/IEC 11496-10, “Advanced Video Coding”, Final Committee Draft, Document JVT-
E022, September 2002
www.vcodex.com H.264 / MPEG-4 Part 10 : Intra Prediction
© Iain E G Richardson 30/04/03 Page 1 of 6
H.264 / MPEG-4 Part 10 White Paper
Revised April 03
Prediction of Intra Macroblocks
1. Introduction
The Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG are finalising a new standard for
the coding (compression) of natural video images. The new standard [1,2] will be known as H.264 and
also MPEG-4 Part 10, “Advanced Video Coding”. This document describes the methods of predicting
intra-coded macroblocks in an H.264 CODEC.
If a block or macroblock is encoded in intra mode, a prediction block is formed based on previously
encoded and reconstructed (but un-filtered) blocks. This prediction block P is subtracted from the
current block prior to encoding. For the luminance (luma) samples, P may be formed for each 4x4 sub-
block or for a 16x16 macroblock. There are a total of 9 optional prediction modes for each 4x4 luma
block; 4 optional modes for a 16x16 luma block; and one mode that is always applied to each 4x4
chroma block.
2. 4x4 luma prediction modes
Figure 1 shows a luminance macroblock in a QCIF frame and a 4x4 luma block that is required to be
predicted. The samples above and to the left have previously been encoded and reconstructed and are
therefore available in the encoder and decoder to form a prediction reference. The prediction block P
is calculated based on the samples labelled A-M in Figure 2, as follows. Note that in some cases, not
all of the samples A-M are available within the current slice: in order to preserve independent
decoding of slices, only samples within the current slice are available for prediction. DC prediction
(mode 0) is modified depending on which samples A-M are available; the other modes (1-8) may only
be used if all of the required prediction samples are available (except that, if E, F, G and H are not
available, their value is copied from sample D).
The arrows in Figure 3 indicate the direction of prediction in each mode. For modes 3-8, the predicted
samples are formed from a weighted average of the prediction samples A-Q. The encoder may select
the prediction mode for each block that minimizes the residual between P and the block to be encoded.
www.vcodex.com H.264 / MPEG-4 Part 10 : Intra Prediction
© Iain E G Richardson 30/04/03 Page 2 of 6
Original macroblock 4x4 luma block to be predicted
Figure 1 Original macroblock and 4x4 luma block to be predicted
M A B C D E F G H
I
a b c d
J
e f g h
K
i j k l
L
m n o p
Figure 2 Labelling of prediction samples (4x4)
Mean
(A..D,
I..L)
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
M A B C D E F G H
I
J
K
L
2 (DC)0 (vertical) 1 (horizontal) 4 (diagonal down-right)3 (diagonal down-left)
5 (vertical-right) 7 (vertical-left) 8 (horizontal-up)6 (horizontal-down)
Figure 3 4x4 luma prediction modes
剩余39页未读,继续阅读
资源评论
mz454619501
- 粉丝: 43
- 资源: 13
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功