OverviewoftheHighEfficiencyVideoCoding(HEVC)Standard.pdf资源-CSDN文库

HEVC

H.265

需积分: 31 165 浏览量 2019-12-05 18:52:05 上传评论收藏 4.01MB PDF 举报

资源推荐

资源详情

资源评论

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012 1649

Overview of the High Efﬁciency Video Coding

(HEVC) Standard

Gary J. Sullivan, Fellow, IEEE, Jens-Rainer Ohm, Member, IEEE, Woo-Jin Han, Member, IEEE, and

Thomas Wiegand,

Fellow, IEEE

Abstract—High Efﬁciency Video Coding (HEVC) is currently

being prepared as the newest video coding standard of the

ITU-T Video Coding Experts Group and the ISO/IEC Moving

Picture Experts Group. The main goal of the HEVC standard-

ization effort is to enable signiﬁcantly improved compression

performance relative to existing standards—in the range of 50%

bit-rate reduction for equal perceptual video quality. This paper

provides an overview of the technical features and characteristics

of the HEVC standard.

Index Terms—Advanced video coding (AVC), H.264, High

Efﬁciency Video Coding (HEVC), Joint Collaborative Team

on Video Coding (JCT-VC), Moving Picture Experts Group

(MPEG), MPEG-4, standards, Video Coding Experts Group

(VCEG), video compression.

I. Introduction

HE High Efﬁciency Video Coding (HEVC) standard is

the most recent joint video project of the ITU-T Video

Coding Experts Group (VCEG) and the ISO/IEC Moving

Picture Experts Group (MPEG) standardization organizations,

working together in a partnership known as the Joint Col-

laborative Team on Video Coding (JCT-VC) [1]. The ﬁrst

edition of the HEVC standard is expected to be ﬁnalized in

January 2013, resulting in an aligned text that will be published

by both ITU-T and ISO/IEC. Additional work is planned to

extend the standard to support several additional application

scenarios, including extended-range uses with enhanced pre-

cision and color format support, scalable video coding, and

3-D/stereo/multiview video coding. In ISO/IEC, the HEVC

standard will become MPEG-H Part 2 (ISO/IEC 23008-2)

and in ITU-T it is likely to become ITU-T Recommendation

H.265.

Manuscript received May 25, 2012; revised August 22, 2012; accepted

August 24, 2012. Date of publication October 2, 2012; date of current

version January 8, 2013. This paper was recommended by Associate Editor

H. Gharavi. (Corresponding author: W.-J. Han.)

G. J. Sullivan is with Microsoft Corporation, Redmond, WA 98052 USA

(e-mail: garysull@microsoft.com).

J.-R. Ohm is with the Institute of Communication Engineering,

RWTH Aachen University, Aachen 52056, Germany (e-mail:

ohm@ient.rwth-aachen.de).

W.-J. Han is with the Department of Software Design and Management,

Gachon University, Seongnam 461-701, Korea (e-mail: hurumi@gmail.com).

T. Wiegand is with the Fraunhofer Institute for Telecommunications, Hein-

rich Hertz Institute, Berlin 10587, Germany, and also with the Berlin Institute

of Technology, Berlin 10587, Germany (e-mail: twiegand@ieee.org).

Color versions of one or more of the ﬁgures in this paper are available

online at http://ieeexplore.ieee.org.

Digital Object Identiﬁer 10.1109/TCSVT.2012.2221191

Video coding standards have evolved primarily through the

development of the well-known ITU-T and ISO/IEC standards.

The ITU-T produced H.261 [2] and H.263 [3], ISO/IEC

produced MPEG-1 [4] and MPEG-4 Visual [5], and the two

organizations jointly produced the H.262/MPEG-2 Video [6]

and H.264/MPEG-4 Advanced Video Coding (AVC) [7] stan-

dards. The two standards that were jointly produced have had a

particularly strong impact and have found their way into a wide

variety of products that are increasingly prevalent in our daily

lives. Throughout this evolution, continued efforts have been

made to maximize compression capability and improve other

characteristics such as data loss robustness, while considering

the computational resources that were practical for use in prod-

ucts at the time of anticipated deployment of each standard.

The major video coding standard directly preceding the

HEVC project was H.264/MPEG-4 AVC, which was initially

developed in the period between 1999 and 2003, and then

was extended in several important ways from 2003–2009.

H.264/MPEG-4 AVC has been an enabling technology for dig-

ital video in almost every area that was not previously covered

by H.262/MPEG-2 Video and has substantially displaced the

older standard within its existing application domains. It is

widely used for many applications, including broadcast of high

deﬁnition (HD) TV signals over satellite, cable, and terrestrial

transmission systems, video content acquisition and editing

systems, camcorders, security applications, Internet and mo-

bile network video, Blu-ray Discs, and real-time conversa-

tional applications such as video chat, video conferencing, and

telepresence systems.

However, an increasing diversity of services, the grow-

ing popularity of HD video, and the emergence of beyond-

HD formats (e.g., 4k×2k or 8k×4k resolution) are creating

even stronger needs for coding efﬁciency superior to H.264/

MPEG-4 AVC’s capabilities. The need is even stronger when

higher resolution is accompanied by stereo or multiview

capture and display. Moreover, the trafﬁc caused by video

applications targeting mobile devices and tablet PCs, as well

as the transmission needs for video-on-demand services, are

imposing severe challenges on today’s networks. An increased

desire for higher quality and resolutions is also arising in

mobile applications.

HEVC has been designed to address essentially all existing

applications of H.264/MPEG-4 AVC and to particularly focus

on two key issues: increased video resolution and increased

use of parallel processing architectures. The syntax of HEVC

1051-8215/$31.00

 2012 IEEE

1650 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 22, NO. 12, DECEMBER 2012

is generic and should also be generally suited for other

applications that are not speciﬁcally mentioned above.

As has been the case for all past ITU-T and ISO/IEC video

coding standards, in HEVC only the bitstream structure and

syntax is standardized, as well as constraints on the bitstream

and its mapping for the generation of decoded pictures. The

mapping is given by deﬁning the semantic meaning of syntax

elements and a decoding process such that every decoder

conforming to the standard will produce the same output

when given a bitstream that conforms to the constraints of the

standard. This limitation of the scope of the standard permits

maximal freedom to optimize implementations in a manner

appropriate to speciﬁc applications (balancing compression

quality, implementation cost, time to market, and other con-

siderations). However, it provides no guarantees of end-to-

end reproduction quality, as it allows even crude encoding

techniques to be considered conforming.

To assist the industry community in learning how to use the

standard, the standardization effort not only includes the de-

velopment of a text speciﬁcation document, but also reference

software source code as an example of how HEVC video can

be encoded and decoded. The draft reference software has been

used as a research tool for the internal work of the committee

during the design of the standard, and can also be used as a

general research tool and as the basis of products. A standard

test data suite is also being developed for testing conformance

to the standard.

This paper is organized as follows. Section II highlights

some key features of the HEVC coding design. Section III

explains the high-level syntax and the overall structure of

HEVC coded data. The HEVC coding technology is then

described in greater detail in Section IV. Section V explains

the proﬁle, tier, and level design of HEVC. Since writing an

overview of a technology as substantial as HEVC involves a

signiﬁcant amount of summarization, the reader is referred

to [1] for any omitted details. The history of the HEVC

standardization effort is discussed in Section VI.

II. HEVC Coding Design and Feature Highlights

The HEVC standard is designed to achieve multiple goals,

including coding efﬁciency, ease of transport system integra-

tion and data loss resilience, as well as implementability using

parallel processing architectures. The following subsections

brieﬂy describe the key elements of the design by which

these goals are achieved, and the typical encoder operation

that would generate a valid bitstream. More details about the

associated syntax and the decoding process of the different

elements are provided in Sections III and IV.

A. Video Coding Layer

The video coding layer of HEVC employs the same hy-

brid approach (inter-/intrapicture prediction and 2-D transform

coding) used in all video compression standards since H.261.

Fig. 1 depicts the block diagram of a hybrid video encoder,

which could create a bitstream conforming to the HEVC

standard.

An encoding algorithm producing an HEVC compliant

bitstream would typically proceed as follows. Each picture

is split into block-shaped regions, with the exact block par-

titioning being conveyed to the decoder. The ﬁrst picture

of a video sequence (and the ﬁrst picture at each clean

random access point into a video sequence) is coded using

only intrapicture prediction (that uses some prediction of data

spatially from region-to-region within the same picture, but has

no dependence on other pictures). For all remaining pictures

of a sequence or between random access points, interpicture

temporally predictive coding modes are typically used for

most blocks. The encoding process for interpicture prediction

consists of choosing motion data comprising the selected

reference picture and motion vector (MV) to be applied for

predicting the samples of each block. The encoder and decoder

generate identical interpicture prediction signals by applying

motion compensation (MC) using the MV and mode decision

data, which are transmitted as side information.

The residual signal of the intra- or interpicture prediction,

which is the difference between the original block and its pre-

diction, is transformed by a linear spatial transform. The trans-

form coefﬁcients are then scaled, quantized, entropy coded,

and transmitted together with the prediction information.

The encoder duplicates the decoder processing loop (see

gray-shaded boxes in Fig. 1) such that both will generate

identical predictions for subsequent data. Therefore, the quan-

tized transform coefﬁcients are constructed by inverse scaling

and are then inverse transformed to duplicate the decoded

approximation of the residual signal. The residual is then

added to the prediction, and the result of that addition may

then be fed into one or two loop ﬁlters to smooth out artifacts

induced by block-wise processing and quantization. The ﬁnal

picture representation (that is a duplicate of the output of the

decoder) is stored in a decoded picture buffer to be used for

the prediction of subsequent pictures. In general, the order of

encoding or decoding processing of pictures often differs from

the order in which they arrive from the source; necessitating a

distinction between the decoding order (i.e., bitstream order)

and the output order (i.e., display order) for a decoder.

Video material to be encoded by HEVC is generally ex-

pected to be input as progressive scan imagery (either due to

the source video originating in that format or resulting from

deinterlacing prior to encoding). No explicit coding features

are present in the HEVC design to support the use of interlaced

scanning, as interlaced scanning is no longer used for displays

and is becoming substantially less common for distribution.

However, a metadata syntax has been provided in HEVC to

allow an encoder to indicate that interlace-scanned video has

been sent by coding each ﬁeld (i.e., the even or odd numbered

lines of each video frame) of interlaced video as a separate

picture or that it has been sent by coding each interlaced frame

as an HEVC coded picture. This provides an efﬁcient method

of coding interlaced video without burdening decoders with a

need to support a special decoding process for it.

In the following, the various features involved in hybrid

video coding using HEVC are highlighted as follows.

1) Coding tree units and coding tree block (CTB) structure:

The core of the coding layer in previous standards was

SULLIVAN et al.: OVERVIEW OF THE HEVC STANDARD 1651

Fig. 1. Typical HEVC video encoder (with decoder modeling elements shaded in light gray).

the macroblock, containing a 16×16 block of luma sam-

ples and, in the usual case of 4:2:0 color sampling, two

corresponding 8×8 blocks of chroma samples; whereas

the analogous structure in HEVC is the coding tree unit

(CTU), which has a size selected by the encoder and

can be larger than a traditional macroblock. The CTU

consists of a luma CTB and the corresponding chroma

CTBs and syntax elements. The size L×L of a luma

CTB can be chosen as L = 16, 32, or 64 samples, with

the larger sizes typically enabling better compression.

HEVC then supports a partitioning of the CTBs into

smaller blocks using a tree structure and quadtree-like

signaling [8].

2) Coding units (CUs) and coding blocks (CBs): The

quadtree syntax of the CTU speciﬁes the size and

positions of its luma and chroma CBs. The root of the

quadtree is associated with the CTU. Hence, the size of

the luma CTB is the largest supported size for a luma

CB. The splitting of a CTU into luma and chroma CBs

is signaled jointly. One luma CB and ordinarily two

chroma CBs, together with associated syntax, form a

coding unit (CU). A CTB may contain only one CU or

may be split to form multiple CUs, and each CU has an

associated partitioning into prediction units (PUs) and a

tree of transform units (TUs).

3) Prediction units and prediction blocks (PBs): The de-

cision whether to code a picture area using interpicture

or intrapicture prediction is made at the CU level. A

PU partitioning structure has its root at the CU level.

Depending on the basic prediction-type decision, the

luma and chroma CBs can then be further split in size

and predicted from luma and chroma prediction blocks

(PBs). HEVC supports variable PB sizes from 64×64

down to 4×4 samples.

4) TUs and transform blocks: The prediction residual is

coded using block transforms. A TU tree structure has

its root at the CU level. The luma CB residual may be

identical to the luma transform block (TB) or may be

further split into smaller luma TBs. The same applies to

the chroma TBs. Integer basis functions similar to those

of a discrete cosine transform (DCT) are deﬁned for the

square TB sizes 4×4, 8×8, 16×16, and 32×32. For the

4×4 transform of luma intrapicture prediction residuals,

an integer transform derived from a form of discrete sine

transform (DST) is alternatively speciﬁed.

5) Motion vector signaling: Advanced motion vector pre-

diction (AMVP) is used, including derivation of several

most probable candidates based on data from adjacent

PBs and the reference picture. A merge mode for MV

coding can also be used, allowing the inheritance of

MVs from temporally or spatially neighboring PBs.

Moreover, compared to H.264/MPEG-4 AVC, improved

skipped and direct motion inference are also speciﬁed.

6) Motion compensation: Quarter-sample precision is used

for the MVs, and 7-tap or 8-tap ﬁlters are used for

interpolation of fractional-sample positions (compared

to six-tap ﬁltering of half-sample positions followed

by linear interpolation for quarter-sample positions in

剩余19页未读，继续阅读

评论收藏

内容反馈

植田真梨惠

粉丝: 0
资源: 2

Overview of the High Efficiency Video Coding(HEVC) Standard.pdf

最新资源

Overview of the High Efficiency Video Coding(HEVC) Standard.pdf

Overview of the High Efficiency Video Coding (HEVC) Standard

HEVC-Overview

Intra Coding of the HEVC Standard

High efficiency video coding (HEVC) text specification draft 8

Overview of the High Efficiency Video Coding (HEVC) Standard.pdf

Overview of the High Efficiency Video Coding.pdf

Overview of the High Efficiency Video Coding

Overview of the Range Extensions for the HEVC Standard

Overview of HEVC codec standard.rar

Overview of (HEVC)翻译版

HEVC入门论文(多篇)

HEVC_overview

Introduction to the High-Efficiency Video Coding Standard

Introduction to High Efficiency Video Coding

Towards the Next Video Standard: High Efficiency Video Coding

Comparison of the Coding Efficiency of Video Coding Standard

High.Efficiency.Video.Coding.Coding.Tools.and.Specification

Directionlet在视频编码中的应用

DCT变换的理论和应用

Standardized Extensions of High Efficiency Video Coding

Overview_of_the_H.264_AVC_Video_Coding_Standard

Overview of the H.264-AVC Video Coding Standard

High Efficiency Video Coding Algorithms and Architectures

High Efficiency Video Coding (HEVC) Range Extensions text specification: Draft 7

Overview of the H.264_AVC Video Coding Standard

最新资源