
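### Overview of the HEVC Standard

#### 1. Introduction and Background

HEVC (High Efficiency Video Coding) is the newest video compression standard jointly developed by the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG). Its main goal is significantly better compression than existing standards: a bit rate reduction of roughly 50% at equal perceptual quality.

#### 2. Development and Goals

The HEVC project is carried out by the Joint Collaborative Team on Video Coding (JCT-VC), a partnership of ITU-T VCEG and ISO/IEC MPEG. The first edition of the standard is expected to be finalized in January 2013 and will be published by both ITU-T and ISO/IEC. Further work is planned to extend the standard to additional application scenarios, including professional uses with enhanced precision and color format support, scalable video coding, and 3D / stereo / multiview video coding. In ISO/IEC, HEVC will become MPEG-H Part 2 (ISO/IEC 23008-2); in ITU-T it is likely to become ITU-T Recommendation H.265.

#### 3. Evolution of Video Coding Standards

Video coding standards have evolved mainly through the work of ITU-T and ISO/IEC. ITU-T produced H.261 and H.263, ISO/IEC produced MPEG-1 and MPEG-4 Visual, and the two organizations jointly produced H.262/MPEG-2 Video and H.264/MPEG-4 AVC. These standards laid the foundation for HEVC.

#### 4. Key Technical Features

HEVC is designed for efficient video compression. Its key technical features include:

1. **Larger coding units**: coding units (CUs) can be as large as 64×64 samples, compared with the 16×16 macroblock of H.264/MPEG-4 AVC.
2. **Flexible partitioning**: a quadtree partitioning structure lets the encoder adapt the coding unit size to the picture content.
3. **More intra prediction modes**: a larger set of intra prediction directions (33 directional modes) improves prediction accuracy.
4. **Inter prediction with multiple reference pictures**: several reference pictures can be used for prediction, which makes motion compensation more flexible and accurate.
5. **Efficient entropy coding**: HEVC uses context-adaptive binary arithmetic coding (CABAC). CABAC is not new to HEVC: H.264/MPEG-4 AVC already offers it alongside the simpler, less efficient CAVLC.
6. **Improved in-loop filtering**: the in-loop filters (deblocking and SAO) are designed to better remove blocking and other artifacts.

#### 5. Application Outlook

HEVC is expected to have a far-reaching impact on video communication. Thanks to its compression efficiency, it suits HD video streaming, video conferencing, digital TV broadcasting, and video playback on mobile devices. As bandwidth constraints ease and technology advances, HEVC may also play an important role in emerging areas such as virtual reality (VR) and augmented reality (AR).

### Conclusion

As a new-generation video compression standard, HEVC makes significant progress in compression efficiency. The features summarized above show that it improves coding performance and opens up room for future video applications. As the standard is refined and extended, HEVC is likely to become one of the leading standards in the video field.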


PRE-PUBLICATION DRAFT, TO APPEAR IN IEEE TRANS. ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, DEC. 2012

IEEE by sending an email to pubs-permissions@ieee.org.



Abstract— High-Efficiency Video Coding (HEVC) is currently

being prepared as the newest video coding standard of the ITU-T

Video Coding Experts Group (VCEG) and the ISO/IEC Moving

Picture Experts Group (MPEG). The main goal of the HEVC

standardization effort is to enable significantly improved

compression performance relative to existing standards – in the

range of 50% bit rate reduction for equal perceptual video

quality. This article provides an overview of the technical features

and characteristics of the HEVC standard.

Index Terms—Video compression, Standards, HEVC, JCT-VC,

MPEG, VCEG, H.264, MPEG-4, AVC.

I. INTRODUCTION

HEVC, the High Efficiency Video Coding standard, is the

most recent joint video project of the ITU-T VCEG and

ISO/IEC MPEG standardization organizations, working

together in a partnership known as the Joint Collaborative

Team on Video Coding (JCT-VC) [1]. The first edition of the

HEVC standard is expected to be finalized in January 2013,

resulting in an aligned text that will be published by both ITU-T

and ISO/IEC. Additional work is planned to extend the

standard to support several additional application scenarios

including professional uses with enhanced precision and color

format support, scalable video coding, and 3D / stereo /

multiview video coding. In ISO/IEC, the HEVC standard will

become MPEG-H Part 2 (ISO/IEC 23008-2) and in ITU-T it is

likely to become ITU-T Recommendation H.265.

Video coding standards have evolved primarily through the

development of the well-known ITU-T and ISO/IEC standards.

The ITU-T produced H.261 [2] and H.263 [3], ISO/IEC

produced MPEG-1 [4] and MPEG-4 Visual [5], and the two

organizations jointly produced the H.262/MPEG-2 Video [6]

and H.264/MPEG-4 AVC [7] standards. The two standards that

were jointly produced have had a particularly strong impact and

have found their way into a wide variety of products that are

increasingly prevalent in our daily lives. Throughout this

evolution, continued efforts have been made to maximize

compression capability and improve other characteristics such

as data loss robustness, while considering the computational

resources that were practical for use in products at the time of

anticipated deployment of each standard.

Manuscript received May 25, 2012.

G. J. Sullivan is with Microsoft Corporation, Redmond, WA 98052 USA

(e-mail: garysull@microsoft.com).

J.-R. Ohm is with the Institute of Communications Engineering, RWTH

Aachen University, Aachen 52056, Germany (e-mail:

ohm@ient.rwth-aachen.de).

W.-J. Han is with the Dept. of Software Development and Management,

Gachon University, Seongnam, 461-701, Korea (corresponding author to

provide phone: +82-31-750-8668; fax: +82-31-750-5662; e-mail:

hurumi@gmail.com).

T. Wiegand is jointly affiliated with the Fraunhofer Institute for

Telecommunications, Heinrich Hertz Institute, Einsteinufer 37, and the Berlin

Institute of Technology, Einsteinufer 35, both in 10587 Berlin, Germany

(e-mail: twiegand@ieee.org).

The major video coding standard directly preceding the

HEVC project was H.264/MPEG-4 Advanced Video Coding

(AVC), which was initially developed during 1999–2003, and

then was extended in several important ways during

2003–2009. H.264/MPEG-4 AVC was an enabling technology

for digital video in almost every area that was not previously

covered by H.262/MPEG-2 Video, and has substantially

displaced the older standard within its existing application

domain. It is widely used for many applications, including

broadcast of high definition (HD) TV signals over satellite,

cable, and terrestrial transmission systems, video content

acquisition and editing systems, camcorders, security

applications, Internet and mobile network video, Blu-ray discs,

and real-time conversational applications such as video chat,

video conferencing, and telepresence systems.

However, an increasing diversity of services, the growing

popularity of HD video, and the emergence of beyond-HD

formats (e.g. 4k×2k or 8k×4k resolution) are creating even

stronger needs for coding efficiency superior to

H.264/MPEG-4 AVC’s capabilities. The need is even stronger

when higher resolution is accompanied by stereo or multi-view

capture and display. Moreover, the traffic caused by video

applications targeting mobile devices and tablet-PCs, as well as

the transmission needs for video on demand services, are

imposing severe challenges on today’s networks. An increased

desire for higher quality and resolutions is also arising in

mobile applications.

HEVC has been designed to address essentially all existing

applications of H.264/MPEG-4 AVC and to particularly focus

on two key issues: increased video resolution and increased use

of parallel processing architectures. The syntax of HEVC is

generic and should also be generally suited for other

applications that are not specifically mentioned above.

As has been the case for all past ITU-T and ISO/IEC video

coding standards, in HEVC only the bitstream structure and

syntax is standardized, as well as constraints on the bitstream

and its mapping for the generation of decoded pictures. The

mapping is given by defining the semantic meaning of syntax

elements and a decoding process such that every decoder

conforming to the standard will produce the same output when

given a bitstream that conforms to the constraints of the

standard. This limitation of the scope of the standard permits

maximal freedom to optimize implementations in a manner

appropriate to specific applications (balancing compression

quality, implementation cost, time to market, etc.). However, it

provides no guarantees of end-to-end reproduction quality, as it

allows even crude encoding techniques to be considered

conforming.

Overview of the High Efficiency Video Coding

(HEVC) Standard

Gary J. Sullivan, Fellow IEEE, Jens-Rainer Ohm, Member IEEE, Woo-Jin Han, Member IEEE, and

Thomas Wiegand, Fellow IEEE

To assist the industry community in learning how to use the

standard, the standardization effort not only includes the

development of a text specification document, but also

reference software source code as an example of how HEVC

video can be encoded and decoded. The draft reference

software has been used as a research tool for the internal work

of the committee during the design of the standard, and can also

be used as a general research tool and as the basis of products.

A standard test data suite is also being developed for testing

conformance to the standard.

This paper is organized as follows. Section II highlights

some key features of the HEVC coding design. Section III

explains the high-level syntax and the overall structure of

HEVC coded video data. The HEVC video coding technology

is then described in greater detail in Section IV. Section V

explains the profile, tier and level design of HEVC, including

its Main profile in particular. Since writing an overview of a

technology as substantial as HEVC involves a significant

amount of summarization, the reader is referred to [1] for any

omitted details. The history of the HEVC standardization effort

is discussed in Section VI.

II. HEVC CODING DESIGN AND FEATURE HIGHLIGHTS

The HEVC standard is designed to achieve multiple goals:

coding efficiency, transport system integration and data loss

resilience, as well as implementability using parallel processing

architectures. The following sub-sections describe at a glance

the key elements of the design by which these goals are

achieved, and the typical encoder operation which would

generate a valid bitstream. (More details about the associated

syntax and decoding process of the different elements are

provided in Sections III and IV.)

A. Video coding layer

The video coding layer of HEVC employs the same “hybrid”

approach (inter-/intra-picture prediction and 2D transform

coding) used in all video compression standards since H.261.

Fig. 1 depicts the block diagram of a hybrid video encoder,

which could create a bitstream conforming to the HEVC

standard.

An encoding algorithm producing an HEVC compliant

bitstream would typically proceed as follows. Each picture is

split into block-shaped regions, with the exact block

partitioning being conveyed to the decoder. The first picture of

a video sequence (and the first picture at each “clean” random

access point into a video sequence) is coded using only

intra-picture prediction (which uses some prediction of data

spatially from region-to-region within the same picture but has

no dependence on other pictures). For all remaining pictures of

a sequence or between random access points, inter-picture

temporally-predictive coding modes are typically used for most

blocks. The encoding process for inter-picture prediction

consists of choosing motion data comprising the selected

reference picture and motion vector (MV) to be applied for

predicting the samples of each block. The encoder and decoder

generate identical inter prediction signals by applying motion

compensation (MC) using the MV and mode decision data,

which are transmitted as side information.

[Fig. 1. Typical HEVC video encoder. Block diagram: the input video signal is split into CTUs and passes through general coder control; transform, scaling & quantization; scaling & inverse transform; intra-picture estimation and prediction; motion estimation and compensation; intra/inter selection; filter control analysis; deblocking & SAO filters; and the decoded picture buffer. Header formatting & CABAC produces the coded bitstream.]
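The choice of motion data described above is an encoder decision that the standard leaves open. As a minimal sketch of one common textbook approach (not a method taken from this paper), the following Python code runs an exhaustive integer-sample block-matching search that minimizes the sum of absolute differences (SAD). The function names, the toy pictures, and the search range are all assumptions made for illustration; a practical HEVC encoder would also search fractional-sample positions and weigh the cost of signaling the vector.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at (bx, by)
    and the block of `ref` displaced by the candidate vector (dx, dy)."""
    return sum(
        abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
        for y in range(n)
        for x in range(n)
    )


def full_search(cur, ref, bx, by, n, search_range):
    """Return (best_sad, (dx, dy)) over all integer displacements that keep
    the displaced reference block inside the picture."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            if not (0 <= by + dy and by + dy + n <= height):
                continue
            if not (0 <= bx + dx and bx + dx + n <= width):
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, (dx, dy))
    return best


# Toy data (hypothetical): the current picture is the reference picture
# shifted right by two samples, so the best match lies two samples to the left.
ref = [[(7 * x + 13 * y) % 251 for x in range(16)] for y in range(16)]
cur = [[ref[y][max(x - 2, 0)] for x in range(16)] for y in range(16)]

best_sad, mv = full_search(cur, ref, bx=8, by=4, n=4, search_range=3)
print(best_sad, mv)
```

For this toy input the search recovers the displacement (-2, 0) with a SAD of 0. The decoder never repeats the search: it only applies the transmitted vector to fetch the prediction block.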

The residual signal of the intra or inter prediction, which is

the difference between the original block and its prediction, is

transformed by a linear spatial transform. The transform

coefficients are then scaled, quantized, entropy coded, and

transmitted together with the prediction information.
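The residual, transform, and quantization steps can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the HEVC design: it uses a floating-point orthonormal DCT-II rather than the integer basis functions the standard defines, a single hypothetical quantizer step size `STEP`, and made-up sample values. Entropy coding of the resulting levels is omitted.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II matrix; row k holds the k-th basis function."""
    return [
        [math.sqrt((1 if k == 0 else 2) / n)
         * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
         for i in range(n)]
        for k in range(n)
    ]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(col) for col in zip(*m)]


C = dct_matrix(4)
C_T = transpose(C)


def forward_transform(block):
    """Separable 2D transform: C * block * C^T."""
    return matmul(matmul(C, block), C_T)


def inverse_transform(coeffs):
    """Inverse of the orthonormal transform: C^T * coeffs * C."""
    return matmul(matmul(C_T, coeffs), C)


# Hypothetical 4x4 original block and a flat prediction of it.
original = [[52, 55, 61, 66],
            [63, 59, 55, 90],
            [62, 59, 68, 113],
            [63, 58, 71, 122]]
prediction = [[60] * 4 for _ in range(4)]
residual = [[o - p for o, p in zip(orow, prow)]
            for orow, prow in zip(original, prediction)]

STEP = 8  # assumed quantizer step size
levels = [[round(c / STEP) for c in row] for row in forward_transform(residual)]

# Decoder side: inverse scaling, inverse transform, add the prediction.
scaled = [[lv * STEP for lv in row] for row in levels]
decoded_residual = inverse_transform(scaled)
reconstructed = [[p + round(r) for p, r in zip(prow, rrow)]
                 for prow, rrow in zip(prediction, decoded_residual)]
```

Only `levels` (together with the prediction information) would be transmitted. The reconstruction differs from `original` by an error that grows with the step size, which is where the lossy part of the compression comes from.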

The encoder duplicates the decoder processing loop such that

both will generate identical predictions for subsequent data.

Therefore, the quantized transform coefficients are constructed

by inverse scaling and are then inverse transformed to duplicate

the decoded approximation of the residual signal. The residual

is then added to the prediction, and the result of that addition

may then be fed into one or two loop filters to smooth out

artifacts induced by the block-wise processing and quantization.

The final picture representation (which is a duplicate of the

output of the decoder) is stored in a decoded picture buffer to be

used for the prediction of subsequent pictures. In general, the

order of the encoding or decoding processing of pictures often

differs from the order in which they arrive from the source,

necessitating a distinction between the decoding order (a.k.a.

bitstream order) and the output order (a.k.a. display order) for

a decoder.
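Why the encoder must duplicate the decoder loop can be seen in a one-dimensional toy example. The Python sketch below is an assumption-laden illustration, not HEVC: each sample is predicted from the previous one and the residual is quantized with a hypothetical step size. The closed-loop encoder predicts from its own reconstruction, as described above; the open-loop variant predicts from the original samples, which the decoder never sees.

```python
def quantize(value, step):
    """Uniform scalar quantizer returning an integer level."""
    return round(value / step)


def encode_closed_loop(samples, step):
    """Predict each sample from the previous *reconstructed* sample."""
    levels, recon = [], []
    prev = 0
    for s in samples:
        level = quantize(s - prev, step)
        prev = prev + level * step  # the same arithmetic the decoder performs
        levels.append(level)
        recon.append(prev)
    return levels, recon


def encode_open_loop(samples, step):
    """Flawed variant: predict from the previous *original* sample."""
    levels, prev = [], 0
    for s in samples:
        levels.append(quantize(s - prev, step))
        prev = s
    return levels


def decode(levels, step):
    """Rebuild the samples by accumulating the dequantized residuals."""
    out, prev = [], 0
    for level in levels:
        prev = prev + level * step
        out.append(prev)
    return out


samples = [3 * i for i in range(20)]  # hypothetical slow ramp
step = 8                              # assumed quantizer step size

closed_levels, encoder_recon = encode_closed_loop(samples, step)
closed_decoded = decode(closed_levels, step)
open_decoded = decode(encode_open_loop(samples, step), step)

closed_error = max(abs(s - d) for s, d in zip(samples, closed_decoded))
open_error = max(abs(s - d) for s, d in zip(samples, open_decoded))
print(closed_error, open_error)
```

In the closed-loop case the decoder output equals the encoder's own reconstruction and the error never exceeds half a quantizer step. In the open-loop case every residual of 3 quantizes to zero, so the decoder output stays flat while the source keeps rising and the error accumulates.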

Video material to be encoded by HEVC is generally

expected to be input as progressive scan imagery (either due to

the source video originating in that format or resulting from

de-interlacing prior to encoding). No explicit coding features

are present in the HEVC design to support the use of interlaced

scanning, as interlaced scanning is no longer used for displays

and is becoming substantially less common for distribution.

However, metadata syntax has been provided in HEVC to

allow an encoder to indicate that interlace-scanned video has

been sent by coding each field (i.e. the even or odd numbered

lines of each video frame) of interlaced video as a separate

picture or that it has been sent by coding each interlaced frame

as an HEVC coded picture. This provides an efficient method

of coding interlaced video without burdening decoders with a

need to support a special decoding process for it.
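The two ways of carrying interlaced material differ only in how lines are grouped into pictures. The Python sketch below is a hypothetical illustration: `split_fields` separates a frame into its even-line and odd-line fields so that each could be coded as a separate picture, and `weave` interleaves them again. The names "top" and "bottom" and the list-of-rows frame representation are assumptions, and the metadata that would tell a decoder which form was used is not modeled.

```python
def split_fields(frame):
    """Split a frame (a list of rows) into its two fields."""
    top = frame[0::2]     # even-numbered lines 0, 2, 4, ...
    bottom = frame[1::2]  # odd-numbered lines 1, 3, 5, ...
    return top, bottom


def weave(top, bottom):
    """Interleave two fields back into a single frame."""
    frame = []
    for top_row, bottom_row in zip(top, bottom):
        frame.append(top_row)
        frame.append(bottom_row)
    if len(top) > len(bottom):  # frame with an odd number of lines
        frame.append(top[-1])
    return frame


# Toy frame: six rows of four samples, each row filled with its line number.
frame = [[line] * 4 for line in range(6)]
top_field, bottom_field = split_fields(frame)
```

Either the two fields or the woven frame can then be passed to the encoder as ordinary pictures, which is why no special decoding process is needed.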

In the following, the various features involved in hybrid

video coding using HEVC are highlighted:

• Coding Tree Units and Coding Tree Block structure:

The core of the coding layer in previous standards was the

macroblock, containing a 16×16 block of luma samples

and, in the usual case of 4:2:0 color sampling, two

corresponding 8×8 blocks of chroma samples; whereas the

analogous structure in HEVC is the coding tree unit

(CTU), which has a size selected by the encoder and can be

larger than a traditional macroblock. The CTU consists of a

luma coding tree block (CTB) and the corresponding

chroma CTBs and syntax elements. The size L×L of a luma

CTB can be chosen as L = 16, 32, or 64 samples, with the

larger sizes typically enabling better compression. HEVC

then supports a partitioning of the CTBs into smaller

blocks using a tree structure and quadtree-like

signaling [8].

• Coding Units and Coding Blocks: The quadtree syntax of

the CTU specifies the size and positions of its luma and

chroma coding blocks (CBs). The root of the quadtree is

associated with the CTU. Hence, the size of the luma CTB

is the largest supported size for a luma CB. The splitting of

a CTU into luma and chroma CBs is signaled jointly. One

luma CB and ordinarily two chroma CBs, together with

associated syntax, form a Coding Unit (CU). A CTB may

contain only one CU or may be split to form multiple CUs,

and each CU has an associated partitioning into prediction

units (PUs) and a tree of transform units (TUs).

• Prediction Units and Prediction Blocks: The decision

whether to code a picture area using inter-picture or

intra-picture prediction is made at the CU level. A

prediction unit (PU) partitioning structure has its root at

the CU level. Depending on the basic prediction type

decision, the luma and chroma CBs can then be further

split in size and predicted from luma and chroma

prediction blocks (PBs). HEVC supports variable PB sizes

from 64×64 down to 4×4 samples.

• Transform Units and Transform Blocks: The prediction

residual is coded using block transforms. A transform unit

(TU) tree structure has its root at the CU level. The luma

CB residual may be identical to the luma transform block

(TB) or may be further split into smaller luma TBs. The

same applies to the chroma TBs. Integer basis functions

similar to those of a discrete cosine transform (DCT) are

defined for the square TB sizes 4×4, 8×8, 16×16, and

32×32. For the 4×4 transform of intra-picture prediction

residuals, an integer transform derived from a form of

discrete sine transform (DST) is alternatively specified.

• Motion vector signaling: Advanced motion vector

prediction (AMVP) is used, including derivation of several

most probable candidates based on data from adjacent PBs

and the reference picture. A “merge” mode for MV coding

can also be used, allowing the inheritance of MVs from

neighboring PBs. Moreover, compared to H.264/MPEG-4

AVC, improved “skipped” and “direct” motion inference

are also specified.

• Motion compensation: Quarter-sample precision is used

for the MVs, and 7-tap or 8-tap filters are used for

interpolation of fractional-sample positions (compared to

6-tap filtering of half-sample positions followed by

bi-linear interpolation of quarter-sample positions in

H.264/MPEG-4 AVC). Similar to H.264/MPEG-4 AVC,

multiple reference pictures are used. For each PB, either

one or two motion vectors can be transmitted, resulting

either in uni-predictive or bi-predictive coding,

respectively. As in H.264/MPEG-4 AVC, a scaling and

offset operation may be applied to the prediction signal(s)

in a manner known as weighted prediction.

• Intra-picture prediction: The decoded boundary samples

of adjacent blocks are used as reference data for spatial

prediction in PB regions when inter-picture prediction is

not performed. Intra prediction supports 33 directional
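The coding tree structure described in the list above can be sketched as a recursive quadtree split. The Python code below is a minimal illustration under stated assumptions: `should_split` is a hypothetical stand-in for the encoder's real decision process, the minimum block size of 8 is a parameter chosen for the example, and only luma block geometry is modeled (for 4:2:0 sampling, each chroma block would be half the luma size in each dimension, as with the 16×16 luma and 8×8 chroma blocks of a macroblock).

```python
def split_ctb(x, y, size, min_size, should_split):
    """Recursively partition a square luma block.

    Returns a list of (x, y, size) coding blocks, visiting the four
    quadrants in top-left, top-right, bottom-left, bottom-right order.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        blocks = []
        for dy in (0, half):
            for dx in (0, half):
                blocks.extend(
                    split_ctb(x + dx, y + dy, half, min_size, should_split)
                )
        return blocks
    return [(x, y, size)]


def contains_detail(x, y, size):
    """Hypothetical decision rule: keep splitting whichever block covers the
    sample at (40, 8), as if fine detail were located there."""
    return x <= 40 < x + size and y <= 8 < y + size


# Partition one 64x64 luma CTB down to a minimum block size of 8x8.
blocks = split_ctb(0, 0, 64, 8, contains_detail)
for block in blocks:
    print(block)
```

With this rule the CTB ends up as ten coding blocks: three of size 32×32, three of size 16×16, and four of size 8×8 clustered around the detailed sample. A real encoder would make each split decision by comparing the coding cost of the split and unsplit alternatives and would then signal the chosen tree to the decoder.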
