ImageQualityAssessmentFromErrorVisibilitytoStructuralSimilarity资源-CSDN文库

毕业设计

需积分: 1 129 浏览量 2024-04-21 13:13:31 上传评论收藏 1.63MB PDF 举报

资源推荐

资源详情

资源评论

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/3327793

Image Quality Assessment: From Error Visibility to Structural Similarity

ArticleinIEEE Transactions on Image Processing · May 2004

DOI: 10.1109/TIP.2003.819861·Source: IEEE Xplore

CITATIONS

19,306

READS

9,976

4 authors, including:

Some of the authors of this publication are also working on these related projects:

Predicting the quality of images compressed after distortion in two steps View project

Create new project "Perceptual Quality" View project

Zhou Wang

University of Waterloo

228 PUBLICATIONS50,145 CITATIONS

SEE PROFILE

Alan Bovik

University of Texas at Austin

906 PUBLICATIONS98,413 CITATIONS

SEE PROFILE

Eero P. Simoncelli

New York University

349 PUBLICATIONS73,865 CITATIONS

SEE PROFILE

All content following this page was uploaded by Eero P. Simoncelli on 23 September 2014.

The user has requested enhancement of the downloaded file.

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 4, APRIL 2004 1

Image Quality Assessment: From Error Visibility to

Structural Similarity

Zhou Wang, Member, IEEE, Alan C. Bovik, Fellow, IEEE

Hamid R. Sheikh, Student Member, IEEE, and Eero P. Simoncelli, Senior Member, IEEE

Abstract— Objective methods for assessing perceptual im-

age quality have traditionally attempted to quantify the vis-

ibility of errors between a distorted image and a reference

image using a variety of known properties of the human

visual system. Under the assumption that human visual

perception is highly adapted for extracting structural infor-

mation from a scene, we introduce an alternative framework

for quality assessment based on the degradation of struc-

tural information. As a speciﬁc example of this concept,

we develop a Structural Similarity Index and demonstrate

its promise through a set of intuitive examples, as well as

comparison to both subjective ratings and state-of-the-art

objective methods on a database of images compressed with

JPEG and JPEG2000.

Keywords— Error sensitivity, human visual system (HVS),

image coding, image quality assessment, JPEG, JPEG2000,

perceptual quality, structural information, structural simi-

larity (SSIM).

I. Introduction

Digital images are subject to a wide variety of distor-

tions during acquisition, pro cessing, compression, storage,

transmission and reproduction, any of which may result

in a degradation of visual quality. For applications in

which images are ultimately to be viewed by human be-

ings, the only “correct” method of quantifying visual im-

age quality is through subjective evaluation. In practice,

however, subjective evaluation is usually too inconvenient,

time-consuming and expensive. The goal of research in ob-

jective image quality assessment is to develop quantitative

measures that can automatically predict perceived image

quality.

An objective image quality metric can play a variety of

roles in image processing applications. First, it can be

used to dynamically monitor and adjust image quality. For

example, a network digital video server can examine the

quality of video being transmitted in order to control and

allocate streaming resources. Second, it can be used to

optimize algorithms and parameter settings of image pro-

cessing systems. For instance, in a visual communication

The work of Z. Wang and E. P. Simoncelli was supported by the

Howard Hughes Medical Institute. The work of A. C. Bovik and H.

R. Sheikh was supported by the National Science Foundation and the

Texas Advanced Research Program. Z. Wang and E. P. Simoncelli are

with the Howard Hughes Medical Institute, the Center for Neural Sci-

ence and the Courant Institute for Mathematical Sciences, New York

University, New York, NY 10012 USA (email: zhouwang@ieee.org;

eero.simoncelli@nyu.edu). A. C. Bovik and H. R. Sheikh are with the

Laboratory for Image and Video Engineering (LIVE), Department

of Electrical and Computer Engineering, The University of Texas

at Austin, Austin, TX 78712 USA (email: bovik@ece.utexas.edu;

hamid.sheikh@ieee.org).

A MatLab implementation of the proposed algorithm is available

online at http://www.cns.nyu.edu/~lcv/ssim/.

system, a quality metric can assist in the optimal design of

preﬁltering and bit assignment algorithms at the encoder

and of optimal reconstruction, error concealment and post-

ﬁltering algorithms at the decoder. Third, it can be used

to benchmark image processing systems and algorithms.

Objective image quality metrics can be classiﬁed accord-

ing to the availability of an original (distortion-free) image,

with which the distorted image is to be compared. Most

existing approaches are known as full-reference, meaning

that a complete reference image is assumed to be known. In

many practical applications, however, the reference image

is not available, and a no-reference or “blind” quality as-

sessment approach is desirable. In a third type of method,

the reference image is only partially available, in the form

of a set of extracted features made available as side infor-

mation to help evaluate the quality of the distorted image.

This is referred to as reduced-reference quality assessment.

This paper focuses on full-reference image quality assess-

ment.

The simplest and most widely used full-reference quality

metric is the mean squared error (MSE), computed by aver-

aging the squared intensity diﬀerences of distorted and ref-

erence image pixels, along with the related quantity of peak

signal-to-noise ratio (PSNR). These are appealing because

they are simple to calculate, have clear physical meanings,

and are mathematically convenient in the context of opti-

mization. But they are not very well matched to perceived

visual quality (e.g., [1]–[9]). In the last three decades, a

great deal of eﬀort has gone into the development of quality

assessment methods that take advantage of known charac-

teristics of the human visual system (HVS). The majority

of the proposed perceptual quality assessment models have

followed a strategy of modifying the MSE measure so that

errors are penalized in accordance with their visibility. Sec-

tion II summarizes this type of error-sensitivity approach

and discusses its diﬃculties and limitations. In Section III,

we describe a new paradigm for quality assessment, based

on the hypothesis that the HVS is highly adapted for ex-

tracting structural information. As a speciﬁc example, we

develop a measure of structural similarity that compares lo-

cal patterns of pixel intensities that have been normalized

for luminance and contrast. In Section IV, we compare the

test results of diﬀerent quality assessment models against

a large set of subjective ratings gathered for a database of

344 images compressed with JPEG and JPEG2000.

2 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 4, APRIL 2004

Reference

signal

Distorted

signal

Quality/

Distortion

Measure

Channel

Decomposition

Error

Normalization

.

Error

Pooling

Pre-

processing

CSF

Filtering

.

Fig. 1. A prototypical quality assessment system based on error sensitivity. Note that the CSF feature can be implemented either as a

separate stage (as shown) or within “Error Normalization”.

II. Image Quality Assessment Based on Error

Sensitivity

An image signal whose quality is being evaluated can

be thought of as a sum of an undistorted reference signal

and an error signal. A widely adopted assumption is that

the loss of perceptual quality is directly related to the vis-

ibility of the error signal. The simplest implementation

of this concept is the MSE, which objectively quantiﬁes

the strength of the error signal. But two distorted images

with the same MSE may have very diﬀerent types of errors,

some of which are much more visible than others. Most

perceptual image quality assessment approaches prop osed

in the literature attempt to weight diﬀerent aspects of the

error signal according to their visibility, as determined by

psychophysical measurements in humans or physiological

measurements in animals. This approach was pioneered

by Mannos and Sakrison [10], and has been extended by

many other researchers over the years. Reviews on image

and video quality assessment algorithms can be found in

[4], [11]–[13].

A. Framework

Fig. 1 illustrates a generic image quality assessment

framework based on error sensitivity. Most perceptual

quality assessment models can be described with a simi-

lar diagram, although they diﬀer in detail. The stages of

the diagram are as follows:

Pre-processing. This stage typically performs a variety

of basic operations to eliminate known distortions from the

images being compared. First, the distorted and reference

signals are properly scaled and aligned. Second, the signal

might be transformed into a color space (e.g., [14]) that is

more appropriate for the HVS. Third, quality assessment

metrics may need to convert the digital pixel values stored

in the computer memory into luminance values of pixels on

the display device through pointwise nonlinear transforma-

tions. Fourth, a low-pass ﬁlter simulating the point spread

function of the eye optics may be applied. Finally, the ref-

erence and the distorted images may be modiﬁed using a

nonlinear point operation to simulate light adaptation.

CSF Filtering. The contrast sensitivity function (CSF)

describes the sensitivity of the HVS to diﬀerent spatial and

temporal frequencies that are present in the visual stim-

ulus. Some image quality metrics include a stage that

weights the signal according to this function (typically im-

plemented using a linear ﬁlter that approximates the fre-

quency response of the CSF). However, many recent met-

rics choose to implement CSF as a base-sensitivity normal-

ization factor after channel decomp osition.

Channel Decomposition. The images are typically sep-

arated into subbands (commonly called “channels” in the

psychophysics literature) that are selective for spatial and

temporal frequency as well as orientation. While some

quality assessment methods implement sophisticated chan-

nel decompositions that are believed to be closely re-

lated to the neural responses in the primary visual cortex

[2], [15]–[19], many metrics use simpler transforms such as

the discrete cosine transform (DCT) [20], [21] or separa-

ble wavelet transforms [22]–[24]. Channel decompositions

tuned to various temporal frequencies have also been re-

ported for video quality assessment [5], [25].

Error Normalization. The error (diﬀerence) between the

decomposed reference and distorted signals in each channel

is calculated and normalized according to a certain masking

model, which takes into account the fact that the presence

of one image component will decrease the visibility of an-

other image component that is proximate in spatial or tem-

poral location, spatial frequency, or orientation. The nor-

malization mechanism weights the error signal in a channel

by a space-varying visibility threshold [26]. The visibility

threshold at each point is calculated based on the energy

of the reference and/or distorted coeﬃcients in a neighbor-

hood (which may include coeﬃcients from within a spatial

neighborhood of the same channel as well as other chan-

nels) and the base-sensitivity for that channel. The normal-

ization process is intended to convert the error into units of

just noticeable diﬀerence (JND). Some methods also con-

sider the eﬀect of contrast resp onse saturation (e.g., [2]).

Error Pooling. The ﬁnal stage of all quality metrics must

combine the normalized error signals over the spatial extent

of the image, and across the diﬀerent channels, into a single

value. For most quality assessment methods, pooling takes

the form of a Minkowski norm:

E ({e

l,k

}) =

l,k

1/β

(1)

where e

l,k

is the normalized error of the k-th coeﬃcient in

the l-th channel, and β is a constant exponent typically

chosen to lie between 1 and 4. Minkowski pooling may be

performed over space (index k) and then over frequency

(index l ), or vice-versa, with some non-linearity between

them, or possibly with diﬀerent exponents β. A spatial

剩余14页未读，继续阅读

评论收藏

内容反馈

Jacen.L

粉丝: 208
资源: 5

ImageQualityAssessmentFromErrorVisibilitytoStructuralSimilarity

34个经典javaweb项目实例.zip

毕业设计 springBoot人力资源管理系统+毕业论文+前后端源代码

项目源码：基于Hadoop+Spark招聘推荐可视化系统 大数据项目 计算机毕业设计

基于spring boot的小区物业管理系统源码+论文+答辩ppt

毕业设计：舆情监测系统（SpringBoot+NLP）

计算机毕业设计：Flask股票数据采集分析可视化系统 python+爬虫+金融数据

人脸识别系统OpenCV+dlib+python（含数据库）Pyqt5界面设计 项目源码 毕业设计

毕业设计-基于JAVA的springboot超市进销存系统(源代码+论文）

基于深度学习的课堂行为识别和考试作弊检测系统的设计与实现（python源码）

基于51单片机的智能电子秤系统设计(含代码仿真及论文)

Python爬取智联招聘网站数据，2023.10.31测试，可跑

不错的可用来练手、课程设计、毕业设计的Javaweb项目源码：仓库管理系统.rar

计算机毕业设计源码：基于python旅游推荐系统+爬虫+分析可视化 +django框架

基于SpringBoot+Vue的学生选课管理系统的毕业设计，Vue+SpringBoot+MybatisPlus+MySQL

04基于stm32单片机智能宠物管理系统源代码+PCB+原理图+仿真+论文

计算机毕业设计：基于python微博舆情分析可视化系统+爬虫+情感分析+Flask框架 项目源码

基于Hadoop+Spark招聘推荐可视化系统 大数据项目 毕业设计（源码下载）

计算机毕业设计：基于python美食推荐系统 +协同过滤推荐算法+django框架（包含文档+源码+部署教程）

小剧场短剧影视小程序源码 全开源 带支付等模式 付费短剧小程序源码.rar

学术海报模板+论文科研+研究生

stm32毕业设计集合源码加资料

电路设计工程计算基础 (武晔卿)

02基于stm32超声波仿真系统系统（程序源码+仿真+论文）项目

第十九届研电赛-技术论文模板

时间序列预测实战(十九)魔改Informer模型进行滚动长期预测（科研版本，结果可视化）

基于eNSP模拟企业网的实现（代码＋毕业设计＋论文）

计算机毕业设计源码：基于python气象数据采集预测可视化系统 （机器学习）预测模型+爬虫

基于Java的疫情防控管理信息系统的设计与实现【附源码】

很棒的毕业设计、课程设计、练手的java项目-仓库商品管理系统(文档+视频+源码).rar

最新资源

项目源码：基于Hadoop+Spark招聘推荐可视化系统大数据项目计算机毕业设计

人脸识别系统OpenCV+dlib+python（含数据库）Pyqt5界面设计项目源码毕业设计

计算机毕业设计：基于python微博舆情分析可视化系统+爬虫+情感分析+Flask框架项目源码

基于Hadoop+Spark招聘推荐可视化系统大数据项目毕业设计（源码下载）

小剧场短剧影视小程序源码全开源带支付等模式付费短剧小程序源码.rar

计算机毕业设计源码：基于python气象数据采集预测可视化系统（机器学习）预测模型+爬虫