1328 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 10, OCTOBER 2004
deviations from translational motion significantly degrade per-
formance.
Another popular class of methods solves the problem of reso-
lution enhancement in the spatial domain. Non-iterative spatial
domain data fusion approaches were proposed in [4]–[6]. The
iterative back-projection method was developed in papers such
as [7] and [8]. In [9], the authors suggested a method based on
the multichannel sampling theorem. In [10], a hybrid method,
combining the simplicity of ML with proper prior information
was suggested.
The spatial domain methods discussed so far are generally
computationally expensive. The authors in [11] introduced a
block circulant preconditioner for solving the Tikhonov regular-
ized super-resolution problem formulated in [10] and addressed
the calculation of regularization factor for the under-determined
case by generalized cross validation in [12]. Later, a very fast
super-resolution algorithm for pure translational motion and
common space invariant blur was developed in [5]. Another
fast spatial domain method was recently suggested in [13],
where LR images are registered with respect to a reference
frame defining a nonuniformly spaced high-resolution (HR)
grid. Then, an interpolation method called Delaunay trian-
gulation is used for creating a noisy and blurred HR image,
which is subsequently deblurred. All of the above methods
assumed the additive Gaussian noise model. Furthermore,
regularization was either not implemented or it was limited to
Tikhonov regularization. Considering outliers, [14] describes a
very successful robust super-resolution method, but lacks the
proper mathematical justification ( limitations of this robust
method and its relation to our proposed method are discussed
in Appendix B). Finally, [15] and [16] have considered quan-
tization noise resulting from video compression and proposed
iterative methods to reduce compression noise effects in the
super-resolved outcome.
The two most common matrix notations used to formulate the
general super-resolution model of (1) represent the problem in
the pixel domain. The more popular notation used in [5], [11],
and [14] considers only camera lens blur and is defined as
(2)
where the
matrix is the geometric motion
operator between the HR frame
(of size ) and the
th LR frame (of size ) which are rearranged in
lexicographic order and
is the resolution enhancement factor.
The camera’s point spread function (PSF) is modeled by the
blur matrix , and matrix
represents the decimation operator. The vector
is the system noise and is the number of available LR frames.
Considering only atmosphere and motion blur, [13] recently
presented an alternate matrix formulation of (1) as
(3)
In conventional imaging systems (such as video cameras),
camera lens blur has a more important effect than the atmo-
spheric blur (which is very important for astronomical images).
In this paper, we use the model (2). Note that, under some
assumptions which will be discussed in Section II-B, blur and
motion matrices commute and the general matrix super-resolu-
tion formulation from (1) can be rewritten as
(4)
Defining
merges both models into a form
similar to (2).
In this paper, we propose a fast and robust super-resolution al-
gorithm using the
norm, both for the regularization and the
data fusion terms. Whereas the former is responsible for edge
preservation, the latter seeks robustness with respect to motion
error, blur, outliers, and other kinds of errors not explicitly mod-
eled in the fused images. We show that our method’s perfor-
mance is superior to what was proposed earlier in [5], [11], [14],
etc., and has fast convergence. We also mathematically justify a
noniterative data fusion algorithm using a median operation and
explain its superior performance.
This paper is organized as follows. Section II explains the
main concepts of robust super resolution. Section II-B justifies
using the
norm to minimize the data error term; Section II-C
justifies using our proposed regularization term. Section II-D
combines the results of the two previous sections and explains
our method and Section II-E proposes a faster implementation
method. Simulations on both real and synthetic data sequences
are presented in Section III, and Section IV concludes this paper.
II. R
OBUST SUPER
RESOLUTION
A. Robust Estimation
Estimation of an unknown HR image is not exclusively based
on the LR measurements. It is also based on many assumptions
such as noise or motion models. These models are not supposed
to be exactly true, as they are merely mathematically convenient
formulations of some general prior information.
From many available estimators, which estimate a HR image
from a set of noisy LR images, one may choose an estimation
method which promises the optimal estimation of the HR frame,
based on certain assumptions on data and noise models. When
the fundamental assumptions of data and noise models do not
faithfully describe the measured data, the estimator performance
degrades. Furthermore, existence of outliers, which are defined
as data points with different distributional characteristics than
the assumed model, will produce erroneous estimates. A method
which promises optimality for a limited class of data and noise
models may not be the most effective overall approach. Often,
suboptimal estimation methods which are not as sensitive to
modeling and data errors may produce better and more stable
results (robustness).
To study the effect of outliers, the concept of a breakdown
point has been used to measure the robustness of an algorithm.
The breakdown point is the smallest percentage of outlier con-
tamination that may force the value of the estimate outside some
range [17]. For instance, the breakdown point of the simple
mean estimator is zero, meaning that one single outlier is suffi-
cient to move the estimate outside any predicted bound. A robust
estimator, such as the median estimator, may achieve a break-
down equal to 0.5, which is the highest value for breakdown