Remote Sensing
Article
Hyperspectral Target Detection via Adaptive Information-Theoretic Metric Learning with Local Constraints
Yanni Dong 1, Bo Du 2,*, Liangpei Zhang 3 and Xiangyun Hu 1

1 Hubei Subsurface Multi-Scale Imaging Key Laboratory, Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan 430074, China; dongyanni@cug.edu.cn (Y.D.); xyhu@cug.edu.cn (X.H.)
2 School of Computer, Wuhan University, Wuhan 430079, China
3 State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan 430079, China; zlp62@whu.edu.cn
* Correspondence: gunspace@163.com; Tel.: +86-138-7146-1059
Received: 18 July 2018; Accepted: 3 September 2018; Published: 6 September 2018
Abstract: By exploiting their high spectral resolution, hyperspectral images (HSIs) provide significant information for target detection, which is of great interest in HSI processing. However, most classical target detection methods perform well only under certain assumptions. At the same time, using limited numbers of target samples while preserving discriminative information is also a challenging problem in hyperspectral target detection. To overcome these shortcomings, this paper proposes a novel adaptive information-theoretic metric learning with local constraints (ITML-ALC) for hyperspectral target detection. The proposed method first uses the information-theoretic metric learning (ITML) method as the objective function for learning a Mahalanobis distance that separates similar and dissimilar point-pairs without certain assumptions and needs fewer adjusted parameters. Then, adaptive local constraints are applied to shrink the distances between samples of similar pairs and expand the distances between samples of dissimilar pairs. Finally, the target detection decision can be made by considering both the threshold and the changes between the distances before and after metric learning. Experimental results demonstrate that the proposed method can clearly separate target samples from background ones and outperforms both state-of-the-art target detection algorithms and other classical metric learning methods.
Keywords: hyperspectral image; target detection; metric learning; local constraints
1. Introduction
A hyperspectral image (HSI) obtained by remote sensing systems can provide significant information. Each pixel of an HSI contains a continuous spectrum with hundreds or even thousands of spectral bands, each about 5–10 nm wide, which can be used to detect and characterize targets of interest in the scene [1,2]. Target detection is one of the most widespread applications of hyperspectral image processing, and it plays an important role in the real world, such as detecting man-made objects in reconnaissance applications, searching for rare minerals in geology, and investigating environmental pollution [3–5]. Based on specific spectral signatures (prior information), the purpose of target detection is to decide whether a target of interest is present or absent (background) in a pixel under test, which can be viewed as a binary classification problem [6,7].
A number of classical target detection algorithms have been proposed in HSI analysis. Most of them are based on linear models and statistical hypothesis tests, which can maximize the detection probability for a fixed false-alarm probability, such as orthogonal subspace projection (OSP) and the adaptive cosine/coherence estimator (ACE). The OSP method proposed by Harsanyi et al. [8] suppresses the background signatures by projecting each pixel's spectrum onto a subspace that is orthogonal to the background signatures. The well-known ACE method proposed by Kraut et al. [9] assumes that the additive noise has been included in the background, making it an unstructured background detector. However, most classical algorithms depend on specific statistical hypothesis tests and may only perform well under certain conditions; e.g., the ACE detector assumes that the background is homogeneous, which is unrealistic in the real world.

Remote Sens. 2018, 10, 1415; doi:10.3390/rs10091415; www.mdpi.com/journal/remotesensing
In recent years, machine learning techniques have been introduced into HSI target detection and have attracted great attention [10,11]. Typical examples of these methods are kernel-based detectors, such as the kernel matched subspace detector (KMSD) [12], kernel spectral matched filter (KSMF) [13], and kernel OSP [14]. Kernel-based methods map the original feature space into a potentially high-dimensional kernel space to solve the linearly inseparable problem in the original space. However, as mentioned in [15], kernel-based methods are also based on statistical hypothesis tests and inherit the shortcomings of traditional target detection methods. It can be concluded that kernel-based methods attempt to find a stable and credible feature space (distance metric) for separating potential target pixels from background ones [16–18].
Moreover, the spectral resolution of HSIs is so high that their spectral bands are often highly correlated. To decrease spectral redundancy and relieve computational complexity, it is necessary to reduce the dimensionality by discarding redundant features for HSI target detection [19,20]. However, because there are so few target pixels of interest, HSI target detection rarely takes dimensionality reduction into consideration, which may harm the accuracy of detecting targets. That is to say, target detection usually faces a dilemma between reducing spectral redundancy and preserving discriminative information [21,22]. Thus, how to develop a proper metric with a low dimensionality for measuring the separability between target pixels and background ones becomes the key to HSI target detection [23].
In fact, metric learning methods have proved to be a more straightforward and effective way to obtain such a distance metric [24–26]. To date, a few metric learning methods have been proposed for HSI target detection. For example, Zhang et al. [15] learned an objective function of supervised distance maximization by applying a similarity propagation constraint and imposing a manifold smoothness regularization. Dong et al. [27] presented the maximum margin metric learning (MMML) method, which utilizes the maximum margin framework as the objective function to learn the distance metric space and can maximally separate target samples from background ones without certain assumptions. Dong et al. [28] presented the random forest metric learning (RFML) method, which adopts random forests as the underlying representation of the metric learning, to deal with limited numbers of target samples by merging the standard relative position and the absolute pairwise position. In general, by using metric learning, we can find the distance metric matrix that transforms the original space into the metric feature space. We can then detect the desired targets, especially when the samples are imbalanced and the number of target samples is very limited.
In addition, a number of metric learning methods have been proposed to learn the distance metric, such as the neighborhood component analysis (NCA) method [29] and the large margin nearest neighbor (LMNN) method [30]. For each instance, the NCA method expresses the probability of selecting instances of the same class as its neighbors, which maximizes a stochastic variant of the leave-one-out k-nearest neighbor (KNN) score on the training samples. The LMNN method aims to find a distance metric such that instances from different classes are effectively separated by a large margin within the neighborhood, where the margin is defined as the difference between the between-class and within-class distances. Furthermore, the information-theoretic metric learning (ITML) method, proposed by Davis et al. [31], expresses the weakly supervised metric learning problem as a Bregman optimization problem and can handle a variety of constraints and incorporate a priori information on the distance function.
However, the existing metric learning based methods still have some obstacles to be addressed. The major problem is that most of the methods mentioned above are global metric learning with global constraints, making decisions by comparing the Mahalanobis distance d and judging whether d is lower or higher than a fixed threshold b, which is insufficient and suboptimal. Therefore, in this paper, the ITML method, which works in a weakly supervised manner, is innovatively introduced for hyperspectral target detection with adaptive local constraints (ITML-ALC, for short). The proposed ITML-ALC method explores adaptive local constraints to relax the fixed threshold, which can be used to compute the Mahalanobis distance d and to judge whether given samples are targets by considering both b and the changes between the distances before and after metric learning. By considering local constraints and avoiding conflicting constraints, the separability between target samples and background ones can be enhanced. Besides, a non-square matrix W can be found for handling high-dimensional data by transforming the original space into a metric learning space with a low dimensionality.
Compared with existing algorithms, ITML-ALC has several obvious advantages:

1. The proposed ITML-ALC algorithm can use limited numbers of target samples to detect targets without certain assumptions, compared with traditional target detection methods.
2. ITML-ALC needs only one parameter to be adjusted, and the detection results are relatively stable over different parameter values.
3. ITML-ALC can retain the locality information and improve the detection performance by considering both the threshold and the changes between the distances before and after metric learning, whereas existing metric learning based methods use a fixed threshold to make decisions.
The rest of this paper is organized as follows. In Section 2, a brief introduction to the original ITML method is provided, and the proposed ITML-ALC method is then presented. The experimental results of the proposed method on several challenging HSIs are detailed in Section 3, followed by the discussion and conclusions in Sections 4 and 5.
2. Methods
2.1. Related Work
The ITML methodology minimizes the LogDet divergence subject to linear constraints. ITML has two key strengths. One is its ability to handle a wide variety of constraints and to optionally incorporate a priori information on the distance function; the other is that it is fast and scalable.
Suppose that we have a set of L-dimensional training samples {x_1, x_2, ..., x_n} ∈ R^{L×n}, in which n represents the number of training samples and L is the number of feature dimensions. z_{ij} ∈ {+1, −1} denotes the relationship between the training samples x_i and x_j. Considering the relationships of similarity or dissimilarity between pairs of samples, distances between samples in the same class can be constrained as similar, and those in different classes can be constrained as dissimilar. Then, we have a set of similar constraints S and a set of dissimilar constraints D, as in Equation (1):

S: ∀(x_i, x_j) ∈ S, x_i, x_j ∈ same class, z_{ij} = 1,
D: ∀(x_i, x_j) ∈ D, x_i, x_j ∈ different classes, z_{ij} = −1.  (1)
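As a concrete illustration, the constraint sets S and D of Equation (1) can be enumerated directly from class labels. The sketch below is illustrative only (the function and variable names are ours, not the paper's):

```python
from itertools import combinations

def build_constraints(y):
    """Build similar (S) and dissimilar (D) index-pair sets from labels y,
    following Equation (1): z_ij = +1 for same-class pairs, -1 otherwise."""
    S, D = [], []
    for i, j in combinations(range(len(y)), 2):
        if y[i] == y[j]:
            S.append((i, j))   # z_ij = +1
        else:
            D.append((i, j))   # z_ij = -1
    return S, D

# toy labels: 1 = target, 0 = background
y = [1, 1, 0, 0]
S, D = build_constraints(y)
```

For n labeled samples this enumerates all n(n−1)/2 pairs; in practice only a subset of constraints is used, which is where the local constraints discussed later come in.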
Metric learning aims to learn the metric matrix M, which specifies the Mahalanobis distance d_M(x_i, x_j) between any pair of samples x_i and x_j as:

d_M(x_i, x_j) = √((x_i − x_j)^T M (x_i − x_j)).  (2)
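Equation (2) maps directly into code. The following is a minimal sketch (names and data are illustrative); note that choosing M as the identity recovers the ordinary Euclidean distance:

```python
import numpy as np

def mahalanobis(x_i, x_j, M):
    """Mahalanobis distance d_M(x_i, x_j) = sqrt((x_i - x_j)^T M (x_i - x_j))."""
    diff = x_i - x_j
    return float(np.sqrt(diff @ M @ diff))

x_i = np.array([1.0, 0.0])
x_j = np.array([0.0, 1.0])

# With M = I the metric reduces to the ordinary Euclidean distance: sqrt(2).
d_euc = mahalanobis(x_i, x_j, np.eye(2))

# A non-identity PSD matrix re-weights the feature axes: here sqrt(4 + 1).
M = np.diag([4.0, 1.0])
d_M = mahalanobis(x_i, x_j, M)
```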
In order to ensure that d_M(x_i, x_j) is a meaningful distance, the learned metric matrix M must be a symmetric and positive semidefinite (PSD) matrix, which guarantees that d_M(x_i, x_j) is symmetric, non-negative, and satisfies the triangle inequality [32,33]. Considering the high dimensionality of HSIs and the fact that M is a PSD matrix, a non-square matrix W ∈ R^{L×D} (D ≪ L), defining a mapping from the high-dimensional space into a low-dimensional embedding, can be established, with M = WW^T [34–36].
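The factorization M = WW^T implies that the Mahalanobis distance under M equals the Euclidean distance after projecting samples into the D-dimensional embedding via W^T. A small numerical check of this identity, on arbitrary illustrative data:

```python
import numpy as np

rng = np.random.default_rng(0)
L, D = 5, 2                      # original and embedded dimensionality (D << L)
W = rng.standard_normal((L, D))
M = W @ W.T                      # PSD by construction

x_i = rng.standard_normal(L)
x_j = rng.standard_normal(L)
diff = x_i - x_j

# Squared Mahalanobis distance under M ...
d_sq_metric = float(diff @ M @ diff)
# ... equals the squared Euclidean distance in the embedding W^T x.
d_sq_embed = float(np.sum((W.T @ x_i - W.T @ x_j) ** 2))
```

This is why learning W instead of M both enforces positive semidefiniteness and performs dimensionality reduction at the same time.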
Given Equation (2), our objective is to find the PSD matrix M (or W) and a corresponding distance threshold b such that for any pair (x_i, x_j) ∈ S the distance between them is smaller than b, and for any pair (x_i, x_j) ∈ D the distance between them is greater than b, as described in Equation (3):

d_M(x_i, x_j) ≤ b, (x_i, x_j) ∈ S,
d_M(x_i, x_j) ≥ b, (x_i, x_j) ∈ D.  (3)
The ITML method can minimize the differential relative entropy between two multivariate Gaussians and handle a variety of constraints on the distance function via a natural information-theoretic approach. Thus, given a Mahalanobis distance parameterized by M, its corresponding multivariate Gaussian can be expressed as:

p(x; M) = (1/Z) exp(−(1/2) d_M(x, μ)),  (4)

where μ is the mean of the Gaussian and Z is a normalizing constant in Equation (4). By using this bijection, the distance between two Mahalanobis distance functions parameterized by M_0 and M can be measured by the differential relative entropy of the corresponding multivariate Gaussians:

KL(p(x; M_0) ‖ p(x; M)) = ∫ p(x; M_0) log [p(x; M_0)/p(x; M)] dx,  (5)

where M_0 is a given Mahalanobis distance function, such as the identity matrix.
In conjunction with the given pairs of similar points S and pairs of dissimilar points D, distance metric learning can be summarized as the following optimization problem:

min_M  KL(p(x; M_0) ‖ p(x; M))
subject to  d_M(x_i, x_j) ≤ b_1, (x_i, x_j) ∈ S,
            d_M(x_i, x_j) ≥ b_2, (x_i, x_j) ∈ D,  (6)

where b_1 and b_2 are the given upper and lower bounds, respectively.
Some research has shown that the differential relative entropy of the corresponding multivariate Gaussians is equivalent to the LogDet divergence between the covariance matrices [37]:

KL(p(x; M_0) ‖ p(x; M)) = (1/2) d_logdet(M_0^{−1}, M^{−1}) = (1/2) d_logdet(M, M_0),  (7)

where M_0^{−1} and M^{−1} are the covariances of the distributions.
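For reference, the LogDet (Burg) divergence between positive definite L×L matrices is d_logdet(M, M_0) = tr(M M_0^{−1}) − log det(M M_0^{−1}) − L. A minimal sketch, assuming both inputs are positive definite (names are illustrative):

```python
import numpy as np

def logdet_div(M, M0):
    """LogDet divergence d_logdet(M, M0) = tr(M M0^-1) - log det(M M0^-1) - L."""
    L = M.shape[0]
    A = M @ np.linalg.inv(M0)
    _, logdet = np.linalg.slogdet(A)
    return float(np.trace(A) - logdet - L)

I3 = np.eye(3)
d_self = logdet_div(I3, I3)        # identical matrices give divergence 0
M = np.diag([2.0, 1.0, 1.0])
d_fwd = logdet_div(M, I3)          # = 1 - log 2
d_rev = logdet_div(I3, M)          # = log 2 - 1/2 (asymmetric in its arguments)
```

The divergence is zero only when M = M_0 and is not symmetric in its two arguments, which is why the argument order (M, M_0) in Equation (7) matters.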
Taking into account that a feasible solution of Equation (6) may not exist, we incorporate slack variables ξ into Equation (6) to guarantee the existence of the metric matrix M. Thus, with Equation (7), Equation (6) can be represented as the following optimization problem:

min_{M ⪰ 0, ξ}  d_logdet(M, M_0) + γ · d_logdet(diag(ξ), diag(ξ_0))
s.t.  tr(M(x_i − x_j)(x_i − x_j)^T) ≤ ξ_{c(i,j)}, (x_i, x_j) ∈ S,
      tr(M(x_i − x_j)(x_i − x_j)^T) ≥ ξ_{c(i,j)}, (x_i, x_j) ∈ D,  (8)

where ξ_0 denotes the initialized slack variables, c(i, j) is the index of the (i, j)-th constraint, and γ is the tradeoff parameter, which controls the tradeoff between satisfying the constraints and minimizing d_logdet.
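Problem (8) is solved in the original ITML work by cyclic Bregman projections, in which each constraint induces a rank-one update of M together with a slack- and dual-variable update. The following condensed sketch follows Davis et al.'s algorithm; the toy data and variable names are ours, and this is not the paper's ITML-ALC implementation:

```python
import numpy as np

def itml(X, constraints, b_u=1.0, b_l=4.0, gamma=1.0, n_sweeps=50):
    """Cyclic Bregman projections for problem (8), after Davis et al. (ITML).

    constraints: list of (i, j, delta) with delta = +1 for (x_i, x_j) in S
    and delta = -1 for (x_i, x_j) in D; b_u / b_l initialize the slack
    variables xi (upper bound for similar pairs, lower bound for dissimilar).
    """
    M = np.eye(X.shape[1])                         # M_0 = identity
    xi = np.array([b_u if d == 1 else b_l for _, _, d in constraints])
    lam = np.zeros(len(constraints))               # dual variables
    for _ in range(n_sweeps):
        for c, (i, j, delta) in enumerate(constraints):
            v = X[i] - X[j]
            p = float(v @ M @ v)                   # current squared distance
            if p < 1e-12:
                continue
            alpha = min(lam[c], delta / 2.0 * (1.0 / p - gamma / xi[c]))
            beta = delta * alpha / (1.0 - delta * alpha * p)
            xi[c] = gamma * xi[c] / (gamma + delta * alpha * xi[c])
            lam[c] -= alpha
            Mv = M @ v
            M = M + beta * np.outer(Mv, Mv)        # rank-one update of M
    return M

# toy data: similar pairs differ along axis 0, dissimilar pairs along axis 1
X = np.array([[0.0, 0.0], [2.0, 0.0], [0.0, 1.0], [2.0, 1.0]])
constraints = [(0, 1, +1), (2, 3, +1), (0, 2, -1), (1, 3, -1)]
M = itml(X, constraints)
```

On this toy data the learned M shrinks distances along the similar-pair direction and stretches them along the dissimilar-pair direction, which is exactly the behavior Equation (8) encodes.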
2.2. Combining ITML and Adaptive Local Constraints

The ITML method uses a fixed threshold to make decisions, which makes it less effective at handling data with complex distributions even if the associated metric is correct. To address this issue, this paper proposes an adaptive local decision rule to design pairwise constraints that relax the fixed threshold for target detection. We design a local decision function f(d_{ij}) to achieve this goal, where d_{ij} is the