LOCALITY SENSITIVE DICTIONARY LEARNING FOR IMAGE CLASSIFICATION
Bao-Di Liu, Bin Shen*, Xue Li
College of Information and Control Engineering, China University of Petroleum, Qingdao 266580, China
Google Research, New York, USA
Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
thu.liubaodi@gmail.com, bshen@google.com, xue-li11@mails.tsinghua.edu.cn
ABSTRACT
In this paper, motivated by the superior performance of sparse representation based dictionary learning in image classification and by the benefit of nonlinearity in improving image representation, we propose a locality sensitive dictionary learning algorithm with a global consistency and smoothness constraint, which overcomes the restriction of linearity at relatively low cost. Specifically, the image features are partitioned into several groups in a locality sensitive way, and a global consistency regularizer is embedded into the locality sensitive dictionary learning algorithm. The proposed algorithm efficiently captures complex nonlinear structure. Experimental results on several benchmark data sets demonstrate the effectiveness of the proposed locality sensitive dictionary learning algorithm.
Index Terms— Dictionary Learning, Sparse Representation,
Locality Sensitive, Image Classification
1. INTRODUCTION
The image classification task, which aims at automatically associating images with semantic labels, has become a significant topic in computer vision. The most common framework for image classification is the discriminative model [1, 2, 3, 4, 5]. The discriminative model is built from five main steps: image feature extraction, dictionary learning, image feature coding, feature pooling, and SVM-based classification [1], where dictionary learning plays a key role. A dictionary is usually composed of visual words, which encode low-level visual information of images across different classes. The primitive versions of the vocabulary are typically learnt by k-means clustering [6]. Yang et al. [2] first introduced a sparse representation based dictionary learning algorithm and obtained state-of-the-art performance in image classification.
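As a concrete illustration of this five-step pipeline, the following minimal sketch learns a k-means vocabulary and classifies images from pooled bag-of-words histograms. The synthetic descriptors, vocabulary size, and classifier settings are placeholder choices for the example, not settings used in this paper.

# Minimal sketch of the discriminative pipeline described above:
# local feature extraction -> vocabulary (dictionary) learning via k-means ->
# feature coding (hard assignment) -> pooling (histogram) -> SVM classification.
# All data here are synthetic placeholders; a real system would use
# SIFT/HOG-like descriptors extracted from images.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)

def extract_local_features(num_descriptors=200, dim=128):
    # Stand-in for SIFT-like local descriptors of one image.
    return rng.normal(size=(num_descriptors, dim))

# Step 1: collect local descriptors from a set of "training images".
train_images = [extract_local_features() for _ in range(20)]
labels = np.array([i % 2 for i in range(len(train_images))])

# Step 2: learn the visual vocabulary (dictionary) with k-means.
vocab_size = 64
kmeans = KMeans(n_clusters=vocab_size, n_init=10, random_state=0)
kmeans.fit(np.vstack(train_images))

def encode_and_pool(descriptors):
    # Step 3: hard-assignment coding of each descriptor to its nearest word.
    words = kmeans.predict(descriptors)
    # Step 4: pooling into a normalized histogram over the vocabulary.
    hist = np.bincount(words, minlength=vocab_size).astype(float)
    return hist / max(hist.sum(), 1.0)

# Step 5: train a linear SVM on the pooled image representations.
X_train = np.array([encode_and_pool(f) for f in train_images])
clf = LinearSVC().fit(X_train, labels)
print("predicted label:", clf.predict([encode_and_pool(extract_local_features())])[0])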
This work was done when Bin Shen was with the Department of Computer Science, Purdue University, West Lafayette. This work was supported by the National Natural Science Foundation of China (Grant Nos. 61402535 and 61271407), the Natural Science Foundation for Youths of Shandong Province, China (Grant No. ZR2014FQ001), the Qingdao Science and Technology Project (No. 14-2-4-111-jch), and the Fundamental Research Funds for the Central Universities, China University of Petroleum (East China) (Grant No. 14CX02169A).

Fig. 1. Framework of the proposed LSDL. The data matrix X is partitioned into C clusters, and a dictionary is learnt for each region. The dictionary W learnt for the data matrix X is obtained by enforcing smoothness and consistency of the local dictionaries learnt for each region.

On the other hand, various sparse representation based dictionary learning algorithms have emerged, and sparse representation based dictionary learning has gradually been adopted in computer vision tasks such as image classification [7, 8, 9], image inpainting [10, 11], image super-resolution, and face recognition [12, 13]. Different from other factorization methods, such as PCA, nonnegative matrix factorization [14, 15, 16, 17, 18, 19, 20], tensor factorization [21, 22, 23, 24], and low-rank factorization [25, 26],
sparse representation based dictionary learning has the ability to represent data by a sparse linear combination of bases. Recently, more and more researchers have focused on locality-preserving dictionary learning algorithms, on the grounds that locality is more essential than sparsity [27]. Wang et al. [3] considered each word in the vocabulary as lying on a manifold so as to preserve local information. Wei et al. [28] embedded the local data structure into the sparse representation based dictionary learning algorithm. Gao et al. [29] incorporated a histogram intersection kernel based Laplacian matrix into the sparse representation based dictionary learning algorithm to enforce consistency among the sparse representations of similar local features. Zheng et al. [30] explicitly embedded a vector quantization based Laplacian matrix into the dictionary learning algorithm. Liu et al. [5] used sparse representation over k-nearest neighbors to construct a graph model that improves the accuracy and robustness of image representation.
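To make the role of such graph regularizers concrete, a typical Laplacian-regularized sparse coding objective used in this line of work can be written as below; the notation is ours and the exact formulations in [29, 30] may differ. Here X is the data matrix, W the dictionary, V the matrix of sparse codes with columns v_i, L the graph Laplacian, and \lambda, \gamma trade-off parameters:

\min_{W,V}\ \|X - WV\|_F^2 + \lambda \sum_i \|v_i\|_1 + \gamma\,\mathrm{tr}\!\left(V L V^{\top}\right)
\quad \text{s.t.}\quad \|w_k\|_2^2 \le 1,\ \forall k,

where L = D - S, S is a similarity (affinity) matrix over the samples and D its degree matrix. Since \mathrm{tr}(V L V^{\top}) = \tfrac{1}{2}\sum_{i,j} S_{ij}\|v_i - v_j\|_2^2, the regularizer forces similar features to receive similar codes.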
However, all these methods are restricted to linear or graph-embedded dictionary learning, and are thus unable to capture complex nonlinear properties. Many real-world data require nonlinearity in dictionary learning due to their distributions. For example, handwritten digits and human faces form manifolds in the feature space, and such complex nonlinear structure is hard to capture with conventional dictionary learning methods, especially when facing large data. Moreover, the kernel trick requires applying the kernel function to all pairs of samples, which is computationally expensive. To improve the nonlinearity while keeping the computation efficient, a locality sensitive dictionary learning (LSDL) method is proposed to approximate the global nonlinear dictionary and capture complex correlation structures. The assumption is that the global dictionary learning is nonlinear but locally linear, i.e. a linear dictionary learning method is applicable when
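The locally linear assumption can be illustrated with a minimal sketch: the data are partitioned into locality-sensitive clusters and a separate linear dictionary is learnt within each cluster, as in Fig. 1. This is only an illustration of the local-then-global idea; the k-means clustering, the per-cluster dictionary learner, and all parameters below are placeholder choices, and the smoothness and global consistency coupling of the actual LSDL algorithm is not included.

# Sketch of the locally linear idea behind LSDL (illustration only):
# partition the feature space into C local regions, then learn a linear
# sparse-coding dictionary inside each region, so that the collection of
# local dictionaries approximates a globally nonlinear dictionary.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import DictionaryLearning

rng = np.random.default_rng(0)
X = rng.normal(size=(600, 20))          # placeholder data matrix (samples x features)

C = 3                                    # number of local regions (clusters)
partition = KMeans(n_clusters=C, n_init=10, random_state=0).fit(X)

local_dicts = []
for c in range(C):
    Xc = X[partition.labels_ == c]       # features assigned to region c
    learner = DictionaryLearning(n_components=8, alpha=1.0, max_iter=50,
                                 random_state=0)
    codes_c = learner.fit_transform(Xc)  # sparse codes of region c (locally linear)
    local_dicts.append(learner.components_)

# Each entry of local_dicts is one local basis; the full LSDL method would
# additionally couple these bases through smoothness and global consistency.
print([d.shape for d in local_dicts])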