一种基于稀疏表示的图像标签显着性排序算法资源-CSDN文库

194 浏览量 2021-03-03 13:57:40 上传评论收藏 416KB PDF 举报

本文提出了一种基于稀疏表示的图像标签显着性排序算法，该算法通过多示例学习（Multi-instance Learning）将图像级别的标签传播到区域级别，并利用视觉注意模型对现有标签进行显着性分析。这种方法与现有的图像标签排序方法相比，能够取得更好的效果，并展示出更好的性能。稀疏表示是机器学习和信号处理领域的一个重要概念，它基于这样一个假设：一个信号可以通过少量元素的线性组合进行表示，这些元素中大部分都是零值，因此称之为“稀疏”。在图像处理中，稀疏表示可以通过特定算法如正交匹配追踪（Orthogonal Matching Pursuit）或者基追踪（Basis Pursuit）来实现。这种方法在图像去噪、图像超分辨率等领域中得到广泛应用。多示例学习是一种机器学习框架，它主要针对这样的情况：一个训练样本包含多个实例（子样本），但只有样本整体的标签，而单个实例的标签是未知的。这种学习方式特别适合于那些无法将标签精确对应到每个实例的情况，比如文本分类或图像分类中，一个文档或图片可能涉及到多个主题或类别，但通常只会被标记为一种类别。在图像标签排序的上下文中，多示例学习允许通过稀疏表示将图像级别的标签转化为更细粒度的区域级别的标签。 Diverse Density是一种多示例学习算法，它可以用来确定哪些实例共同出现在正面的包（bag）中，并通过稀疏组合的方式重构出目标实例。在本研究中，Diverse Density被用来对图像中的区域进行显着性分析，通过非零重构系数识别出与目标实例相似的实例。视觉注意模型（Visual Attention Model）是一个被广泛研究的人工智能模型，它试图模拟人类的视觉注意力机制。该模型基于人类视觉系统的工作原理，即我们倾向于关注图像中的一些特定区域，而忽略其他区域。在图像处理中，视觉注意模型可以用来识别图像中的关键区域，从而提取图像的显着特征或进行图像分割。在图像标签排序的研究领域，这些方法被用来提高图像检索的准确性。图像检索是多媒体信息处理中的一个重要领域，它涉及从大量图像数据中检索出符合用户查询条件的图像。由于网络用户的文化背景、知识结构和关注点的不同，同一个或相似的图像会被赋予完全不同的标签。因此，网络上的图像标签列表往往是不准确和杂乱无章的。基于稀疏表示的图像标签显着性排序算法的提出，为图像标签的排序和检索提供了一种新的解决方案，从而提高图像检索的质量。本文提出的算法结合了稀疏表示、多示例学习和视觉注意模型的原理，旨在通过改进传统图像标签排序方法，以更有效地对图像标签列表进行排序，从而提高图像检索的效率和准确性。这一研究工作不仅促进了多媒体信息处理领域的发展，也为图像检索系统的优化提供了新的思路和方法。

资源推荐

资源详情

资源评论

A NOVEL IMAGE TAG SALIENCY RANKING ALGORITHM BASED ON SPARSE

REPRESENTATION

caixia Wang, Beijing Jiaotong University, 11120480@bjtu.edu.cn

zehai Song, Beijing Jiaotong University, zhsong@bjtu.edu.cn

songhe Feng, Beijing Jiaotong University, shfeng@bjtu.edu.cn

congyan Lang, Beijing Jiaotong University, cylang@bjtu.edu.cn

shuicheng Yan, National University of Singapore, eleyans@nus.edu.cn

ABSTRACT

As the explosive growth of the web image data, image tag

ranking used for image retrieval accurately from mass

images is becoming an active research topic. However, the

existing ranking approaches are not very ideal, which

remains to be improved. This paper proposed a new image

tag saliency ranking algorithm based on sparse

representation. we firstly propagate labels from image-level

to region-level via Multi-instance Learning driven by sparse

representation, which means reconstructing the target

instance from positive bag via the sparse linear combination

of all the instances from training set, instances with nonzero

reconstruction coefficients are considered to be similar to

the target instance; then visual attention model is used for

tag saliency analysis. Comparing with the existing

approaches, the proposed method achieves a better effect

and shows a better performance.

Index Terms — tag saliency ranking, sparse

representation, multi-instance learning, Diverse Density,

visual attention model

1. INTRODUCTION

In recent year, as the rapid development of multimedia

information and internet technology, there appears many

online image sharing sites, such as Flickr, Facebook etc,

which not only allow users to upload their own photos, but

allow users to add tags manually to explain and describe the

content of the image. However, because of the difference of

the network user’s cultural background, knowledge

structure and concerns, different people will add very

different tags on the same or similar images. So image label

listing on the network is often inaccurate and disorder and

far from satisfied with the quality of image retrieving.

Consequently, image tag ranking is becoming an active

research field of multimedia information in recent years. In

this paper, for reordering the original tag list more

effectively, we improve the conventional tag saliency

ranking algorithm mentioned by S.Feng et al. in [1] via

proposing a new tag saliency ranking algorithm based on

sparse representation.

Existing main image tag ranking algorithm is roughly

divided into two categories, including the tag relevance

ranking algorithm and tag saliency ranking algorithm. As to

tag relevance ranking, it mainly reorder labels list based on

the relevancy between label and image. Li et al.

[2]proposed it based on neighbor voting, we firstly find k

nearest neighbor images set based on low-level features;

then calculate the frequency of the tag list of the given

image appear in the k nearest images, finally reorder the

given label list according to the frequency. In addition, Liu

et al. also implements the tag relevance ranking through the

fusion of Kernel Density Estimation (KDE) [3] and random

walk [3, 10] algorithm, firstly, we estimate the initial

correlation score of the each label through the probability

model established for it; then modify the relevance score

through random-walk algorithm, finally reorder the given

label list according to the relevance score. As to tag saliency

ranking, S.Feng et al. proposed it in [1]. Because of the

saliency degree of the tag is highly relies on the saliency

information of the corresponding regions, how to build a

semantic mapping between the regions of a given image

and the associated tags is an important issue. However, in

most annotated image databases and photo sharing websites,

tags are usually associated with image instead of individual

regions, and MIL (multi-instance learning) as a

generalized-supervised learning algorithm can well deal

with the situation that tags spread from image-level to

region-level. So, given an image, firstly, Multi-instance

Learning (MIL) algorithm is used to propagate the tags

from image level to region level; then using visual attention

model to analyze saliency degree of each segment, finally

disorder it according to the saliency degree of the

corresponding area. However, the tag relevance ranking

algorithm is mainly for large-scale data, but not very ideal

for small data. Although the tag saliency ranking algorithm

manages to deal with small data, the efficiency and

accuracy of it is not very ideal. Since the tag saliency

ranking algorithm relies heavily on the instance prototypes

selected through MIL algorithm, we consider improving the

accuracy of this algorithm in choosing instance prototypes.

In this paper, we proposed a new image tag saliency

ranking algorithm based on sparse representation, which

improves in the generation details of instance prototype

compared with the existing tag saliency ranking algorithm.

During the instance prototypes selection, to avoid the

limitation from one-to-one similarity measure in the context

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余4页未读，立即下载

评论收藏

内容反馈

weixin_38623707

粉丝: 5
资源: 923

一种基于稀疏表示的图像标签显着性排序算法

基于稀疏表示和标签传播的显著性检测算法

一种基于分析稀疏表示的图像重建算法

基于稀疏表示的高光谱图像分类omp算法demo

基于稀疏表示的图像超分辨率重建快速算法

图像标签排序算法研究

层次分析matlab代码-Code-for-HSCS-Method:HSCS：基于分层稀疏性的RGBD图像共显着性检测，IEEETMM，201

一种基于群稀疏特征选择的图像检索方法

基于OpenCV的图像主成分分析

基于编码复杂度的混合结构稀疏人脸识别方法.pdf

一种基于结构稀疏的图像修复算法

一种基于稀疏系数匹配学习的图像去雾算法

一种基于L0稀疏约束的图像滤波算法

基于谱聚类和稀疏表示的高光谱图像分类算法

基于稀疏斑块表示的标签融合方法在脑部MRI图像分割中的应用

具有视图相关概念表示的图像标签优化

基于稀疏超完备表示的目标检测算法

基于稀疏分解的交通图像压缩

显着性等级：一种新的显着对象检测方法

基于图像旋转的迭代式CT重建算法软件工程研究.docx

最新资源