NeighborhoodCorrelationAnalysisforSemi-pairedTwo-ViewData资源-CSDN文库

2 浏览量 2021-02-09 00:06:57 上传评论收藏 635KB PDF 举报

资源推荐

资源详情

资源评论

1 23

Neural Processing Letters

ISSN 1370-4621

Neural Process Lett

DOI 10.1007/s11063-012-9251-z

Neighborhood Correlation Analysis for

Semi-paired Two-View Data

Xudong Zhou, Xiaohong Chen &

Songcan Chen

Neural Process Lett

DOI 10.1007/s11063-012-9251-z

Neighborhood Correlation Analysis for Semi-paired

Two-View Data

Xudong Zhou · Xiaohong Chen · Songcan Chen

Abstract Canonical correlation analysis (CCA) is a widely used technique for analyzing

two datasets (two views of the same objects). However, CCA needs that the samples of the

two views are fully-paired. Actually, we are often faced up with the semi-paired scenario

where the number of available paired samples is limited and yet the number of unpaired

samples is sufﬁcient. For such a scenario, CCA is generally prone to overﬁtting and thus

performs poorly, since its definition itself makes it only able to utilize those paired samples.

To overcome such a shortcoming, several semi-paired variants of CCA have been proposed.

However, unpaired samples in these methods are just used in the way of single-view leaning

to capture individual views’ structure information for regularizing CCA. Intuitively, using

unpaired samples in the way of two-view learning should be more natural and more attrac-

tive since CCA itself is a two-view learning method. As a result, a novel CCAs semi-paired

variant named Neighborhood Correlation Analysis (NeCA), which uses unpaired samples

in the two-view learning way, is developed through incorporating between-view neighbor-

hood relationships into CCA. The relationships are acquired through leveraging within-view

neighborhood relationships of each view’s all data (including paired and unpaired data) and

between-view paired information. Thus, it can take more sufﬁcient advantage of the unpaired

samples and then mitigate overﬁtting effectively caused by the limited paired data. Promising

X. Zhou · S. Chen (

)

College of Computer Science and Technology, Nanjing University of Aeronautics & Astronautics,

Nanjing 210016, China

e-mail: s.chen@nuaa.edu.cn

X. Zhou

e-mail: xdzhou@nuaa.edu.cn

X. Zhou

Information Engineering College, Yangzhou University, Yangzhou 225127, China

X. Chen

College of Science, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China

S. Chen

National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China

123

Author's personal copy

X. Zhou et al.

experiments results on several popular multi-view datasets show its feasibility and effective-

ness.

Keywords Canonical correlation analysis · Semi-paired learning · Two-view learning ·

Neighborhood relationship · Neighborhood correlation

1 Introduction

High-dimensional co-occurring data associated with an object frequently and abundantly

emerge in the real world. For example, an Internet web page as an object can be repre-

sented as (co-occurring) page text and links to the page, and a human can be represented as

co-occurring visual and audio contents. A lot of works have been done for analyzing this

kind of data [1–7]. Among these works, canonical correlation analysis (CCA) is one of the

most widely adopted methods [8–12].

CCA is a classical but useful multivariate statistical analysis method [13]. It aims to ﬁnd

maximally correlated projections between two sets of variables, which can be considered

as two views (views x and y) or representations of the same set of objects. However CCA

requires that such two views be fully-paired, i.e., each sample in view x should have a corre-

spondence in view y, and vice versa. Conversely, we are often faced such a scenario where

most samples in view x have no correspondences in view y, and vice versa, thus forming

the semi-paired scenario called here. For such a scenario, CCA is generally prone to overﬁt-

ting and thus performs poorly, since its definition itself makes it only suitful for the paired

scenario, so its applications are limited in the real world. Actually, abundant unpaired sam-

ples (i.e. x-andy-only samples) often contain much useful information which will beneﬁt

the learning task, just as the unlabeled samples beneﬁt semi-supervised leaning [14,15]by

exploiting the intrinsic data structure under clustering assumption or manifold assumption.

Recently, several works have concerned such new scenario [16–18]. Blaschko et al. [16]pro-

posed a semi-supervised Laplacian regularization of kernel CCA (SemiLRKCCA), which

utilizes intrinsic geometry structure of each view to regularize kernel CCA (KCCA) [19].

As a result, SemiLRKCCA can ﬁnd a set of meaningful directions which not only make the

two view’s paired samples highly correlated but also capture each view’s manifold struc-

ture. SemiCCA [17] utilizes global structure of each view’s whole training samples (paired

and unpaired samples together) to regularize CCA in order to bridge CCA and principal

component analysis (PCA) [20,21] seamlessly. Both SemiLRKCCA and SemiCCA can take

sufﬁcient advantage of unpaired samples in addition to paired samples, and consequently

achieve better results than CCA just based on the paired samples. It is necessary to mention

that the actual meaning of “semi-” in SemiLRKCCA and SemiCCA is “semi-paired” rather

than “semi-supervised” in popular semi-supervised learning literature [14,15]. Compared

with SemiLRKCCA and SemiCCA, more recent work termed as semi-paired and semi-

supervised generalized correlation analysis (S

GCA) [18] make further research for dealing

with semi-paired and semi-supervised scenario. S

GCA utilizes within-view structural infor-

mation and within-view discriminant information jointly, to preserve the individual view’s

structure of unlabeled data and separate labeled data in different classes from each other

simultaneously. Without semi-supervised information, S

GCA is similar to SemiLRKCCA

and SemiCCA.

In SemiLRKCCA, SemiCCA and S

GCA, unpaired samples are just used in the way of

single-view leaning to capture individual views’ structure information for regularizing KCCA

or CCA. Consequently, CCA and its variants (SemiLRKCCA, SemiCCA and S

GCA) only

123

Author's personal copy

剩余21页未读，继续阅读

评论收藏

内容反馈

weixin_38663197

粉丝: 8
资源: 926

Neighborhood Correlation Analysis for Semi-paired Two-View Data

Python库 | neighborhood_analysis-0.2.3-cp39-none-win_amd64.whl

PyPI 官网下载 | neighborhood_analysis-0.2.3-cp39-none-win_amd64.whl

Python库 | neighborhood_analysis-0.1.1-cp37-none-win_amd64.whl

Python库 | neighborhood_analysis-0.2.0-cp38-none-win_amd64.whl

Python库 | neighborhood_analysis-0.2.3-cp38-none-win_amd64.whl

Neighborhood rough sets based multi-label classification for automatic image annotation

Locally regularized Anchored Neighborhood Regression for fast Super-Resolution

Fuzzy Neighborhood Preserving Analysis with QR-Decomposition：使用带有QR分解的模糊判别分析的特征减少（投影）。-matlab开发

variable neighborhood search for the second type of two-sided assembly line balancing problem

A superlinearly convergent wide-neighborhood predictor-corrector interior- point algorithm for linear programming

UCSD 博士论文 Priors and Learning Based Methods for Super-Resolution

neighborhood-map-project-:纳米项目5

2019-Neighborhood Enlargement in Graph Neural Networks-网文+笔记1

Deviation-based neighborhood model for context-aware QoS prediction of cloud and IoT services

Qt 5实现串口调试助手 （源工程文件、0积分下载）

【SystemVerilog】路科验证V2学习笔记（全600页）.pdf

AutoSAR标准协议4.2.2

光伏-储能并网系统仿真.rar

XCP协议的规范文档

GD32替换STM32注意事项.pdf

NPPJSONViewer.zip

蓝牙BLE协议中文版.pdf

CANoe通过CAPL脚本实现自动测试

电路分析基础第二版PDF电子书免费下载

qt样式表一键生成（花狗Fdog）

最新资源

Qt 5实现串口调试助手（源工程文件、0积分下载）