Learning Deep and Wide: A Spectral Method
for Learning Deep Networks
Ling Shao, Senior Member, IEEE, Di Wu, and Xuelong Li, Fellow, IEEE
Abstract—Building intelligent systems that are capable of extracting high-level representations from high-dimensional sensory data lies at the core of solving many computer vision tasks. We propose multispectral neural networks (MSNN) to learn features from multicolumn deep neural networks and to embed the penultimate hierarchical discriminative manifolds into a compact representation. The low-dimensional embedding exploits the complementary properties of the different views, in which the distribution of each view is sufficiently smooth, and hence remains robust even when few labeled training data are given. Our experiments show that spectrally embedding several deep neural networks better exploits the outputs of the multicolumn networks and consistently decreases the error rate compared with a single deep network.
Index Terms—Deep networks, multispectral embedding,
representation learning.
I. INTRODUCTION
Recent publications suggest that unsupervised pretraining of deep, hierarchical neural networks improves supervised pattern classification [1]–[4]. Building learning machines that automatically construct feature extractors, instead of hand-crafting them, is a broad research area in pattern recognition. The main benefit of these models is their strong generalization ability, since they automatically learn to extract salient patterns directly from the raw input, without any use of prior knowledge. Recent advances and applications using learned features have yielded excellent results in several tasks, e.g., object recognition and video sequence classification. Krizhevsky et al. [5] train a large, deep convolutional neural network (CNN) to classify images into 1000 different classes; Baccouche et al. [6] learn a sparse shift-invariant representation of the local salient information using a spatio-temporal convolutional
sparse autoencoder and classify each sequence with a long short-term memory recurrent neural network [7]. Meanwhile, various architectures and techniques have been proposed to enhance the learning capacity: a multiresolution deep belief network (DBN) [8] combines a Laplacian pyramid with deep learning to learn coarse structures from low-resolution images, leading to a better generative model; the multicolumn deep neural networks proposed by Cireşan et al. [9]–[11] use GPUs to train several deep neural columns and average the outputs of the individual networks, showing that, given enough labeled data, such networks do not need additional heuristics, such as unsupervised pretraining or carefully prewired synapses.
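As a point of reference, the averaging scheme used by conventional multicolumn networks can be summarized by the following minimal Python sketch; the per-column predict_proba interface and the numpy-based averaging are illustrative assumptions, not the exact implementation of [9]–[11].

# Hypothetical sketch of the multicolumn averaging baseline: each column is
# an independently initialized deep net, and the ensemble prediction is the
# mean of the per-column class posteriors.
import numpy as np

def average_columns(columns, x):
    """Average the class-posterior outputs of several deep-net columns."""
    probs = np.stack([net.predict_proba(x) for net in columns], axis=0)
    return probs.mean(axis=0)  # shape: (n_samples, n_classes)

def predict(columns, x):
    """Classify by taking the arg max of the averaged posteriors."""
    return average_columns(columns, x).argmax(axis=1)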
Inspired by the microcolumns of neurons in the cerebral cortex, several deep neural columns are trained to become experts, and they unfold their potential when the architecture is wide. Conventional multicolumn deep neural networks average the prediction outputs under the assumption that enough labeled training data are available and that each individual network is close to the global optimum. However, simple output averaging may not reach the model's optimum if only few labeled data are provided. As indicated in [9], the columns unfold their potential when they are wide; yet, if the labeled training instances are few, i.e., fine-tuning information is scarce, the deep networks can suffer from overfitting. Such a setting is pervasive in real-world applications, such as gender prediction (Section IV-C), where randomized, controlled experiments may be costly, unethical, and intrusive.
In this brief, we show how combining several deep network columns as basic building blocks into a multicolumn deep net and embedding their spectral relationships can further enhance robustness and hence decrease the error rate. We define the wide deep net as the juxtaposition of multiple randomly initialized, nonconvex deep nets, and refer to the proposed architecture as multispectral neural networks (MSNN). The multicolumn procedure can easily be implemented in a parallelized, multithreaded fashion, so MSNN requires no significant extra training time. Our architecture achieves this by combining several techniques in a novel way, as sketched below and detailed in the two points that follow.
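As a rough illustration only, the following minimal Python sketch gives one deliberately simplified reading of the pipeline: penultimate-layer activations of all columns are concatenated and then spectrally embedded into a compact representation. The penultimate_features hook, the use of scikit-learn's SpectralEmbedding (Laplacian eigenmaps), and the chosen dimensions are illustrative assumptions, not the exact procedure of this brief.

# A minimal, hypothetical sketch: stack penultimate-layer features from
# several deep-net columns (the "wide" view) and embed them spectrally into
# a low-dimensional representation.
import numpy as np
from sklearn.manifold import SpectralEmbedding

def multicolumn_features(columns, x):
    """Concatenate penultimate-layer activations of all columns."""
    feats = [net.penultimate_features(x) for net in columns]  # assumed hook
    return np.concatenate(feats, axis=1)  # (n_samples, total feature dim)

def msnn_embedding(columns, x, n_components=32, n_neighbors=10):
    """Spectrally embed the multicolumn feature space (Laplacian eigenmaps)."""
    wide = multicolumn_features(columns, x)
    embedder = SpectralEmbedding(n_components=n_components,
                                 n_neighbors=n_neighbors,
                                 affinity="nearest_neighbors")
    return embedder.fit_transform(wide)  # compact multispectral representation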
1) We encourage the neural networks to learn deep models that reuse intermediate features to extract more abstract representations, ones that are more correlated with the underlying causes generating the data; we therefore utilize the penultimate layer of the hierarchy as our intermediate feature space, in contrast to the common paradigm that outputs the top predictor layer (also known as the softmax output layer). Such nets can be DBNs or CNNs with fully connected penultimate layers.
2) Our architecture encourages the networks to learn wide, i.e., horizontally, exploring the feature space while admitting the stochasticity of the deep nets and yielding a mixture-of-experts style field. Unlike the conventional multicommittee systems that extract only the trivial 1-D winner-take-all regions, that is, the top part of the hierarchy