Deep Convolutional Networks on Graph-Structured Data
Mikael Henaff
Courant Institute of Mathematical Sciences
New York University
mbh305@nyu.edu
Joan Bruna
University of California, Berkeley
joan.bruna@berkeley.edu
Yann LeCun
Courant Institute of Mathematical Sciences
New York University
yann@cs.nyu.edu
Abstract
Deep Learning’s recent successes have mostly relied on Convolutional Networks,
which exploit fundamental statistical properties of images, sounds and video data:
local stationarity and multi-scale compositional structure, which allow expressing
long-range interactions in terms of shorter, localized interactions. However,
there exist other important examples, such as text documents or bioinformatic
data, that may lack some or all of these strong statistical regularities.
In this paper we consider the general question of how to construct deep
architectures with small learning complexity on general non-Euclidean domains,
which are typically unknown and need to be estimated from the data. In particular,
we develop an extension of Spectral Networks which incorporates a Graph
Estimation procedure, which we test on large-scale classification problems,
matching or improving over Dropout Networks with far fewer parameters to estimate.
1 Introduction
In recent times, Deep Learning models have proven extremely successful on a wide variety of tasks,
from computer vision and acoustic modeling to natural language processing [9]. At the core of their
success lies an important assumption on the statistical properties of the data, namely the stationarity
and the compositionality through local statistics, which are present in natural images, video, and
speech. These properties are exploited efficiently by ConvNets [8, 7], which are designed to extract
local features that are shared across the signal domain. Thanks to this, they are able to greatly
reduce the number of parameters in the network with respect to generic deep architectures, without
sacrificing the capacity to extract informative statistics from the data. Similarly, Recurrent Neural
Nets (RNNs) trained on temporal data implicitly assume a stationary distribution.
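The parameter saving that weight sharing buys can be made concrete with a back-of-the-envelope count (a hypothetical sketch, not taken from the paper): for a 32x32 grayscale image, compare a fully connected layer with a convolutional layer whose single filter is shared across the whole signal domain.

```python
# Hypothetical illustration: parameter counts for a 32x32 image,
# fully connected layer vs. one shared convolutional filter.

H = W = 32          # image height and width
n_in = H * W        # 1024 input pixels
n_out = H * W       # same number of output units

# Fully connected: every output connects to every input, plus biases.
dense_params = n_in * n_out + n_out   # 1024*1024 + 1024 = 1_049_600

# Convolution: one shared 5x5 filter plus a single bias, reused at
# every spatial location thanks to stationarity of the grid.
k = 5
conv_params = k * k + 1               # 26

print(dense_params, conv_params)      # 1049600 26
```

Four orders of magnitude fewer parameters, without giving up the capacity to extract local statistics; this is exactly the saving that is lost when the data has no grid for the filter to translate over.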
One can think of such data examples as being signals defined on a low-dimensional grid. In this
case stationarity is well defined via the natural translation operator on the grid, locality is defined
via the metric of the grid, and compositionality is obtained from downsampling, or equivalently
thanks to the multi-resolution property of the grid. However, there exist many examples of data that
lack the underlying low-dimensional grid structure. For example, text documents represented as
bags of words can be thought of as signals defined on a graph whose nodes are vocabulary terms and
whose weights represent some similarity measure between terms, such as co-occurrence statistics. In
medicine, a patient’s gene expression data can be viewed as a signal defined on the graph imposed
by the regulatory network. In fact, computer vision and audio, which are the main focus of research
efforts in deep learning, only represent a special case of data defined on an extremely simple low-
dimensional graph. Complex graphs arising in other domains might be of higher dimension, and
the statistical properties of data defined on such graphs might not satisfy the stationarity, locality
arXiv:1506.05163v1 [cs.LG] 16 Jun 2015
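The bag-of-words graph described above can be sketched in a few lines (a hypothetical toy example, not the paper's code): nodes are vocabulary terms, and the edge weight between two terms counts the documents in which they co-occur.

```python
# Hypothetical sketch: a word-similarity graph from co-occurrence
# counts over a toy corpus. Each document is a set of terms.

from itertools import combinations
from collections import Counter

docs = [
    {"graph", "signal", "spectrum"},
    {"graph", "laplacian", "spectrum"},
    {"signal", "noise"},
]

# Count, for each unordered pair of terms, how many documents
# contain both; nonzero counts define the weighted edges.
cooc = Counter()
for doc in docs:
    for u, v in combinations(sorted(doc), 2):
        cooc[(u, v)] += 1

print(cooc[("graph", "spectrum")])   # 2: co-occur in two documents
```

A bag-of-words document is then simply a signal on this graph, one value per vocabulary node, with no grid translation operator available for defining convolutions.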