Graph Neural Networks (GNN) [WSDM 2020].zip - CSDN Library

5 files (5 PDFs), 6.91MB ZIP, uploaded 2019-11-19 20:30:21


WSDM2020.zip (5 files)

Initialization for Network Embedding A Graph Partition Approach.pdf 733KB

Dynamic Graph Representation Learning via Self-Attention Networks.pdf 545KB

Robust Graph Neural Network Against Poisoning Attacks via Transfer Learning.pdf 1.13MB

Relation Learning on Social Networks with Multi-Modal Graph Edge Variational Autoencoders.pdf 1.7MB

A Structural Graph Representation Learning Framework.pdf 3.47MB

A Structural Graph Representation Learning Framework

Ryan A. Rossi

Adobe Research

Nesreen K. Ahmed

Intel Labs

Eunyee Koh

Adobe Research

Sungchul Kim

Adobe Research

Anup Rao

Adobe Research

Yasin Abbasi-Yadkori

VinAI

ABSTRACT

The success of many graph-based machine learning tasks highly depends on an appropriate representation learned from the graph data. Most work has focused on learning node embeddings that preserve proximity as opposed to structural role-based embeddings that preserve the structural similarity among nodes. These methods fail to capture higher-order structural dependencies and connectivity patterns that are crucial for structural role-based applications such as visitor stitching from web logs. In this work, we formulate higher-order network representation learning and describe a general framework called HONE for learning such structural node embeddings from networks via the subgraph patterns (network motifs, graphlet orbits/positions) in a node's neighborhood. A general diffusion mechanism is introduced in HONE along with a space-efficient approach that avoids explicit construction of the k-step motif-based matrices using a k-step linear operator. Furthermore, HONE is shown to be fast and efficient with a worst-case time complexity that is nearly-linear in the number of edges. The experiments demonstrate the effectiveness of HONE for a number of important tasks including link prediction and visitor stitching from large web log data.

KEYWORDS

Structural node embeddings, role-based embeddings, structural

similarity, roles, network motifs, graphlets, structural embeddings

ACM Reference Format:

Ryan A. Rossi, Nesreen K. Ahmed, Eunyee Koh, Sungchul Kim, Anup Rao,

and Yasin Abbasi-Yadkori. 2020. A Structural Graph Representation Learning

Framework. In The Thirteenth ACM International Conference on Web Search

and Data Mining (WSDM ’20), February 3–7, 2020, Houston, TX, USA. ACM,

New York, NY, USA, 9 pages. https://doi.org/10.1145/3336191.3371843

1 INTRODUCTION

Structural role discovery [ ] aims to reveal nodes with topologically similar neighborhoods while being possibly far away in the graph or even in different graphs altogether. Intuitively, two nodes belong to the same role if they are structurally similar (with respect to the general connectivity and subgraph patterns in a node's neighborhood). Roles may represent higher-order subgraph patterns (network motifs) such as star-center (hub) nodes, star-edge nodes, near-cliques or bridge nodes connecting different regions of the graph. Most work on embeddings has focused on preserving the notion of proximity (closeness) as opposed to the notion of structural similarity proposed in [ ]. As such, two nodes with similar proximity-based embeddings are guaranteed to be near one another in the graph (a property of communities [ ]). However, learning structural role-based embeddings that preserve the notion of structural similarity is important for many predictive modeling applications [ ] such as the visitor stitching task where the goal is to predict the web sessions that belong to the same user.

To address this problem and learn more appropriate structural role-based embeddings for such applications, we propose a general framework called Higher-Order Network Embeddings (HONE) for learning higher-order structural node embeddings based on network motifs (graphlets). The approach leverages all available motif counts by deriving a weighted motif graph $W_t$ from each network motif $H_t \in \mathcal{H}$ and uses these as a basis to learn higher-order structural node embeddings. The HONE framework expresses a new class of structural node embedding methods based on a set of motif-based matrices and their powers. We also introduce diffusion-based HONE variants that leverage a general diffusion mechanism to improve predictive performance. Furthermore, this work describes a space-efficient approach to avoid explicit construction of the k-step motif-based matrices by defining a k-step linear operator. The time complexity of HONE is shown to be linear in the number of edges, and HONE is therefore fast and scalable for large networks. Empirically, we investigate the HONE variants and their properties extensively in Section 4. The experiments demonstrate the effectiveness of higher-order structural embeddings for link prediction as we achieve a mean relative gain in AUC of 19% over all embedding methods and network data sets. In addition, the diffusion-based HONE variants achieve a mean gain of 1.97% in AUC over the other HONE variants. We also demonstrate the effectiveness of HONE for visitor stitching using two real-world company data sets with known ground-truth. Finally, HONE is shown to capture roles (structural similarity) as it successfully uncovers the exact role assignments in graphs with known ground-truth.

Contributions: This work makes three important contributions. First, we introduce the problem of higher-order (motif-based) network embedding. Second, we propose a general class of methods for learning such structural (role-based) embeddings via higher-order network motifs. The resulting embeddings are shown to be role-based (structural) and capture the notion of structural similarity (roles) [ ] as opposed to proximity/community-based embeddings that have been the focus of most previous work. Third, we demonstrate the effectiveness of learning structural higher-order network embeddings for link prediction and visitor stitching of web logs.

2 HIGHER-ORDER NETWORK EMBEDDINGS

This section proposes a new class of embedding models called Higher-Order Network Embeddings (HONE) and a general framework for deriving them. The class of higher-order network embedding methods is defined as follows:

Definition 1 (Higher-Order Network Embeddings). Given a network $G = (V, E)$ and a set of network motifs $\mathcal{H} = \{H_1, \ldots, H_T\}$, the goal of higher-order network embedding (HONE) is to learn a function $f : V \to \mathbb{R}^D$ that maps nodes to $D$-dimensional structural node embeddings using network motifs $\mathcal{H}$.

The particular family of higher-order structural node embeddings presented in this work is based on learning a function $f : V \to \mathbb{R}^D$ that maps nodes to $D$-dimensional embeddings using (powers of) weighted motif graphs derived from a structural motif matrix function $\Psi$. However, many other families of higher-order structural node embedding methods exist in the class of higher-order network embeddings (Definition 1).

We summarize the main steps of the HONE framework in Algorithm 1 (with the exception of attribute diffusion discussed in Section 2.6 and normalization, which are both optional). For clarity, Algorithm 1 also summarizes the organization and shows the connections between Sections 2.1-2.5.

2.1 Network Motifs

The HONE framework can use graphlets or orbits, and both can be computed in only a few seconds for very large networks; see [ ]. Recall that the term network motif is used generally in this work and may refer to graphlets or orbits (graphlet automorphisms) [ ]. A graphlet $H_t = (V_k, E_k)$ is an induced subgraph consisting of a subset $V_k \subset V$ of $k$ vertices from $G = (V, E)$ together with all edges whose endpoints are both in this subset, $E_k = \{e \in E \mid e = (u, v) \wedge u, v \in V_k\}$. Alternatively, the nodes of every graphlet can be partitioned into a set of automorphism groups called orbits [ ]. It is important to consider the position of an edge in a graphlet. For instance, an edge in the 4-node path (Figure 1) has two different unique positions, namely, an edge on the outside of the 4-node path (the 4-path-edge orbit in Figure 1) or the edge in the center of the path (the 4-path-center orbit). Each unique edge position in a graphlet is called an orbit. In this work, we use all (2-4)-vertex connected edge orbits and denote this set as $\mathcal{H}$ (Figure 1).
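The induced-subgraph definition above can be made concrete with a minimal sketch. The function name `induced_edges` and the toy edge set are hypothetical and chosen only for illustration; the code simply keeps every edge of the graph whose two endpoints both lie in the chosen vertex subset.

```python
def induced_edges(edges, nodes):
    """Return the edges of the subgraph induced by `nodes`.

    An edge (u, v) is kept only when both endpoints are in the subset,
    which mirrors the set-builder definition of E_k in the text.
    """
    keep = set(nodes)
    return {(u, v) for (u, v) in edges if u in keep and v in keep}


# Toy graph (illustrative): a triangle 0-1-2 with a pendant edge 2-3.
E = {(0, 1), (1, 2), (2, 0), (2, 3)}
E_k = induced_edges(E, {0, 1, 2})  # edges of the induced triangle
```

Choosing the subset {0, 1, 2} recovers the triangle, while a subset such as {0, 3} induces no edges at all, since nodes 0 and 3 are not adjacent.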


Figure 1: All (2-4)-vertex connected edge orbits. For graphlets with more than one edge orbit (e.g., the 4-path-edge orbit and the 4-path-center orbit), the gray edge (between the unshaded nodes) is used to distinguish between the different edge orbits of the graphlet.

Note: The term network motif is used generally to refer to either induced subgraphs/graphlets or graphlet orbits (Section 2.1).

Algorithm 1 Higher-Order Network Embedding (HONE)

Step 1: Given a network $G = (V, E)$ with $N = |V|$ nodes and a set $\mathcal{H} = \{H_1, \ldots, H_T\}$ of $T$ network motifs (Section 2.1), form the weighted motif adjacency matrices $\mathcal{W} = \{W_1, \ldots, W_T\}$ where $(W_t)_{ij}$ = # of instances of motif $H_t \in \mathcal{H}$ between node $i$ and $j$ (Section 2.2).

Step 2: Derive all k-step motif matrices for all $T$ motifs and $K$ steps: $S_t^{(k)} = \Psi(W_t^k)$, for $k = 1, \ldots, K$ and $t = 1, \ldots, T$, where $\Psi$ is a motif matrix function from Section 2.3.

Step 3: Find low-rank "local" structural node embeddings $U_t^{(k)}$ for each k-step motif matrix $S_t^{(k)}$ by solving Eq. 12 (Section 2.4).

Step 4: Concatenate all low-dimensional structural node embeddings for all $T$ network motifs and $K$ steps to obtain $Y$ (Eq. 15).

Step 5: Given $Y$, find a "global" low-dimensional rank-$D$ structural node embedding matrix $Z$ by solving Eq. 17 (Section 2.5) and return $Z \in \mathbb{R}^{N \times D}$.
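The five steps of Algorithm 1 can be traced end to end with a rough NumPy sketch. Several choices here are assumptions made only for illustration and are not taken from the paper: only two motifs are used (the edge itself and the triangle), the motif transition matrix plays the role of $\Psi$, a truncated SVD (the best low-rank fit under squared error) stands in for the optimization problems the algorithm refers to, and the k-step matrices are formed densely, which the paper explicitly avoids for large graphs. All function names are hypothetical.

```python
import numpy as np


def triangle_counts(A):
    # (A @ A)[i, j] counts common neighbors of i and j; masking with A
    # keeps the count only where the edge (i, j) exists.
    return (A @ A) * A


def transition(W):
    # Row-normalize by motif degree; rows with zero degree stay all-zero.
    deg = W.sum(axis=1)
    deg[deg == 0] = 1.0
    return W / deg[:, None]


def svd_embedding(S, dim):
    # Stand-in factorization: rank-`dim` truncated SVD of S.
    U, s, _ = np.linalg.svd(S, full_matrices=False)
    return U[:, :dim] * np.sqrt(s[:dim])


def hone_sketch(A, K=2, local_dim=2, D=2):
    motif_mats = [A, triangle_counts(A)]                 # Step 1
    local = []
    for W in motif_mats:
        P = transition(W)
        S = np.eye(len(A))
        for _ in range(K):
            S = S @ P                                    # Step 2: S = P^k
            local.append(svd_embedding(S, local_dim))    # Step 3
    Y = np.hstack(local)                                 # Step 4
    return svd_embedding(Y, D)                           # Step 5


# Toy graph (illustrative): triangle 0-1-2 with a path 2-3-4 attached.
A = np.zeros((5, 5))
for u, v in [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4)]:
    A[u, v] = A[v, u] = 1.0

Z = hone_sketch(A)  # one D-dimensional row per node
```

With two motifs, two steps and two local dimensions, the concatenated matrix has eight columns before the final reduction to $D = 2$.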

2.2 Weighted Motif Graphs

Given a network $G = (V, E)$ with $N = |V|$ nodes, $M = |E|$ edges, and a set $\mathcal{H} = \{H_1, \ldots, H_T\}$ of $T$ network motifs, we form the weighted motif adjacency matrices $\mathcal{W} = \{W_1, W_2, \ldots, W_T\}$ where

$$(W_t)_{ij} = \text{\# occurrences of motif } H_t \in \mathcal{H} \text{ that contain } (i, j) \in E$$

The weighted motif graphs differ from the original graph in two important and fundamental ways. First, the edges in each motif graph are likely to be weighted differently. This is straightforward to see as each network motif can appear at a different frequency than another arbitrary motif for a given edge. Intuitively, the edge motif weights when combined with the structure of the graph reveal important structural properties with respect to the weighted motif graph. Second, the motif graphs are often structurally different as shown in Figure 2. For instance, if edge $(i, j) \in E$ exists in the original graph $G$, but $(W_t)_{ij} = 0$ for some arbitrary motif $H_t$, then $(i, j) \notin E_t$ where $E_t$ is the edge set for motif $H_t \in \mathcal{H}$. Hence, the motif graphs encode relationships between nodes that have a sufficient number of motifs. To generalize the above weighted motif graph formulation, we replace the edge constraint that says an edge exists between $i$ and $j$ if the number of instances of motif $H_t \in \mathcal{H}$ that contain nodes $i$ and $j$ is 1 or larger, by enforcing an edge constraint that requires each edge to have at least $\delta$ motifs. In other words, different motif graphs can arise using the same motif $H_t$ by enforcing an edge constraint that requires each edge to have at least $\delta$ motifs. This is an important property of the above formulation.
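As a small illustration of both points, the following sketch builds the triangle-weighted motif graph of a toy network and then applies the minimum-count edge constraint. The helper names and the toy graph are hypothetical; the common-neighbor product used to count triangles per edge is a standard identity for simple undirected graphs, not something specific to the paper.

```python
import numpy as np


def triangle_weights(A):
    # For a simple undirected graph, the number of triangles containing
    # edge (i, j) equals the number of common neighbors of i and j.
    return (A @ A) * A


def threshold_motif_graph(W, delta=1):
    # Keep an edge only if it participates in at least `delta` motifs.
    return np.where(W >= delta, W, 0)


# Toy graph (illustrative): triangles {0,1,2} and {1,2,3} share edge (1,2);
# edge (3,4) is a pendant edge that lies in no triangle.
A = np.zeros((5, 5), dtype=int)
for u, v in [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3), (3, 4)]:
    A[u, v] = A[v, u] = 1

W_tri = triangle_weights(A)
W_tri_2 = threshold_motif_graph(W_tri, delta=2)
```

Edge (1, 2) receives weight 2 because it lies in both triangles, the other triangle edges receive weight 1, and edge (3, 4) disappears from the triangle graph even though it exists in the original graph. Raising the threshold to 2 leaves only edge (1, 2).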

2.3 Structural Motif Matrix Functions

To generalize HONE for any motif-based matrix formulation, we define $\Psi$ as a function $\Psi : \mathbb{R}^{N \times N} \to \mathbb{R}^{N \times N}$ over a weighted motif adjacency matrix $W_t \in \mathcal{W}$. Using $\Psi$ we derive

$$S_t = \Psi(W_t), \quad \text{for } t = 1, 2, \ldots, T \tag{1}$$

The term motif-based matrix refers to any motif matrix $S$ derived from $\Psi(W)$. (For convenience, $W$ denotes a weighted adjacency matrix for an arbitrary motif.) We summarize a few motif matrix functions $\Psi$ below.

• Weighted Motif Graph: Given a graph $G$ and a network motif $H_t \in \mathcal{H}$, form $W_t$ where $(W_t)_{ij}$ = number of instances of $H_t$ that contain nodes $i$ and $j$. In the case of using HONE directly with a weighted motif adjacency matrix $W$, then

$$\Psi : W \to IW \tag{2}$$

The number of paths weighted by motif counts from node $i$ to node $j$ in k-steps is given by

$$(W^k)_{ij} = \big(\underbrace{W \cdots W}_{k}\big)_{ij} \tag{3}$$

• Weighted Motif Transition Matrix: The random walk on a graph $W$ weighted by motif counts has transition probabilities

$$P_{ij} = \frac{W_{ij}}{w_i} \tag{4}$$

where $w_i = \sum_j W_{ij}$ is the motif degree of node $i$. The random walk motif transition matrix $P$ for an arbitrary weighted motif graph $W$ is defined as:

$$P = D^{-1} W \tag{5}$$

where $D = \mathrm{diag}(We) = \mathrm{diag}(w_1, w_2, \ldots, w_N)$ is an $N \times N$ diagonal matrix with the motif degree $w_i = \sum_j W_{ij}$ of each node on the diagonal, called the diagonal motif degree matrix, and $e = [1\ 1\ \cdots\ 1]^T$ is the vector of all ones. $P$ is a row-stochastic matrix with $\sum_j P_{ij} = p_i^T e = 1$ where $p_i \in \mathbb{R}^N$ is a column vector corresponding to the $i$-th row of $P$. For directed graphs, the motif out-degree is used. However, one can also leverage the motif in-degree or total motif degree (among other quantities). The motif transition matrix $P$ represents the transition probabilities of a non-uniform random walk on the graph that selects subsequent nodes with probability proportional to the connecting edge's motif count. Therefore, the probability of transitioning from node $i$ to node $j$ depends on the motif degree of $j$ relative to the total sum of motif degrees of all neighbors of $i$. The probability of transitioning from node $i$ to $j$ in k-steps is given by

$$(P^k)_{ij} = \big(\underbrace{P \cdots P}_{k}\big)_{ij} \tag{6}$$

• Weighted Motif Laplacian: The motif Laplacian for a weighted motif graph $W$ is defined as:

$$L = D - W \tag{7}$$

where $D = \mathrm{diag}(We)$ is the diagonal motif degree matrix defined as $D_{ii} = \sum_j W_{ij}$. For directed graphs, we can use either in-motif degree or out-motif degree.

• Normalized Weighted Motif Laplacian: Given a graph $W$ weighted by the counts of an arbitrary network motif $H_t \in \mathcal{H}$, the normalized motif Laplacian is defined as

$$\hat{L} = I - D^{-1/2} W D^{-1/2} \tag{8}$$

where $I$ is the identity matrix and $D = \mathrm{diag}(We)$ is the $N \times N$ diagonal matrix of motif degrees.

• Random Walk Normalized Weighted Motif Laplacian: Formally, the random walk normalized motif Laplacian is

$$\hat{L}_{rw} = I - D^{-1} W \tag{9}$$

where $I$ is the identity matrix, $D$ is the motif degree diagonal matrix with $D_{ii} = w_i$, $\forall i = 1, \ldots, N$, and $W$ is the weighted motif adjacency matrix for an arbitrary motif $H_t \in \mathcal{H}$. Observe that $\hat{L}_{rw} = I - P$ where $P = D^{-1} W$ is the motif transition matrix of a random walker on the weighted motif graph.

Notice that all variants are easily formulated as functions $\Psi$ in terms of an arbitrary motif weighted graph $W$.
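The motif matrix functions listed above translate almost line for line into NumPy. The sketch below is illustrative only: the function names are hypothetical, the matrices are dense, and every node is assumed to have a nonzero motif degree so that the divisions are well defined.

```python
import numpy as np


def motif_degree(W):
    # w_i = sum_j W_ij, i.e. the diagonal of D = diag(W e).
    return W.sum(axis=1)


def transition_matrix(W):
    # P = D^{-1} W (Eq. 5); assumes no zero motif degrees.
    return W / motif_degree(W)[:, None]


def laplacian(W):
    # L = D - W (Eq. 7).
    return np.diag(motif_degree(W)) - W


def normalized_laplacian(W):
    # I - D^{-1/2} W D^{-1/2} (Eq. 8); assumes no zero motif degrees.
    d = 1.0 / np.sqrt(motif_degree(W))
    return np.eye(len(W)) - d[:, None] * W * d[None, :]


def random_walk_laplacian(W):
    # I - D^{-1} W = I - P (Eq. 9).
    return np.eye(len(W)) - transition_matrix(W)


# Toy symmetric motif-weighted graph (illustrative values).
W = np.array([[0., 1., 1., 0.],
              [1., 0., 2., 1.],
              [1., 2., 0., 1.],
              [0., 1., 1., 0.]])

P = transition_matrix(W)
P3 = np.linalg.matrix_power(P, 3)  # 3-step transition probabilities (Eq. 6)
```

Each row of `P` and of `P3` sums to one, each row of `laplacian(W)` sums to zero, and `random_walk_laplacian(W)` equals the identity minus `P`, matching the observation after Eq. 9.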

Figure 2: Motif graphs differ in structure and weight. Panels show (a) the initial graph and (b) the weighted motif graphs for the triangle and 4-star motifs. Size (weight) of nodes and edges in the triangle and 4-star graphs corresponds to the frequency of triangles and 4-stars.

2.4 K-Step Motif-based Structural Embeddings

We describe the local higher-order structural node embeddings learned for each network motif $H_t \in \mathcal{H}$ and $k$-step where $k \in \{1, \ldots, K\}$. The term local refers to the fact that structural node embeddings are learned for each individual motif and k-step independently. We define k-step motif-based matrices for all $T$ motifs and $K$ steps as follows:

$$S_t^{(k)} = \Psi(W_t^k), \quad \text{for } k = 1, \ldots, K \text{ and } t = 1, \ldots, T \tag{10}$$

where

$$\Psi(W_t^k) = \Psi(\underbrace{W_t \cdots W_t}_{k}) \tag{11}$$

These k-step motif-based matrices can densify quickly and therefore the space required to store the k-step motif-based matrices can grow fast as $K$ increases. For large graphs, it is often impractical to store the k-step motif-based matrices for any reasonable $K$. To overcome this issue, we avoid explicitly constructing the k-step motif-based matrices entirely. Hence, no additional space is required and we never need to store the actual k-step motif-based matrices for $k > 1$. We discuss and show this for any k-step motif-based matrix later in this subsection.
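The general idea behind avoiding the explicit k-step matrix can be sketched as follows. This is an assumption-laden illustration rather than the operator defined later in the paper: it only covers the case where the k-step matrix is a plain matrix power (for example of $W$ or $P$), and the function name `kstep_matvec` is hypothetical. The product of the k-th power with a vector is obtained from k successive matrix-vector products, so only the 1-step matrix is ever stored.

```python
import numpy as np


def kstep_matvec(M, x, k):
    """Compute (M^k) @ x without forming M^k.

    Applies M to the vector k times. For a sparse M with m nonzeros this
    costs O(k * m) time and needs no storage beyond M and one vector.
    """
    y = x
    for _ in range(k):
        y = M @ y
    return y


# Toy motif-weighted graph (illustrative values) and its transition matrix.
W = np.array([[0., 1., 1., 0.],
              [1., 0., 2., 1.],
              [1., 2., 0., 1.],
              [0., 1., 1., 0.]])
P = W / W.sum(axis=1, keepdims=True)

x = np.array([1.0, 0.0, 0.0, 0.0])
y = kstep_matvec(P, x, k=3)  # same result as matrix_power(P, 3) @ x
```

Iterative solvers that only need matrix-vector products, such as Lanczos-type methods for truncated factorizations, can consume an operator of this kind directly, which is one way a k-step matrix can be factorized without ever being materialized.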

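Given a k-step motif-based matrix $S_t^{(k)}$ for an arbitrary network motif $H_t \in \mathcal{H}$, we find an embedding by solving the following optimization problem:

$$\arg\min_{U_t^{(k)},\, V_t^{(k)} \in \mathcal{C}} \; \mathbb{D}\big(S_t^{(k)} \,\|\, \Phi\langle U_t^{(k)} V_t^{(k)} \rangle\big), \quad \forall k = 1, \ldots, K \text{ and } t = 1, \ldots, T \tag{12}$$

where $\mathbb{D}$ is a generalized Bregman divergence (and quantifies $\approx$ in the HONE embedding model $S_t^{(k)} \approx \Phi\langle U_t^{(k)} V_t^{(k)} \rangle$) with matching linear or non-linear function $\Phi$, and $\mathcal{C}$ is a set of constraints (e.g., non-negativity constraints $U \geq 0$, $V \geq 0$, or orthogonality constraints $U^T U = I$, $V^T V = I$). The above optimization problem finds low-rank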
