Source code for self-supervised graph embedding learning on heterogeneous information networks, implemented in Python (.zip) - CSDN Library

25 files in total

py: 12

pyc: 9

rar: 1

Graduation project

Course project

Course assignment

Project source code

166 views, uploaded 2023-11-28 12:08:54, 3.51MB ZIP

Archive contents of 基于Python实现的在异质信息网络上的自监督图嵌入学习源码.zip (25 files):

Heterogeneous Graph Information Bottleneck.pdf 1.48MB
environment 112B
data.rar 2.13MB
main.py 6KB
说明.MD 1KB
utils/
    input_data.py 3KB
    node_cluster.py 3KB
    functions.py 3KB
    args.py 179B
    __pycache__/
        input_data.cpython-36.pyc 2KB
        node_cluster.cpython-36.pyc 3KB
        args.cpython-36.pyc 351B
        functions.cpython-36.pyc 2KB
        node_classification.cpython-36.pyc 2KB
    node_classification.py 2KB
    visualization.py 2KB
models/
    gcn.py 948B
    hgib.py 2KB
    mi_estimator.py 736B
    logreg.py 587B
    encoder.py 542B
    __pycache__/
        encoder.cpython-36.pyc 1KB
        logreg.cpython-36.pyc 1KB
        mi_estimator.cpython-36.pyc 991B
        gcn.cpython-36.pyc 1KB

Heterogeneous Graph Information Bottleneck

Liang Yang^{1,2,3}, Fan Wu, Zichen Zheng, Bingxin Niu^{1,3}, Junhua Gu^{1,3}, Chuan Wang, Xiaochun Cao and Yuanfang Guo^{4∗}

^1 School of Artificial Intelligence, Hebei University of Technology, Tianjin, China

^2 State Key Laboratory of Information Security, Institute of Information Engineering, CAS, Beijing, China

^3 Hebei Province Key Laboratory of Big Data Calculation, Hebei University of Technology, China

^4 Beijing Advanced Innovation Center for Big Data and Brain Computing, School of Computer Science and Engineering, Beihang University, Beijing, China

yangliang@vip.qq.com, andyguo@buaa.edu.cn

Abstract

Most attempts to extend Graph Neural Networks (GNNs) to Heterogeneous Information Networks (HINs) implicitly assume that the multiple homogeneous attributed networks induced by different meta-paths are complementary. Doubts about this complementarity hypothesis motivate an alternative assumption of consensus. That is, the aggregated node attributes shared by multiple homogeneous attributed networks are essential for node representations, while those specific to each homogeneous attributed network should be discarded. In this paper, a novel Heterogeneous Graph Information Bottleneck (HGIB) is proposed to implement the consensus hypothesis in an unsupervised manner. To this end, the information bottleneck (IB) is extended to unsupervised representation learning by leveraging a self-supervision strategy. Specifically, HGIB simultaneously maximizes the mutual information between one homogeneous network and the representation learned from another homogeneous network, while minimizing the mutual information between the specific information contained in one homogeneous network and the representation learned from this homogeneous network. Model analysis reveals that the two extreme cases of HGIB correspond to the supervised heterogeneous GNN and the infomax on homogeneous graphs, respectively. Extensive experiments on real datasets demonstrate that the consensus-based unsupervised HGIB significantly outperforms most semi-supervised SOTA methods based on the complementary assumption.

1 Introduction

Heterogeneous Information Networks (HINs) possess the advantage of modeling rich relations in the real world compared to homogeneous networks, which have been well studied by researchers from mathematics, physics and computer science [Shi et al., 2017; Wang et al., 2020]. Thus, by effectively exploiting these multiple relations via meta-paths, HINs provide more clues for accurate network analysis, e.g. network embedding [Dong et al., 2017], and have been successfully applied to recommendation systems [Shi et al., 2019], natural language processing [Hu et al., 2019] and knowledge graphs.

∗ Corresponding author.

Graph neural networks (GNNs) [Wu et al., 2021], especially graph convolutional neural networks (GCNNs) [Kipf and Welling, 2017; Bruna et al., 2014], have become a powerful tool for homogeneous attributed network embedding. Their success can be attributed to Laplacian smoothing [Li et al., 2018] from the spatial perspective or low-pass filtering [Wu et al., 2019] from the spectral perspective.

Recent attempts extend GNNs to heterogeneous information networks [Wang et al., 2019; Fu et al., 2020; Yun et al., 2019; Hu et al., 2020]. Most of them follow the same pipeline: a heterogeneous attributed network with multiple relations is transformed into multiple homogeneous attributed networks via meta-paths, and the embeddings obtained by GNNs on these homogeneous attributed networks are then combined. The supervision information is utilized to learn both the mapping from node features to labels and the way to combine the multiple embedding results [Wang et al., 2019; Yun et al., 2019]. These semi-supervised methods implicitly assume that the multiple homogeneous attributed networks induced by different meta-paths are complementary. That is, the information contained in each homogeneous attributed network is insufficient to represent nodes, so multiple homogeneous attributed networks are necessary to complete the information.

Here, the assumption of complementarity is investigated. The doubts about this hypothesis stem from both the characteristics of the homogeneous attributed networks and the nature of the adopted GNNs. First, the homogeneous attributed networks induced by meta-paths are not independent. In fact, they share the same node attributes (features) and differ only in network topology. Second, the essence of GNNs, which are applied to each homogeneous attributed network, is attribute smoothing according to the topology, i.e., discarding noise. Based on these two characteristics, the same attributes are smoothed according to the different topologies of multiple homogeneous networks. Thus, the smoothed node attributes in each homogeneous attributed network may not be significantly different.

Therefore, contrary to the hypothesis of complementarity, an alternative assumption is consensus, where the aggregated node attributes shared by multiple homogeneous attributed networks are essential for node representations. In other words, to seek robust node representations, the aggregated node attributes that are specific to each homogeneous attributed network should be discarded. This assumption shares a common philosophy with consensus clustering. Note that this alternative assumption reduces the requirement for labels, and is thus more suitable for unsupervised tasks.

In this paper, a novel Heterogeneous Graph Information Bottleneck (HGIB) is proposed to implement the consensus hypothesis in an unsupervised manner. To this end, the information bottleneck (IB), which has been widely used in supervised tasks, is extended to unsupervised representation learning by leveraging a self-supervision strategy. That is, each induced homogeneous attributed network is utilized as the self-supervision information for the representation learning task on the other induced homogeneous attributed networks. Specifically, HGIB simultaneously maximizes the mutual information between the representation learned from one homogeneous network and another homogeneous network, and minimizes the mutual information between the specific information contained in one homogeneous network and the representation learned from this homogeneous network. The model analysis reveals that HGIB degenerates to the supervised heterogeneous GNN or the infomax on homogeneous graphs, respectively, if the two adopted meta-paths are extremely similar or dissimilar.

The main contributions are summarized as follows.

• We investigate the widely-adopted complementary assumption in designing GNNs for HINs, and present an alternative one, i.e., the consensus hypothesis, which is more suitable for unsupervised tasks.

• We propose a well-behaved Heterogeneous Graph Information Bottleneck (HGIB) by leveraging a self-supervised learning strategy, which facilitates the adoption of the information bottleneck for unsupervised tasks.

• We reveal that the two extreme cases of HGIB correspond to the supervised heterogeneous GNN and the infomax on homogeneous graphs, respectively.

• Extensive experiments demonstrate that the consensus-based unsupervised HGIB significantly outperforms most semi-supervised SOTA methods based on the complementary assumption.

2 Preliminaries

2.1 Heterogeneous Information Network

A heterogeneous information network (HIN) [Sun and Han, 2012], denoted as $G = (V, E, \phi, \varphi)$, consists of a node set $V$ and a link set $E$ associated with a node type mapping function $\phi : V \mapsto \mathcal{T}$ and a link type mapping function $\varphi : E \mapsto \mathcal{R}$, respectively. In the network, each object $v \in V$ belongs to one specific object type $\phi(v) \in \mathcal{T}$ and each link $e \in E$ belongs to a specific relation $\varphi(e) \in \mathcal{R}$, where $|\mathcal{T}| + |\mathcal{R}| > 2$. A meta-path $P$ of length $l$ is denoted in the form of $T_1 \xrightarrow{R_1} T_2 \xrightarrow{R_2} \cdots \xrightarrow{R_l} T_{l+1}$, which defines a composite relation $R = R_1 \circ R_2 \circ \cdots \circ R_l$ between types $T_1$ and $T_{l+1}$, with $\circ$ standing for the composition operator on relations.
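As an illustration of the composite relation defined by a meta-path, the following Python sketch composes relation matrices along a path such as paper-author-paper. The function name `metapath_adjacency`, the toy incidence matrices, and the choice to binarize path counts and drop self-paths are assumptions made for this example; they are not taken from the archive's source code.

```python
import numpy as np


def metapath_adjacency(relations):
    """Compose relation matrices along a meta-path T_1 -> T_2 -> ... -> T_{l+1}.

    relations[i] is a 0/1 matrix of shape (|T_{i+1}|, |T_{i+2}|) in 1-based
    type numbering, i.e. the i-th relation on the path.
    Returns a 0/1 adjacency between nodes of type T_1 and type T_{l+1}.
    """
    counts = relations[0]
    for rel in relations[1:]:
        counts = counts @ rel          # number of path instances so far
    adj = (counts > 0).astype(int)     # assumption: keep only reachability
    if adj.shape[0] == adj.shape[1]:
        np.fill_diagonal(adj, 0)       # assumption: ignore trivial self-paths
    return adj


# Toy ACM-like data (hypothetical): 3 papers, 2 authors, 2 subjects.
paper_author = np.array([[1, 0],
                         [1, 1],
                         [0, 1]])
paper_subject = np.array([[1, 0],
                          [0, 1],
                          [1, 0]])

# Meta-paths paper-author-paper and paper-subject-paper.
pap = metapath_adjacency([paper_author, paper_author.T])
psp = metapath_adjacency([paper_subject, paper_subject.T])
print(pap)
print(psp)
```

Both resulting graphs are defined over the same set of paper nodes, so they can share one attribute matrix while differing in topology.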

2.2 Information Bottleneck

The discriminative ability of a representation can be measured by the amount of label information that remains accessible after encoding the data, which is known as sufficiency [Achille and Soatto, 2018]. A representation $h$ of data $x$ is sufficient for the label $y$ if and only if $I(x; y \mid h) = 0$. That is, the amount of information regarding the task is unchanged by the encoding procedure, i.e.

$$I(x; y) = I(h; y), \tag{1}$$

where $I(\cdot\,;\cdot)$ stands for the mutual information. To make the representation robust (i.e., to improve generalization), the Information Bottleneck (IB) principle [Tishby et al., 2000] attempts to discard all information from the input that is not helpful for a given task. To this end, IB [Alemi et al., 2017] directly minimizes the mutual information between the data $x$ and its representation $h$, $I(x; h)$, while at the same time maximizing the mutual information between $h$ and the label $y$, $I(y; h)$. Its objective function can be formulated as follows

$$\mathcal{L}_{\mathrm{IB}}(\theta) = I(y; h) - \beta I(x; h), \tag{2}$$

where $\theta$ denotes the parameters of the representation encoder $p_\theta(h \mid x)$ and $\beta$ controls the tradeoff. The second term $I(x; h)$ can be subdivided into two components by using the chain rule of mutual information as

$$I(x; h) = I(x; h \mid y) + I(y; h), \tag{3}$$

where the second term $I(y; h)$ does not depend on the choice of representation, since $h$ is sufficient for $y$ and thus $I(y; h) = I(x; y)$ by Eq. (1). The first term $I(x; h \mid y)$ represents the information in $h$ that is not predictive of $y$, i.e. superfluous information. Therefore, minimizing the mutual information $I(x; h)$ is equivalent to minimizing the superfluous information $I(x; h \mid y)$ [Federici et al., 2020], and the objective of IB in Eq. (2) can be reformulated as

$$\mathcal{L}_{\mathrm{IB}}(\theta) = I(y; h) - \beta I(x; h \mid y). \tag{4}$$

Note that maximizing the IB objective can be done directly only in supervised settings, i.e. when $y$ is given.
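The decomposition in Eq. (3) can be checked numerically on a small discrete example. The sketch below builds a toy joint distribution in which $h$ depends on $y$ only through $x$ (as is the case for an encoder of $x$) and evaluates the three mutual-information terms. The probability tables and the helper names `entropy` and `mutual_info` are made up for illustration.

```python
import numpy as np


def entropy(p):
    """Shannon entropy in bits of a probability vector."""
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())


def mutual_info(joint, a, b, cond=()):
    """I(A; B | C) in bits; a, b, cond are tuples of axes of `joint`."""
    def h(axes):
        axes = tuple(sorted(set(axes)))
        if not axes:
            return 0.0
        drop = tuple(i for i in range(joint.ndim) if i not in axes)
        marginal = joint.sum(axis=drop) if drop else joint
        return entropy(marginal.ravel())
    # I(A;B|C) = H(A,C) + H(B,C) - H(A,B,C) - H(C)
    return h(a + cond) + h(b + cond) - h(a + b + cond) - h(cond)


# Hypothetical tables: label y -> data x -> representation h.
p_y = np.array([0.5, 0.5])
p_x_given_y = np.array([[0.7, 0.2, 0.1],
                        [0.1, 0.3, 0.6]])
p_h_given_x = np.array([[0.9, 0.1],
                        [0.5, 0.5],
                        [0.2, 0.8]])

# Joint p(y, x, h) with axes ordered (y, x, h).
joint = (p_y[:, None, None]
         * p_x_given_y[:, :, None]
         * p_h_given_x[None, :, :])

Y_AX, X_AX, H_AX = (0,), (1,), (2,)
i_xh = mutual_info(joint, X_AX, H_AX)
i_yh = mutual_info(joint, Y_AX, H_AX)
i_xh_given_y = mutual_info(joint, X_AX, H_AX, cond=Y_AX)

print("I(x;h)           =", i_xh)
print("I(x;h|y) + I(y;h) =", i_xh_given_y + i_yh)
```

The two printed values agree up to floating-point error, and $I(x; h \mid y)$ is the superfluous part that the second term of Eq. (4) penalizes.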

3 Heterogeneous Graph Information Bottleneck

In this section, the Heterogeneous Graph Information Bottleneck (HGIB) is proposed. First, the assumption and the overview are provided by casting unsupervised heterogeneous graph neural network learning as a self-supervised task. Then, the formula of the self-supervised information bottleneck is introduced based on the supervised one in Sec. 2.2. Finally, the objective function and the optimization are elaborated.

Figure 1: The illustration of the proposed Heterogeneous Graph Information Bottleneck (HGIB) and its objective function. (a) Framework overview. (b) Objective function.

3.1 Assumption and Overview

There may exist multiple relations between each pair of nodes in a heterogeneous network, induced by different meta-paths. Taking ACM as an example, each pair of papers can be connected by the same author or the same subject. Contrary to the complementary assumption taken by most existing GNNs for heterogeneous networks, an alternative assumption, i.e., consensus, is investigated here. The consensus assumption considers that the learned node representations shared by multiple sub-graphs are essential for node representations. In other words, to seek robust node representations, the node representations that are specific to each sub-graph should be discarded.

To implement the consensus assumption, the Heterogeneous Graph Information Bottleneck (HGIB) is proposed. Its illustration is shown in Fig. 1. In the heterogeneous graph, circle, square and triangle denote three kinds of nodes, while solid, dashed and dotted lines stand for three kinds of edges. First, the heterogeneous graph $G = (V, E, \phi, \varphi)$ is decomposed into two sub-graphs $G^{(1)} = (\hat{V}, E^{(1)})$ (upper graph) and $G^{(2)} = (\hat{V}, E^{(2)})$ (lower graph) according to the meta-paths “circle-square-circle” and “circle-triangle-circle”, respectively, where $\hat{V}$ represents the set of nodes of circle type. The adjacency matrices of these two sub-graphs are denoted as $\mathbf{A}^{(1)}$ and $\mathbf{A}^{(2)}$. The attributes of the nodes of circle type are collected in matrix $\mathbf{X}$, and $x$ is employed to represent the original attributes of one node.

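Here, the widely-adopted GCN [Kipf and Welling, 2017] is adopted as the encoder to obtain the node representations on the two sub-graphs, as shown in the four gray boxes in Fig. 1, where the light gray boxes and dark gray boxes represent the propagations without learnable parameters and the trainable mapping functions, respectively. The formula is as follows

$$\mathbf{H}^{(1)} = p(\mathbf{H}^{(1)} \mid \mathbf{V}^{(1)}) = \sigma\big(\mathbf{V}^{(1)}\Theta^{(1)}\big) = \sigma\Big(\big(\tilde{\mathbf{D}}^{(1)}\big)^{-\frac{1}{2}} \tilde{\mathbf{A}}^{(1)} \big(\tilde{\mathbf{D}}^{(1)}\big)^{-\frac{1}{2}} \mathbf{X}\Theta^{(1)}\Big),$$

$$\mathbf{H}^{(2)} = p(\mathbf{H}^{(2)} \mid \mathbf{V}^{(2)}) = \sigma\big(\mathbf{V}^{(2)}\Theta^{(2)}\big) = \sigma\Big(\big(\tilde{\mathbf{D}}^{(2)}\big)^{-\frac{1}{2}} \tilde{\mathbf{A}}^{(2)} \big(\tilde{\mathbf{D}}^{(2)}\big)^{-\frac{1}{2}} \mathbf{X}\Theta^{(2)}\Big),$$

where $\tilde{\mathbf{A}} = \mathbf{A} + \mathbf{I}$ stands for the adjacency matrix with self-loops, $\tilde{\mathbf{D}}$ denotes the degree matrix of $\tilde{\mathbf{A}}$ with the node degrees as its diagonal elements, $\mathbf{V} = \tilde{\mathbf{D}}^{-\frac{1}{2}} \tilde{\mathbf{A}} \tilde{\mathbf{D}}^{-\frac{1}{2}} \mathbf{X}$ stands for the representations after propagation but without learnable parameters, and $\Theta$ represents the learnable parameters (the matrices with superscripts $\cdot^{(1)}$ and $\cdot^{(2)}$ correspond to sub-graphs $G^{(1)}$ and $G^{(2)}$, respectively). Besides, $v$ and $h$, which are the rows of $\mathbf{V}$ and $\mathbf{H}$, respectively, are used to represent the attributes after propagation and the final representation corresponding to one node. $\sigma(\cdot)$ denotes the nonlinear mapping function, such as ReLU or softmax.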
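The two-step encoder described above (parameter-free propagation followed by a trainable mapping) can be sketched in a few lines of NumPy. This is only a minimal illustration under stated assumptions: the function names `normalized_propagation` and `gcn_encode`, the toy adjacency matrices, the random attributes and weights, and the choice of ReLU for $\sigma$ are all hypothetical, and the code is not the `gcn.py` shipped in the archive.

```python
import numpy as np


def normalized_propagation(adj, features):
    """Return V = D~^{-1/2} (A + I) D~^{-1/2} X (no learnable parameters)."""
    a_tilde = adj + np.eye(adj.shape[0])      # add self-loops
    deg = a_tilde.sum(axis=1)                 # degrees are >= 1 thanks to self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    a_hat = a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return a_hat @ features


def gcn_encode(adj, features, theta):
    """Return (H, V) with H = ReLU(V @ theta)."""
    v = normalized_propagation(adj, features)
    h = np.maximum(v @ theta, 0.0)            # assumption: sigma is ReLU
    return h, v


rng = np.random.default_rng(0)

# Two toy sub-graphs over the same 3 nodes (hypothetical meta-path graphs).
a1 = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
a2 = np.array([[0, 0, 1], [0, 0, 0], [1, 0, 0]], dtype=float)

x = rng.normal(size=(3, 4))                   # shared node attributes X
theta1 = rng.normal(size=(4, 2))              # separate parameters per sub-graph
theta2 = rng.normal(size=(4, 2))

h1, v1 = gcn_encode(a1, x, theta1)
h2, v2 = gcn_encode(a2, x, theta2)
print(h1.shape, h2.shape)
```

Both encoders consume the same attribute matrix and differ only in the topology used for smoothing, which is the setting the consensus assumption builds on.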

According to the consensus assumption mentioned above, both $v^{(1)}$ (light orange box) and $v^{(2)}$ (light green box) share some common and inherent characteristics (yellow part) and possess specific characteristics (dark orange and dark green components), as shown in Fig. 1. Thus, we would like to learn the representation $h^{(1)}$ (or $h^{(2)}$) from $v^{(1)}$ (or $v^{(2)}$) that discards as much information as possible without losing any label information. In the next subsection, the IB for supervised tasks provided in Sec. 2.2 will be extended to a self-supervised one for discarding as much specific information as possible in the heterogeneous graph information bottleneck.

3.2 Self-supervised Information Bottleneck

In this section, the assumption and overview are formalized by extending the supervised IB of Sec. 2.2 to a self-supervised one. First, the consensus assumption can be formalized as redundancy: $v^{(1)}$ is redundant with respect to $v^{(2)}$ for $y$ if and only if $I(y; v^{(1)} \mid v^{(2)}) = 0$. Whenever $v^{(1)}$ and $v^{(2)}$ are mutually redundant, any representation which contains all the information shared by both is as predictive as their joint observation.
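One way to see why mutual redundancy makes a single view as informative about $y$ as both views together is to apply the chain rule of mutual information in both orders. The short derivation below is a sketch based only on the redundancy definition stated above.

```latex
\begin{align}
I(y; v^{(1)}, v^{(2)}) &= I(y; v^{(2)}) + \underbrace{I(y; v^{(1)} \mid v^{(2)})}_{=\,0} = I(y; v^{(2)}), \\
I(y; v^{(1)}, v^{(2)}) &= I(y; v^{(1)}) + \underbrace{I(y; v^{(2)} \mid v^{(1)})}_{=\,0} = I(y; v^{(1)}).
\end{align}
```

Hence $I(y; v^{(1)}) = I(y; v^{(2)}) = I(y; v^{(1)}, v^{(2)})$: under mutual redundancy, the label information carried by either view alone equals that of the joint observation.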

Second, the heterogeneous graph information bottleneck (HGIB) is formalized via the self-supervised information bottleneck (SSIB). SSIB extends IB by considering the mutual redundancy assumption. Note that HGIB aims at exploring shared information by discarding as much specific information as possible. The IB formula in Eq. (4) can be extended to SSIB as

$$\mathcal{L}_{\mathrm{SSIB}}(\theta) = I(v^{(2)}; h^{(1)}) - \beta I(v^{(1)}; h^{(1)}), \tag{5}$$

where the first term $I(v^{(2)}; h^{(1)})$ maximizes the shared information between the learned representation $h^{(1)}$ from sub-

Uploaded by 极客程序设计 (followers: 7534; resources: 3596)
