【免费】2019-KDD-GCN-MF,Disease-GeneAssociationIdentificationByGrap资源-CSDN文库

需积分: 0 30 浏览量 2022-08-04 11:20:49 上传评论收藏 1.35MB PDF 举报

资源详情

资源评论

资源推荐

GCN-MF: Disease-Gene Association Identification By Graph

Convolutional Networks and Matrix Factorization

Peng Han

King Abdullah University of Science

and Technology

peng.han@kaust.edu.sa

Peng Yang

Cognitive Computing Lab

Baidu Research USA

yangpeng1985521@gmail.com

Peilin Zhao

∗

Tencent AI Lab

peilinzhao@hotmail.com

Shuo Shang

∗

University of Electronic Science and

Technology of China

Inception Institute of Articial

Intelligence

jedi.shang@gmail.com

Yong Liu

Alibaba-NTU Singapore Joint

Research Institute, Nanyang

Technological University

stephenliu@ntu.edu.sg

Jiayu Zhou

Michigan State University

jiayuz@msu.edu

Xin Gao

King Abdullah University of Science

and Technology

xin.gao@kaust.edu.sa

Panos Kalnis

King Abdullah University of Science

and Technology

panos.kalnis@kaust.edu.sa

ABSTRACT

Discovering disease-gene association is a fundamental and crit-

ical biomedical task, which assists biologists and physicians to

discover pathogenic mechanism of syndromes. With various clin-

ical biomarkers measuring the similarities among genes and dis-

ease phenotypes, network-based semi-supervised learning (NSSL)

has been commonly utilized by these studies to address this class-

imbalanced large-scale data issue. However, most existing NSSL

approaches are based on linear models and suer from two major

limitations: 1) They implicitly consider a local-structure represen-

tation for each candidate; 2) They are unable to capture nonlinear

associations between diseases and genes. In this paper, we propose

a new framework for disease-gene association task by combin-

ing Graph Convolutional Network (GCN) and matrix factorization,

named GCN-MF. With the help of GCN, we could capture non-

linear interactions and exploit measured similarities. Moreover, we

dene a margin control loss function to reduce the eect of spar-

sity. Empirical results demonstrate that the proposed deep learning

algorithm outperforms all other state-of-the-art methods on most

of metrics.

CCS CONCEPTS

• Computing methodologies → Semantic networks.

∗

Corresponding Author.

Permission to make digital or hard copies of all or part of this work for personal or

classroom use is granted without fee provided that copies are not made or distributed

for prot or commercial advantage and that copies bear this notice and the full citation

on the rst page. Copyrights for components of this work owned by others than ACM

must be honored. Abstracting with credit is permitted. To copy otherwise, or republish,

to post on servers or to redistribute to lists, requires prior specic permission and/or a

fee. Request permissions from permissions@acm.org.

KDD ’19, August 4–8, 2019, Anchorage, AK, USA

ACM ISBN 978-1-4503-6201-6/19/08.. . $15.00

https://doi.org/10.1145/3292500.3330912

KEYWORDS

graph convolutional networks; deep learning; disease-gene associa-

tion

ACM Reference Format:

Peng Han, Peng Yang, Peilin Zhao, Shuo Shang, Yong Liu, Jiayu Zhou, Xin

Gao, and Panos Kalnis. 2019. GCN-MF: Disease-Gene Association Identi-

cation By Graph Convolutional Networks and Matrix Factorization. In The

25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

(KDD ’19), August 4–8, 2019, Anchorage, AK, USA. ACM, New York, NY, USA,

9 pages. https://doi.org/10.1145/3292500.3330912

1 INTRODUCTION

Identifying disease genes from human genome is an important and

fundamental problem in biomedical research [

]. Despite

many publications of machine learning methods have been applied

to discover new disease genes, it still remains a challenge. Because

the set of genes pleiotropy is large, and the number of conrmed

disease genes among whole genome and the genetic heterogeneity

of diseases is limited. Recent approaches have applied the concept

of ’guilty by association’ to investigate the association between

a disease phenotype and its causative genes, which means that

candidate genes with similar characteristics as known disease genes

are more likely to be associated with diseases.

However, due to the imbalance issues (few genes are experimen-

tally conrmed as disease related genes within human genome)

in disease-gene identication, semi-supervised approaches, like

label propagation approaches and positive-unlabeled learning, are

widely used to identify candidate disease-gene links [

]. These

methods make use of unknown genes for training typically in the

scenario of a small amount of conrmed disease-genes (labeled

data) with a large amount of unknown genome (unlabeled data).

The performance of disease-gene association models are limited by

potential bias of single learning models, incompleteness and noise

Research Track Paper

KDD ’19, August 4–8, 2019, Anchorage, AK, USA

705

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余8页未读，立即下载

评论收藏

内容反馈

曹多鱼

粉丝: 19
资源: 314

2019-KDD-GCN-MF, Disease-Gene Association Identification By Grap

评论0

最新资源

2019-KDD-GCN-MF, Disease-Gene Association Identification By Grap

评论0

2019-KDD-Cluster-GCN, An Efficient Algorithm for Training Deep a

NSL-KDD 入侵检测数据集.zip

NSL-KDD_NSL-KDD_NSL-KDD数据集_测试集_

NSL-KDD-Dataset-master_NSL-KDD数据集_入侵检测_KDD_

2019-KDD-KGAT, Knowledge Graph Attention Network for Recommendat

2019-KDD-Automating Feature Subspace Exploration via Multi-Agent

NSL-KDD数据集

ids-kdd99 ids-kdd99

Intrusion-Detection-on-NSL-KDD-master_lstm分类_NSL-KDD_NSL-KDDlstm

NSL-KDD数据集各文件下载

随机森林、决策树的matlab源码，NSL-KDD分类数据集

2019-KDD-DEMO-Net Degree-specific Graph Neural Networks for Node

2019-KDD-Conditional Random Field Enhanced Graph Convolutional N

2019-KDD-Graph Recurrent Networks with Attributed Random Walks-网

NSL-KDD数据集arff格式

NSL-KDD.zip（KDDCUP99改进版）

BurpLoaderKeygen.jar.zip

最新版ISO/IEC 27001:2022、ISO 27002:2022中英文合集

Goby红队版-win-x64-2.4.7版本

Chrome Header Editor 插件

ISO SAE 21434-2021 中文版.pdf

OpenVAS GVM 中文翻译补丁

安全认证cisp教材全套

STM32F103C8T6核心板-电路原理图1.PDF

软件工程导论(第六版)课后习题答案1

OpenVAS离线资源

现代永磁同步电机控制原理及MATLAB仿真__袁雷编著1

goby红队&社区版-win-64-2.4.7

最新资源