Graph-based Natural Language Processing and Information Retrieval


-
Graphs are ubiquitous. There is hardly any domain in which objects and their relations cannot be intuitively represented as nodes and edges in a graph. Graph theory is a well-studied sub-discipline of mathematics, with a large body of results and a large number of efficient algorithms that operate on graphs. Like many other disciplines, the fields of natural language processing (NLP) and information retrieval (IR) also deal with data that can be represented as a graph. In this light, it is somewhat surprising that only in recent years the applicability of graph-theoretical frameworks to language technology became apparent and increasingly found its way into publications in the field of computational linguistics. Using algorithms that take the overall graph structure of a problem into account, rather than characteristics of single objects or (unstructured) sets of objects, graph-based methods have been shown to improve a wide range of NLP tasks. In a short but comprehensive overview of the field of graph-based methods for NLP and IR, Rada Mihalcea and Dragomir Radev list an extensive number of techniques and examples from a wide range of research papers by a large number of authors. This book provides an excellent review of this research area, and serves both as an introduction and as a survey of current graph-based techniques in NLP and IR. Because the few existing surveys in this field concentrate on particular aspects, such as graph clustering (Lancichinetti and Fortunato 2009) or IR (Liu 2006), a textbook on the topic was very much needed and this book surely fills this gap. The book is organized in four parts and contains a total of nine chapters. The first part gives an introduction to notions of graph theory, and the second part covers natural and random networks. The third part is devoted to graph-based IR, and part IV covers graph-based NLP. Chapter 1 lays the groundwork for the remainder of the book by introducing all necessary concepts in graph theory, inc
GRAPH-BASED NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL Graph theory and the fields of natural language processing and information retrieval are well-studied disciplines. Traditionally, these areas have been per- ceived as distinct, with different algorithms, different applications, and different potential end-users. However, recent research has shown that these disciplines are intimately connected, with much variety in the way that natural language processing and information retrieval applications find efficient solutions within graph-theoretical frameworks This book is a comprehensive description of the use of graph-based algo rithms for natural language processing and information retrieval. It brings together topics as diverse as lexical semantics, text summarization, text min ing, ontology construction, text classification, and text retrieval, which are connected by the common underlying theme of the use of graph-theoretical methods for text-and information-processing tasks. Readers will gain a firm understanding of the major methods and applications in natural language pro cessing and information retrieval that rely on graph-based representations and algorithms Rada mihalcea is an associate Professor in the department of computer sci ence and Engineering at the University of North Texas, where she leads the Language and Information Technologies research group In 2009, she received the Presidential Early Career Award for Scientists and Engineers, awarded by President Barack Obama. She served on the editorial board of several journals, including Computational Linguistics, Journal of Natural Language engineer ing, and Language Resources and Evaluations, and she cochaired the empirical Methods in Natural Language Processing Conference in 2009 and the Associa- tion for Computational linguistics Conference in 2011. She has been published in IEEE Intelligent SystemS, Journal of Natural Language Engineering, Jour- nal of Machine Translation, Computational Intelligence, International Journal of semantic Computing, and Artificial Intelligence magazine Dragomir Radev is a Professor in the School of Information, the Depart- ment of Electrical Engineering and Computer Science, and the Department of Linguistics at the University of Michigan, where he is the leader of the Com putational Linguistics and Information Retrieval(CLaIr) research group. He has had more than 100 publications in conferences and journals such as Com- munications of the Association for Computing Machinery(ACM), Journal of Artificial Intelligence research, BioinformaticS, Computational Linguistics, Information Processing and Management, and American Journal of political Science, among others. He is on the editorial boards of information retrieval Journal of Natural language Engineering, and Journal of artificial Intelligence Research. Professor radev is an acm distinguished scientist as well as coach of the U.S. high school team in computational linguistics. He is also an adjunct Professor in computer science at Columbia University GRAPH-BASED NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL RADA MIHALCEA University of North Texas, Department of Computer Science and Engineering DRAGOMIR RADEV University of michigan School of information Department of Electrical Engineering and Computer Science Department of linguistics 罗 CAMBRIDGE UNIVERSITY PRESS CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape to Singapore, Sao Paulo, Delhi, Tokyo, Mexico City Cambridge University Press 32 Avenue of the americas. New york. ny 10013-2473 USA www.cambridge.org Informationonthistitlewww.cambridge.org/9780521896139 C Rada Mihalcea and Dragomir Radev 2011 This publication is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements no reproduction of any part may take place without the written permission of Cambridge University Press First published 2011 Printed in the united states of america A catalog record for this publication is available from the british library Library of Congress Cataloging in Publication data Mihalcea Rada, 1974- raph Based Natural Language Processing and Information Retrieval /Rada mihalcea Dragomir radev Includes bibliographical references and index ISBN978-0-521-89613-9 1. Natural language processing( Computer science) 2. Graphical user interfaces Computer systems)I Radev, Dragomir, 1968- Il. Title QA76.9N38M532011 005437dc22 2010044578 isbn 978-0-521-89613-9 Hardback Cambridge University Press has no responsibility for the persistence or accuracy of URLs for external or third-party Internet Web sites referred to in this publication and does not guarantee that any content on such Web sites is, or will remain, accurate or appropriate Contents Introduction page I 0. 1 Background 0.2 Book Organization 0.3 Acknowledgments 347 Part I. Introduction to Graph Theory 1 Notations, Properties, and representations 1.1 Graph terminology and notations 1.2 Graph properties 1.3 Graph Types 14 1. 4 Representing Graphs as Matrices 15 1.5 USing Matrices to Compute Graph Properties 16 1.6 Representing graphs as linked lists 17 1.7 Eigenvalues and Eigenvectors 18 2 Graph-Based Algorithms 20 2. 1 Depth-First Graph Traversal 20 2.2 Breadth-First Graph Traversal 22 2.3 Minimum Spanning Trees 23 2. 4 Shortest-Path algorithms 26 2.5 Cuts and flows 29 2.6 Graph Matching 31 2. 7 Dimensionality reduction 32 2.8 Stochastic Processes on graphs 34 2.9 Harmonic functions 38 2.10 Random Walks 40 2. 11 Spreading activation 41 2. 12 Electrical Interpretation of Random Walks 42 2.13 Power Method 44 Contents 2.14 Linear Algebra Methods for Computing Harmonic Functions 45 2. 15 Method of relaxations 46 2.16 Monte Carlo method Part network 3 Random networks 3. 1 Networks and graphs 3.2 Random graphs 54 3.3 Degree Distributions 3.4 Power laws 57 3.5 Zipfs Law 58 3.6 Preferential Attachment 61 3.7 Giant Component 62 3.8 Clustering Coefficient 62 3.9 Small Worlds 63 3.10 Assortativity 65 3. 1 1 Centrality 67 3. 12 Degree Centralit 67 3. 13 Closeness Centrality 68 3. 14 Betweenness Centrality 6 3. 15 Network Example 3.16 Dynamic Processes: Percolation 3. 17 Strong and weak ties 74 3.18 Assortative Mixing 76 3. 19 Structural holes 76 4 Language Networks 78 4.1 Co-Occurrence Networks 4.2 Syntactic Dependency Networks 80 4.3 Semantic Networks 81 4.4 Similarity Networks 85 Part Ill. Graph-Based Information Retrieval 5 Link analysis for the World wide Web 91 5.1 The Web as a graph 91 5.2 PageRank 92 5.3 Undirected graphs 95 5.4 Weighted Graphs 95 Contents 5.5 Combining PageRank with Content Analysis 97 5.6 Topic-Sensitive Link Analysis 97 5.7 Query-Dependent Link Analysis 100 5.8 Hyperlinked-Induced Topic Search 101 5.9 Document Reranking with Induced Links 103 6 Text Clustering 106 6. 1 Graph-Based Clustering 108 6.2 Spectral Methods 111 6.3 The Fiedler Method 113 6.4 The Kernighan-Lin Method 114 6.5 Betweenness-Based Clustering 115 6.6 Min-Cut Clustering 117 6.7 Text Clustering Using Random Walks 119 Part IV. Graph-Based Natural Language Processing 7 Semantics 123 7. 1 Semantic Classes 123 7.2 Synonym Detection 125 7.3 Semantic Distance 126 7.4 Textual Entailment 129 7.5 Word-Sense Disambiguation 131 7.6 Name Disambiguation 134 7.7 Sentiment and Subjectivity 135 8 Syntax 140 8. 1 Part-of-Speech Tagging 140 8.2 Dependency Parsing 141 8.3 Prepositional-Phrase Attachment 144 8. 4 Co-Reference resolution 146 9 Applications 149 9.1 Summarization 149 9.2 Semi-supervised Passage Retrieval 150 9.3 Keyword Extraction 154 9.4 Topic Identification 156 9.5 Topic Segmentation 161 9.6 Discourse 162 9.7 Machine Translation 165 Contents 9.8 Cross-Language Information Retrieval 166 9.9 Information Extraction 169 9.10 Question Answering 9.11 Term Weighting 174 Bibliography ndex 191

-
2016-04-27
-
2015-08-31
-
2015-06-03
-
2014-05-26
1.40MB
Graph-Based Semi-Supervised Learning
2019-01-09While labeled data is expensive to prepare, ever increasing amounts of unlabeled data is becoming wi
23KB
论文Efficient Graph- Based Image Segmentation的实现代码,c/c++版本的,支持jpg格式输入输出
2017-07-24该代码是论文Efficient Graph- Based Image Segmentation的实现代码,c/c++版本的,主要代码来自于该论文的作者ppf的实现代码,在此基础上加入了格式转换的接口使
9.63MB
Graph-based Knowledge Representation: Computational Foundations of Conceptual Graphs
2010-06-26This book provides a definition and study of a knowledge representation and reasoning formalism stem
4.12MB
Natural Language Processing and Text Mining
2017-12-03The topic this book addresses originated from a panel discussion at the ACM SIGKDD (Special Interest
2.35MB
A Tutorial on Graph-Based SLAM
2018-09-11A Tutorial on Graph-Based SLAM 全英文,值得仔细研究
725KB
Landmark Graph-based Indoor Localization-2020.pdf
2020-08-04《Landmark Graph-based Indoor Localization-2020》Fuqiang Gu, Member, IEEE, Shahrokh Valaee, Fellow, IE
3.51MB
Machine learning for graph-based representations
2019-02-12In this paper, we develop an alternative network reduction approach to characterizing transport in D
10KB
图像分割—基于图的图像分割(Graph-Based Image Segmentation)C++源代码
2017-12-28C++版源代码,分享给大家,Graph-Based Segmentation 是经典的图像分割算法,作者Felzenszwalb也是提出DPM算法的大牛。该算法是基于图的贪心聚类算法,实现简单,速度比
579KB
Efficient Graph-Based Image Segmentation
2014-09-03Efficient Graph-Based Image Segmentation论文的实现代码,以及给了一个测试样例,在Linux下运行通过
8.41MB
Graph-based Knowledge Representation(基于图的知识表示)
2019-02-20This book provides a definition and study of a knowledge representation and reasoning formalism stem
7.3MB
Efficient Graph-Based Image Segmentation.rar
2019-07-09《Efficient Graph-Based Image Segmentation》论文C++代码,交流学习下
371KB
2019-Session-based Recommendation with Graph Neural Networks.pdf
2019-06-112019-Session-based Recommendation with Graph Neural Networks
363KB
边缘化Consistent Node Removal and Edge Sparsification for Graph-based SLAM.pdf
2021-02-19边缘化Consistent Node Removal and Edge Sparsification for Graph-based SLAM.pdf
212KB
论文研究-LIVER SEGMENTATION BASED ON GRAPH-BASED AND QUADTREE.pdf
2019-08-20基于Graph-Based和四叉树的肝脏自动提取,韦琴林,房斌,在此论文中,采用图论和四叉树相结合的方法对肝脏CT图像进行自动提取。首先我们采用各向性非线性扩散滤波对图像进行平滑去噪,然�
5.20MB
neo4j-graph-algorithms-3.5.0.1.zip
2019-01-06neo4j-graph-algorithms-3.5.0.1.zip图形数据库编码及应用,安装
210KB
Saliency Detection via Graph-Based Manifold Ranking代码
2013-07-252013年经典CVPR文章代码 Saliency Detection via Graph-Based Manifold Ranking
661KB
Graph-based reasoning model for multiple relation extraction.pdf
2021-03-02Graph-based reasoning model for multiple relation extraction.pdf
1.41MB
Graph-based path planning for autonomous robotic exploration
2021-02-01title={Graph-based path planning for autonomous robotic exploration in subterranean environments}, a
10.74MB
基于图结构的平面目标追踪算法(亮风台)
2017-09-02Gracker: A Graph-based Planar Object Tracker(基于图结构的平面目标追踪算法)的源码比较详尽的注解。本文件下含有一份代码和对应论文。
3.76MB
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Rec.pptx
2020-06-05Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition 分离与统一的图卷积用于基于骨架的
1.39MB
neo4j-graph-algorithms-algo-3.5.4.0.jar
2020-07-20用于neo4j数据库中的算法插件包,下载后需要放入plugin文件夹下,并修改neo4j的config文件,最后重启服务可以使用
1.61MB
Meta-Graph Based Recommandation
2018-12-10基于子图的推荐系统,融合异构信息网络Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks
12.37MB
GraphCut-based Optimisation for Computer Vision
2018-07-19Motivation Min-Cut / Max-Flow (Graph Cut) Algorithm Markov and Conditional Random Fields Random Fiel
178KB
CCS3.3里面,使用View -Graph-Image显示图片 DSP图片显示
2015-09-15CCS3.3里面,使用View -Graph-Image显示图片 DSP图片显示。自己试验,配图显示过程,希望能帮助你!
14.29MB
graph-tool-2.27.tar.bz2
2019-05-23Graph-tool is an efficient Python module for manipulation and statistical analysis of graphs (a.k.a.
15.24MB
Graph-Cuts versus Level-Sets,ECCV06_tutorial_partI-partIV
2011-12-29Graph-Cuts versus Level-Sets,ECCV06_tutorial_partI-partIV
3.15MB
graph-based SLAM.pdf
2021-02-23小秋SLAM笔记
18KB
json-graph-specification, 在JSON中,表示图结构( 节点/边)的一.zip
2019-09-18json-graph-specification, 在JSON中,表示图结构( 节点/边)的一 json-graph-specificationJSON中表示图形结构的一种。链接持续集成 Gitter
947.10MB
janusgraph-0.5.x.zip
2020-08-19之前上传的是0.4和0.3版本的,这次的是0.5所有版本的,截至时间为2020-08-19 11:00
332KB
Generalization performance of graph-based semi-supervised classification
2019-12-30基于图的半监督学习算法的推广误差,陈洪,李落清,半监督学习算法近几年来研究十分活跃,许多方法被引入.这些方法在仿真实验和实际数据处理中显示出良好的性能.但是其算法的理论基础
-
博客
【报错】JSONArray cannot be cast to java.util.ArrayList
【报错】JSONArray cannot be cast to java.util.ArrayList
-
下载
天津市_矢量路网_行政地名.zip
天津市_矢量路网_行政地名.zip
-
博客
java 五子棋_Java GUI 单机版五子棋
java 五子棋_Java GUI 单机版五子棋
-
下载
部编语文三上PPT课件 06 秋天的雨4.ppt
部编语文三上PPT课件 06 秋天的雨4.ppt
-
学院
C/C++反汇编解密
C/C++反汇编解密
-
博客
python 函数调用图_[宜配屋]听图阁
python 函数调用图_[宜配屋]听图阁
-
下载
《声音频率测试》软件.rar
《声音频率测试》软件.rar
-
博客
总理2021年报告全文
总理2021年报告全文
-
学院
2021年软考系统规划与管理师-上午历年真题解析视频课程
2021年软考系统规划与管理师-上午历年真题解析视频课程
-
下载
计算机等级考试三级数据库应用题大全.
计算机等级考试三级数据库应用题大全.
-
下载
apache-beam-2.26.0.zip
apache-beam-2.26.0.zip
-
下载
p29997937_190000_MSWIN-x86-64.zip
p29997937_190000_MSWIN-x86-64.zip
-
博客
计算机视觉-语义分割任务常用指标
计算机视觉-语义分割任务常用指标
-
下载
最新全国省市区街道四级联动json数据
最新全国省市区街道四级联动json数据
-
博客
时间戳 java_java中获取时间戳的方法
时间戳 java_java中获取时间戳的方法
-
学院
零基础一小时极简以太坊智能合约开发环境搭建并开发部署
零基础一小时极简以太坊智能合约开发环境搭建并开发部署
-
下载
批量修改文件名称工具-绿色-免费.zip
批量修改文件名称工具-绿色-免费.zip
-
博客
SpringMVC - SpringMVC拦截器的实现、过滤器与拦截器关系、执行顺序等
SpringMVC - SpringMVC拦截器的实现、过滤器与拦截器关系、执行顺序等
-
学院
MySQL 事务和锁
MySQL 事务和锁
-
学院
MySQL 数据类型和运算符
MySQL 数据类型和运算符
-
博客
[Java]流程控制
[Java]流程控制
-
下载
S9300V100R003C00SPC200.web.zip
S9300V100R003C00SPC200.web.zip
-
博客
window搭建hadoop
window搭建hadoop
-
博客
「瞰见·未来」大会合规论坛:合规是区块链和加密产业未来发展大趋势
「瞰见·未来」大会合规论坛:合规是区块链和加密产业未来发展大趋势
-
学院
华为1+X——网络系统建设与运维(高级)
华为1+X——网络系统建设与运维(高级)
-
学院
牛牛量化策略交易
牛牛量化策略交易
-
学院
QT编程思想【C++,基于QT 6】
QT编程思想【C++,基于QT 6】
-
学院
云开发后台+微信扫码点餐小程序+cms网页管理后台 含后厨端和用户端
云开发后台+微信扫码点餐小程序+cms网页管理后台 含后厨端和用户端
-
学院
MySQL 多平台多模式(安装、配置和连接 详解)
MySQL 多平台多模式(安装、配置和连接 详解)
-
下载
钢梁abaqus数值仿真
钢梁abaqus数值仿真