Introduction to Information Retrieval
PDF, 7.09 MB. Uploaded 2018-02-07 20:50:24.
P1: KRU/IRP
irbook CUUS232/Manning 978 0 521 86571 5 May 27, 2008 12:8
Introduction to Information Retrieval
Introduction to Information Retrieval is the first textbook with a coherent treatment
of classical and web information retrieval, including web search and
the related areas of text classification and text clustering. Written from a
computer science perspective, it gives an up-to-date treatment of all aspects
of the design and implementation of systems for gathering, indexing, and
searching documents and of methods for evaluating systems, along with an
introduction to the use of machine learning methods on text collections.
Designed as the primary text for a graduate or advanced undergraduate
course in information retrieval, the book will also interest researchers and
professionals. A complete set of lecture slides and exercises that accompany
the book are available on the web.
Christopher D. Manning is Associate Professor of Computer Science and Linguistics
at Stanford University.
Prabhakar Raghavan is Head of Yahoo! Research and a Consulting Professor
of Computer Science at Stanford University.
Hinrich Schütze is Chair of Theoretical Computational Linguistics at the
Institute for Natural Language Processing, University of Stuttgart.
Introduction to Information Retrieval
Christopher D. Manning
Stanford University
Prabhakar Raghavan
Yahoo! Research
Hinrich Schütze
University of Stuttgart
CAMBRIDGE UNIVERSITY PRESS
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo, Delhi
Cambridge University Press
32 Avenue of the Americas, New York, NY 10013-2473, USA
www.cambridge.org
Information on this title: www.cambridge.org/9780521865715
© Cambridge University Press 2008
This publication is in copyright. Subject to statutory exception
and to the provisions of relevant collective licensing agreements,
no reproduction of any part may take place without
the written permission of Cambridge University Press.
First published 2008
Printed in the United States of America
A catalog record for this publication is available from the British Library.
Library of Congress Cataloging in Publication data
Manning, Christopher D.
Introduction to information retrieval / Christopher D. Manning, Prabhakar
Raghavan, Hinrich Schütze.
p. cm.
Includes bibliographical references and index.
ISBN 978-0-521-86571-5 (hardback)
1. Text processing (Computer science) 2. Information retrieval. 3. Document
clustering. 4. Semantic Web. I. Raghavan, Prabhakar. II. Schütze, Hinrich.
III. Title.
QA76.9.T48M26 2008
025.04 – dc22 2008001257
ISBN 978-0-521-86571-5 hardback
Cambridge University Press has no responsibility for
the persistence or accuracy of URLs for external or
third-party Internet Web sites referred to in this publication
and does not guarantee that any content on such
Web sites is, or will remain, accurate or appropriate.
Contents
Table of Notation
Preface
1 Boolean retrieval
1.1 An example information retrieval problem
1.2 A first take at building an inverted index
1.3 Processing Boolean queries
1.4 The extended Boolean model versus ranked retrieval
1.5 References and further reading
2 The term vocabulary and postings lists
2.1 Document delineation and character sequence decoding
2.2 Determining the vocabulary of terms
2.3 Faster postings list intersection via skip pointers
2.4 Positional postings and phrase queries
2.5 References and further reading
3 Dictionaries and tolerant retrieval
3.1 Search structures for dictionaries
3.2 Wildcard queries
3.3 Spelling correction
3.4 Phonetic correction
3.5 References and further reading
4 Index construction
4.1 Hardware basics
4.2 Blocked sort-based indexing
4.3 Single-pass in-memory indexing
4.4 Distributed indexing
4.5 Dynamic indexing