没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
©Manning Publications Co. Please post comments or corrections to the Author Online forum:
http://www.manning-sandbox.com/forum.jspa?forumID=828
MEAP Edition
Manning Early Access Program
Solr in Action
version 6
Copyright 2013 Manning Publications
For more information on this and other Manning titles go to
www.manning.com
Licensed to Jiwen Zhang <kevinzhang29@gmail.com>
©Manning Publications Co. Please post comments or corrections to the Author Online forum:
http://www.manning-sandbox.com/forum.jspa?forumID=828
brief contents
PART 1: MEET SOLR
1 Introduction to Solr
2 Getting to know Solr
3 Key Solr concepts
4 Configuring Solr
5 Indexing
6 Text analysis
PART 2: CORE SOLR CAPABILITIES
7 Performing queries & handling results
8 Faceted search
9 Hit highlighting
10 Search suggestions
11 Result Grouping / Field Collapsing
12 Taking Solr to production
PART 3: TAKING SOLR TO THE NEXT LEVEL
13 Scaling Solr / SolrCloud
14 Multi-lingual Search
15 Complex data operations
16 Relevancy tuning
17 Thinking outside the box
APPENDIXES
A Building Solr from source
B Working with the Solr community
Licensed to Jiwen Zhang <kevinzhang29@gmail.com>
©Manning Publications Co. Please post comments or corrections to the Author Online forum:
http://www.manning-sandbox.com/forum.jspa?forumID=828
1
Introduction to Solr
This chapter covers
Characteristics of the types of data handled by search engines
Common search engine use cases
Key components of Solr
Reasons to choose Solr
Feature overview
With fast-growing technologies like social media, cloud computing, mobile applications,
and big data, these are exciting times to be in computing. One of the main challenges facing
software architects is the need to handle massive volumes of data consumed and produced
by a huge global user base. In addition, users expect online applications to always be
available and responsive. To address the scalability and availability needs of modern web
applications, we’ve seen a growing interest in specialized, non-relational data storage and
processing technologies, collectively known as NoSQL (Not only SQL). These systems share a
common design pattern of matching the storage and processing engine to specific types of
data rather than forcing all data into the once de facto standard relational model. In other
words, NoSQL technologies are optimized to solve a specific class of problems for specific
types of data. The need to scale has led to hybrid architectures composed of a variety of
NoSQL and relational databases; gone are the days of the one-size-fits-all data processing
solution.
This book is about a specific NoSQL technology, Apache Solr, which, like its non-relational
brethren, is optimized for a unique class of problems. Specifically, Solr is a scalable, ready-
to-deploy enterprise search engine that’s optimized to search large volumes of text-centric
data and return results sorted by relevance. That was a bit of a mouthful, so let’s break the
previous statement down into its basic parts:
1
Licensed to Jiwen Zhang <kevinzhang29@gmail.com>
©Manning Publications Co. Please post comments or corrections to the Author Online forum:
http://www.manning-sandbox.com/forum.jspa?forumID=828
Scalable—Solr scales by distributing work (indexing and query processing) to multiple
servers in a cluster
Ready to deploy—Solr is open-source, is easy to install and configure, and provides
a preconfigured example to help you get started
Optimized for search—Solr is fast and can execute complex queries in subsecond
speed, often only 10’s of milliseconds
Large volumes of documents—Solr is designed to deal with indexes containing
millions of documents
Text-centric—Solr is optimized for searching natural language text, like emails, web
pages, resumes, PDF documents, and social messages like tweets or blogs
Results sorted by relevance—Solr returns documents in ranked order based on how
relevant each document is to the user’s query
In this book, you’ll learn how to use Solr to design and implement scalable search
solutions. We’ll begin our journey by learning about the types of data and uses cases Solr
supports. This will help you understand where Solr fits into the big picture of modern
application architectures and which problems Solr is designed to solve.
1.1 Why do I need a search engine?
We suspect that because you’re looking at this book, you already have an idea about why
you need a search engine. Therefore, rather than speculate on why you’re considering Solr,
we’ll get right down to the hard questions you need to answer about your data and use cases
in order to decide if a search engine is right for you. In the end, it comes down to
understanding your data and users and then picking a technology that works for both. Let’s
start by looking at the properties of data that a search engine is optimized to handle.
1.1.1 Managing text-centric data
A hallmark of modern application architectures is matching the storage and processing
engine to your data. If you’re a programmer, then you know to select the best data structure
based on how you use the data in an algorithm, that is, you don’t use a linked list when you
need fast random lookups. The same principle applies with search engines. There are four
main characteristics of data search engines like Solr are optimized to handle.
1. Text-centric
2. Read-dominant
3. Document-oriented
4. Flexible schema
A possible fifth characteristic is having a large volume of data to deal with, that is, “big
data,” but our focus is on what makes a search engine special among other NoSQL
technologies. It goes without saying that Solr can deal with large volumes of data.
2
Licensed to Jiwen Zhang <kevinzhang29@gmail.com>
剩余262页未读,继续阅读
资源评论
- 玄月初心2013-11-02虽然不够完整,但仍然不错,谢谢
- yunfeng_lin2013-06-17这个试读版,好多内容都没有...不过还是可以看看的
- stingu2013-06-07是试读版,也很不错,但要注意里面的一些内容都还没正式确定下来。
- ldcadai2013-08-29非常不错,很有用
- high00482014-01-10虽然不够完整,但仍然有参考意义
manorn
- 粉丝: 2
- 资源: 89
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功