Graduation Project Report
Design and Implementation of a Search Engine
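Abstract: We live in an era of big data. As the volume of information
on the network grows, people care more and more about how to quickly
and effectively retrieve the information they need, including latent
and valuable information, from this mass of online information, so
that it can be put to use in daily work and life. Because search
engine technology addresses the difficulty users face in searching the
vast amount of information online, Web search engine technology has
become a direction that both the fast-growing computer industry and
the up-and-coming information industry are eager to explore and study.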
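A search engine is a computer system that, following established
strategies and methods, uses computer programs to find information on
the Internet, organizes and filters what it finds, and presents it,
thereby providing an information retrieval service whose ultimate goal
is to give users material relevant to their queries. Search engines
come in many types: full-text index engines, directory index engines,
integrated search engines, vertical search engines, and meta search
engines, as well as portal search engines and free link lists.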
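This thesis first explains why search engines are needed, then
describes what a search engine is, how search engines are classified,
their processing flow, and their core technologies. It also examines
in more depth how to improve the precision and relevance of search
results.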
Keywords: Web search engine, information retrieval, human-computer
interaction, Lucene full-text search engine
Design and Implementation of a Search Engine
Abstract: We are in an era of big data. As network information
resources grow ever larger, more and more people care about how to
quickly and efficiently retrieve the needed, potential, and valuable
information from the massive body of network information, so that it
can play an effective role in daily work and life. Because search
engine technology is a good solution to the problem of searching the
large amount of information on the Internet, both the rapidly
developing computer industry and the up-and-coming information
industry now treat Web search engine technology as a direction to
explore and research.
A search engine is a computer system that follows established
strategies and methods and uses computer programs to find information
on the Internet, sort and filter it, and display it, providing an
information retrieval service whose ultimate goal is to give users
material related to what they searched for. There are many types of
search engine: full-text index engines, directory index engines,
integrated search engines, vertical search engines, and meta search
engines. In addition, there are portal search engines and free link
lists.
This paper first introduces the necessity of search engines, then
explains what a search engine is, how search engines are classified,
their processing flow, and their core technologies. It also studies in
more depth how to improve search accuracy and relevance.
Keywords: Web search engine, information retrieval, human-computer
interaction, Lucene full-text search engine
Contents
1 Introduction
1.1 Project Background
1.2 Development Status at Home and Abroad and Classification
1.3 Organization of This Thesis
2 Related Technologies
2.1 What Is a Search Engine
2.2 SQL Server Database
2.3 Tomcat Server
3 Basic Principles of Search Engines
3.1 Basic Components of a Search Engine and Their Functions
3.2 Detailed Workflow of a Search Engine
4 System Analysis and Design
4.1 System Analysis
4.2 System Outline Design
4.3 System Implementation Goals
5 Detailed System Implementation
5.1 Environment Configuration
5.2 Feature Implementation
5.2.1 Index Creation
5.2.2 File Search Implementation
5.2.3 Database Connection Configuration
5.2.4 Database Search Implementation
5.2.5 Back-End Data Editing Implementation
6 System Testing
6.1 Importance of Testing
6.2 Test Cases