============================================
Sphider - a lightweight search engine in PHP
Version 1.2.x
By Ando Saabas ando(a t)cs.ioc.ee
============================================
Sphider is a lightweight web spider and search engine written in PHP, using MySQL as its back end database. It is
suitable for adding search functionality to small or medium sites (up to 10-20,000 pages).
--------
Features
--------
1. Spidering
- Can index both static and dynamic pages.
- Finds links in <a href=...>, <frame ...>, <area ...> and <meta ...> tags, and can also follow links given in
javascript as strings via window.location and window.open.
- Respects robots.txt protocol.
- Follows server side redirections.
- Allows spidering to be limited by depth (ie maximum number of clicks from the starting page), by (sub)domain or by
directory.
- Supports indexing of pdf and doc files (using external binaries for file conversion).
- Allows resuming paused spidering.
2. Indexing
- Full text indexing.
- Possbility to exclude common words from being indexed.
- Option to define your custom page ranking function, which can depend on the number of times a given word occurs in the
webpage, whether the word occurs in the domain name, path, or title of the document and also the relative "deepness" of
the url (so that the same page in www.domain.com/ is ranked higher than in www.domain.com/dir1/dir2/foo.html)
3. Searching
- Uses AND operator by default, if more than one query word is used, it finds pages that include all the query words.
- Supports phrase searching.
- Supports excluding words (by putting a '-' in front of a word, any page including the word will be omitted from the
results).
- Option to add and group sites into categories
- Possibility to limit searching to a given category and its subcategories.
4. Size and speed
- Sphider uses reguler expressions to extract links from webpages, so indexing is not particularly fast. Searching is
quite fast, if the database size is reasonable.
-Sphider is very small, its source code being under 70kb in size, probably making it the smallest search engine with
such functionality out there (a pretty good indication of PHP as a rapid prototyping tool).
5. Compatibility
It is a typical LAMP application (but of course it can also be run under Windows). Sphider was designed to be compatible
with older versions of PHP and MySQL, it should work with at least PHP 3 and MySQL 3.23.
6. Licence
Sphider is licenced under GNU General Public Licence.
--------------------------------------------------------------------------------
Ando Saabas 2004
Contact: ando (at) cs.ioc.ee
没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
【项目资源】:包含前端、后端、移动开发、操作系统、人工智能、物联网、信息化管理、数据库、硬件开发、大数据、课程资源、音视频、网站开发等各种技术项目的源码。包括STM32、ESP8266、PHP、QT、Linux、iOS、C++、Java、python、web、C#、EDA、proteus、RTOS等项目的源码。【项目质量】:所有源码都经过严格测试,可以直接运行。功能在确认正常工作后才上传。【适用人群】:适用于希望学习不同技术领域的小白或进阶学习者。可作为毕设项目、课程设计、大作业、工程实训或初期项目立项。【附加价值】:项目具有较高的学习借鉴价值,也可直接拿来修改复刻。对于有一定基础或热衷于研究的人来说,可以在这些基础代码上进行修改和扩展,实现其他功能。【沟通交流】:有任何使用上的问题,欢迎随时与博主沟通,博主会及时解答。鼓励下载和使用,并欢迎大家互相学习,共同进步。
资源推荐
资源详情
资源评论
收起资源包目录
[搜索链接]Sphider--开源的蜘蛛程序_sphider.rar (36个子文件)
[搜索链接]Sphider--开源的蜘蛛程序_sphider
sphider
include
index_header.inc 206B
commonfuncs.php 4KB
common.txt 623B
connect.php 476B
index_footer.inc 16B
index.css 376B
conf.php 3KB
sql
tables.sql 1KB
upgrade_to_1.2.5.sql 160B
upgrade_to_1.2.6.sql 221B
upgrade_to_1.2.sql 171B
install.txt 6KB
languages
pt-language.php 950B
it-language.php 1024B
nl-language.php 1003B
ee-language.php 932B
cn-language.php 867B
es-language.php 998B
en-language.php 961B
de-language.php 1018B
upgrading.txt 335B
admin
auth.php 1KB
admin.css 395B
spider.php 15KB
spiderfuncs.php 21KB
auth_old.php 757B
tmp
admin.php 40KB
auth.php.bak 1KB
messages.php 3KB
ext.txt 170B
install.php 3KB
search.php 14KB
安装说明.txt 549B
searchfuncs.php 8KB
readme.txt 3KB
search.css 1KB
共 36 条
- 1
资源评论
心梓知识
- 粉丝: 6171
- 资源: 3万+
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功