Block Sort Based Indexer README Manual
--------------------------------------
NOTE: The implementation in the TextFileTermListWriter class is currently NOT implemented.
Implementation will be done when time permits.
To use this software (in it's current state) you need to do the following:
1. Run the project under Java 1.6 or later (some code used does not work in < 1.6)
2. Create a folder "documents" and "index" in the project-folder (program checks for these folders)
3. Dump a bunch of .txt-documents into the documents folder that can be indexed (filenames should be unique)
4. Wait for the program to finish up (console outputs a message letting you know).
5. Browse block and index files to see that everything went OK...
This program is a simple example of BSBI (block sort based indexing). It is obviously not supposed to
be used professionally. If you want to contribute, please feel free to do so! The indexer is currently
very dependent on the implementation of the index and type of input/output-source (i.e. text documents).
Some abstraction could be useful. Error handling could probably be improved. Work on the indexer will be
continued quite irregularly.
TODO
1. Implement a MySQL-based index reader/writer to increase performance.
2. Add more documentation to make code more easy to read.
基于分块的外存倒排索引
4星 · 超过85%的资源 需积分: 9 139 浏览量
2015-06-17
16:56:50
上传
评论
收藏 20KB ZIP 举报
u010672325
- 粉丝: 0
- 资源: 1
最新资源
- 新能源汽车行人提醒装置AVAS流程图
- 海信电视刷机数据 LED32EC290N(0000)BOM1通用LED32K220(0000)BOM1生产用软件数据U盘升级文件
- 基于BS模式的冷链物流系统的设计与实现(论文+源码)-kaic.zip
- 红绿灯现实测试数据集,红绿灯识别检测
- unity自制模式(MVC与命令模式相结合)
- 基于tensorflow卷积神经网络实现图像风格的迁移源码(高分项目).zip
- 学习参考数据 1.参考书 2.ppt 3.源码
- 基于深度学习的Xilinx DDR3存储器接口解决方案,适合FPGA的初学者,也适合需要使用DDR进行FPGA设计的设计人员
- arduino ide 2.3.2 Windows
- 花旗杯基于深度学习的模型分析与特征提取
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈