Lucene Change Log
$Id: CHANGES.txt,v 1.94 2004/07/01 17:40:41 cutting Exp $
1.4 final
1. Added "an" to the list of stop words in StopAnalyzer, to complement
the existing "a" there. Fix for bug 28960
(http://issues.apache.org/bugzilla/show_bug.cgi?id=28960). (Otis)
2. Added new class FieldCache to manage in-memory caches of field term
values. (Tim Jones)
3. Added overloaded getFieldQuery method to QueryParser which
accepts the slop factor specified for the phrase (or the default
phrase slop for the QueryParser instance). This allows overriding
methods to replace a PhraseQuery with a SpanNearQuery instead,
keeping the proper slop factor. (Erik Hatcher)
4. Changed the encoding of GermanAnalyzer.java and GermanStemmer.java to
UTF-8 and changed the build encoding to UTF-8, to make changed files
compile. (Otis Gospodnetic)
5. Removed synchronization from term lookup under IndexReader methods
termFreq(), termDocs() or termPositions() to improve
multi-threaded performance. (cutting)
6. Fix a bug where obsolete segment files were not deleted on Win32.
1.4 RC3
1. Fixed several search bugs introduced by the skipTo() changes in
release 1.4RC1. The index file format was changed a bit, so
collections must be re-indexed to take advantage of the skipTo()
optimizations. (Christoph Goller)
2. Added new Document methods, removeField() and removeFields().
(Christoph Goller)
3. Fixed inconsistencies with index closing. Indexes and directories
are now only closed automatically by Lucene when Lucene opened
them automatically. (Christoph Goller)
4. Added new class: FilteredQuery. (Tim Jones)
5. Added a new SortField type for custom comparators. (Tim Jones)
6. Lock obtain timed out message now displays the full path to the lock
file. (Daniel Naber via Erik)
7. Fixed a bug in SpanNearQuery when ordered. (Paul Elschot via cutting)
8. Fixed so that FSDirectory's locks still work when the
java.io.tmpdir system property is null. (cutting)
9. Changed FilteredTermEnum's constructor to take no parameters,
as the parameters were ignored anyway (bug #28858)
1.4 RC2
1. GermanAnalyzer now throws an exception if the stopword file
cannot be found (bug #27987). It now uses LowerCaseFilter
(bug #18410) (Daniel Naber via Otis, Erik)
2. Fixed a few bugs in the file format documentation. (cutting)
1.4 RC1
1. Changed the format of the .tis file, so that:
- it has a format version number, which makes it easier to
back-compatibly change file formats in the future.
- the term count is now stored as a long. This was the one aspect
of Lucene's file formats which limited index size.
- a few internal index parameters are now stored in the index, so
that they can (in theory) now be changed from index to index,
although there is not yet an API to do so.
These changes are back compatible. The new code can read old
indexes. But old code will not be able to read new indexes. (cutting)
2. Added an optimized implementation of TermDocs.skipTo(). A skip
table is now stored for each term in the .frq file. This only
adds a percent or two to overall index size, but can substantially
speed up many searches. (cutting)
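The idea behind a skip table can be sketched in plain Java. This is an illustration only, not Lucene's .frq encoding: the class name, the in-memory arrays and the skip interval of 4 are all assumptions. The last document number of every block is copied into a side array, so skipTo() can step over whole blocks before scanning individual postings.

```java
public class SkipTableSketch {
    static final int SKIP_INTERVAL = 4; // assumed value, chosen only for the demo

    private final int[] docs;     // sorted document numbers for one term
    private final int[] skipDocs; // last document number of each full block
    private int pos = -1;         // index of the current posting, -1 before the first
    public int scanned = 0;       // postings inspected one by one, for illustration

    public SkipTableSketch(int... sortedDocs) {
        docs = sortedDocs.clone();
        skipDocs = new int[docs.length / SKIP_INTERVAL];
        for (int i = 0; i < skipDocs.length; i++) {
            skipDocs[i] = docs[(i + 1) * SKIP_INTERVAL - 1];
        }
    }

    /** Advances to the first later document >= target; returns it, or -1 if none is left. */
    public int skipTo(int target) {
        // Step over every block whose last document is still below the target.
        int block = (pos + 1) / SKIP_INTERVAL;
        while (block < skipDocs.length && skipDocs[block] < target) {
            pos = (block + 1) * SKIP_INTERVAL - 1; // last index of the skipped block
            block++;
        }
        // Scan linearly inside the block that may contain the target.
        while (++pos < docs.length) {
            scanned++;
            if (docs[pos] >= target) {
                return docs[pos];
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        SkipTableSketch postings =
            new SkipTableSketch(2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24);
        System.out.println("skipTo(17) -> " + postings.skipTo(17));
        System.out.println("postings scanned: " + postings.scanned);
    }
}
```

The side array costs one extra entry per block, which is why a table of this kind adds little to the overall size of the postings.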
3. Restructured the Scorer API and all Scorer implementations to take
advantage of an optimized TermDocs.skipTo() implementation. In
particular, PhraseQuerys and conjunctive BooleanQuerys are
faster when one clause has substantially fewer matches than the
others. (A conjunctive BooleanQuery is a BooleanQuery where all
clauses are required.) (cutting)
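Why a conjunction benefits when one clause is rare can be sketched as follows. This is not Lucene's Scorer code: the Postings class is a hypothetical stand-in for a term's document list, and a binary search stands in for the skip table. Each round takes the largest current document as the candidate and asks every clause to skip straight to it, so the rare clause drives the common ones past most of their entries.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ConjunctionSketch {
    /** Cursor over a sorted array of document numbers (hypothetical helper). */
    public static final class Postings {
        private final int[] docs;
        private int pos = 0;

        public Postings(int... sortedDocs) {
            docs = sortedDocs.clone();
        }

        boolean exhausted() {
            return pos >= docs.length;
        }

        int doc() {
            return docs[pos];
        }

        /** Moves to the first document >= target at or after the current position. */
        void skipTo(int target) {
            int i = Arrays.binarySearch(docs, pos, docs.length, target);
            pos = i >= 0 ? i : -i - 1;
        }
    }

    /** Returns the documents that appear in every clause. */
    public static List<Integer> intersect(Postings... clauses) {
        List<Integer> hits = new ArrayList<>();
        if (clauses.length == 0) {
            return hits;
        }
        while (true) {
            // The candidate is the largest document any clause currently points at.
            int candidate = -1;
            for (Postings p : clauses) {
                if (p.exhausted()) {
                    return hits;
                }
                candidate = Math.max(candidate, p.doc());
            }
            boolean allMatch = true;
            for (Postings p : clauses) {
                p.skipTo(candidate);
                if (p.exhausted()) {
                    return hits;
                }
                if (p.doc() != candidate) {
                    allMatch = false;
                }
            }
            if (allMatch) {
                hits.add(candidate);
                for (Postings p : clauses) {
                    p.skipTo(candidate + 1);
                }
            }
        }
    }

    public static void main(String[] args) {
        List<Integer> hits = intersect(
            new Postings(1, 5, 9, 13, 17, 21),
            new Postings(5, 6, 7, 8, 9, 10, 21, 30),
            new Postings(9, 21));
        System.out.println(hits);
    }
}
```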
4. Added new class ParallelMultiSearcher. Combined with
RemoteSearchable this makes it easy to implement distributed
search systems. (Jean-Francois Halleux via cutting)
5. Added support for hit sorting. Results may now be sorted by any
indexed field. For details see the javadoc for
Searcher#search(Query, Sort). (Tim Jones via Cutting)
6. Changed FSDirectory to auto-create a full directory tree that it
needs by using mkdirs() instead of mkdir(). (Mladen Turk via Otis)
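The difference between the two java.io.File calls can be seen in a small standalone sketch. It does not use FSDirectory itself; the class name and the a/b/index path are made up for the demonstration. mkdir() fails when a parent directory is missing, while mkdirs() creates the whole tree.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;

public class MkdirsSketch {
    /** Returns { mkdir() result, mkdirs() result, directory exists afterwards }. */
    public static boolean[] tryBoth(File base) {
        // The parents "a" and "a/b" do not exist yet.
        File nested = new File(new File(new File(base, "a"), "b"), "index");
        boolean single = nested.mkdir(); // creates only the last path element
        boolean tree = nested.mkdirs();  // creates missing parents as well
        return new boolean[] { single, tree, nested.isDirectory() };
    }

    /** Runs tryBoth() inside a fresh temporary directory. */
    public static boolean[] demo() {
        try {
            return tryBoth(Files.createTempDirectory("mkdirs-sketch").toFile());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        boolean[] result = demo();
        System.out.println("mkdir():  " + result[0]);
        System.out.println("mkdirs(): " + result[1]);
        System.out.println("exists:   " + result[2]);
    }
}
```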
7. Added a new span-based query API. This implements, among other
things, nested phrases. See javadocs for details. (Doug Cutting)
8. Added new method Query.getSimilarity(Searcher), and changed
scorers to use it. This permits one to subclass a Query class so
that it can specify its own Similarity implementation, perhaps
one that delegates through that of the Searcher. (Julien Nioche
via Cutting)
9. Added MultiReader, an IndexReader that combines multiple other
IndexReaders. (Cutting)
10. Added support for term vectors. See Field#isTermVectorStored().
(Grant Ingersoll, Cutting & Dmitry)
11. Fixed the old bug with escaping of special characters in query
strings: http://issues.apache.org/bugzilla/show_bug.cgi?id=24665
(Jean-Francois Halleux via Otis)
12. Added support for overriding default values for the following,
using system properties:
- default commit lock timeout
- default maxFieldLength
- default maxMergeDocs
- default mergeFactor
- default minMergeDocs
- default write lock timeout
(Otis)
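The general pattern of a default that a system property can override looks roughly like this. The entry above does not give the property names, so "example.mergeFactor" and "example.writeLockTimeout" are hypothetical placeholders, as are the fallback values; consult the javadoc for the names Lucene actually reads.

```java
public class DefaultsSketch {
    /** Returns the integer system property, or the fallback if unset or malformed. */
    public static int intProperty(String name, int fallback) {
        return Integer.getInteger(name, fallback);
    }

    public static void main(String[] args) {
        // Hypothetical property names and fallback values, for illustration only.
        int mergeFactor = intProperty("example.mergeFactor", 10);
        int writeLockTimeout = intProperty("example.writeLockTimeout", 1000);
        System.out.println("mergeFactor = " + mergeFactor);
        System.out.println("writeLockTimeout = " + writeLockTimeout);
    }
}
```

Starting the JVM with -Dexample.mergeFactor=50 would change the first value without recompiling; an unset or unparsable property falls back to the built-in default.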
13. Changed QueryParser.jj to allow '-' and '+' within tokens:
http://issues.apache.org/bugzilla/show_bug.cgi?id=27491
(Morus Walter via Otis)
14. Changed so that the compound index format is used by default.
This makes indexing a bit slower, but vastly reduces the chances
of file handle problems. (Cutting)
1.3 final
1. Added catch of BooleanQuery$TooManyClauses in QueryParser to
throw ParseException instead. (Erik Hatcher)
2. Fixed a NullPointerException in Query.explain(). (Doug Cutting)
3. Added a new method IndexReader.setNorm(), that permits one to
alter the boosting of fields after an index is created.
4. Distinguish between the final position and length when indexing a
field. The length is now defined as the total number of tokens,
instead of the final position, as it was previously. Length is
used for score normalization (Similarity.lengthNorm()) and for
controlling memory usage (IndexWriter.maxFieldLength). In both of
these cases, the total number of tokens is a better value to use
than the final token position. Position is used in phrase
searching (see PhraseQuery and Token.setPositionIncrement()).
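The distinction can be illustrated with a small standalone sketch that does not use Lucene classes. It assumes zero-based positions and represents each token only by its position increment: an increment of 0 stacks a token on the previous one (for example a synonym), and an increment of 2 leaves a gap (for example where a stop word was removed).

```java
public class FieldLengthSketch {
    /** Returns { number of tokens, final token position } for the given increments. */
    public static int[] measure(int... positionIncrements) {
        int length = 0;    // total number of tokens, used for norms and length limits
        int position = -1; // position of the most recent token, used by phrase matching
        for (int increment : positionIncrements) {
            length++;
            position += increment;
        }
        return new int[] { length, position };
    }

    public static void main(String[] args) {
        // Five tokens: one stacked synonym (0) and one gap (2).
        int[] result = measure(1, 1, 0, 2, 1);
        System.out.println("length = " + result[0] + ", final position = " + result[1]);
    }
}
```

With many stacked tokens the final position understates the field's size, and with many gaps it overstates it, which is why the token count is the better input for normalization and memory limits.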
5. Fix StandardTokenizer's handling of CJK characters (Chinese,
Japanese and Korean ideograms). Previously contiguous sequences
were combined in a single token, which is not very useful. Now
each ideogram generates a separate token, which is more useful.
1.3 RC3
1. Added minMergeDocs in IndexWriter. This can be raised to speed
indexing without altering the number of files, at the cost of using
more memory. (Julien Nioche via Otis)
2. Fix bug #24786, in query rewriting. (bschneeman via Cutting)
3. Fix bug #16952, in demo HTML parser, skip comments in
javascript. (Christoph Goller)
4. Fix bug #19253, in demo HTML parser, add whitespace as needed to
output (Daniel Naber via Christoph Goller)
5. Fix bug #24301, in demo HTML parser, long titles no longer
hang things. (Christoph Goller)
6. Fix bug #23534, Replace use of file timestamp of segments file
with an index version number stored in the segments file. This
resolves problems when running on file systems with low-resolution
timestamps, e.g., HFS under MacOS X. (Christoph Goller)
7. Fix QueryParser so that TokenMgrError is not thrown, only
ParseException. (Erik Hatcher)
8. Fix some bugs introduced by change 11 of RC2. (Christoph Goller)
9. Fixed a problem compiling TestRussianStem. (Christoph Goller)
10. Cleaned up some build stuff. (Erik Hatcher)
1.3 RC2
1. Added getFieldNames(boolean) to IndexReader, SegmentReader, and
SegmentsReader. (Julie