*******************************************************************************
* Developers *
* *
* Do not update this change log manually. *
* Instead, make your commit messages informative, and these will be harvested *
* at integration time by the cvs2cl.pl script and added here. *
* For details on this script see: *
* http://www.red-bean.com/cvs2cl/ *
* *
* For guidelines on what to put in commit messages see: *
* http://www.red-bean.com/cvs2cl/changelogs.html *
* *
* This file has been trimmed of changes prior to version 1.4. For access to *
* earlier change histories, please consult htmlparser.cvs.sourceforge.net. *
* *
*******************************************************************************
Release Build 1.6 - 20060610
--------------------------------
2006-06-10 10:39 derrickoswald
* docs/faq.html:
add faq to docs
2006-06-05 19:53 derrickoswald
* src/org/htmlparser/tests/InstanceofPerformanceTest.java:
Remove InstanceofPerformanceTest, no longer needed.
2006-06-04 15:17 derrickoswald
* src/org/htmlparser/tests/AllTests.java,
src/org/htmlparser/tests/ParserTest.java,
src/org/htmlparser/tests/tagTests/BodyTagTest.java,
src/org/htmlparser/tests/tagTests/FormTagTest.java,
src/org/htmlparser/tests/tagTests/LabelTagTest.java,
src/org/htmlparser/tests/tagTests/LinkTagTest.java,
src/org/htmlparser/tests/tagTests/ObjectCollectionTest.java,
build.xml, src/org/htmlparser/Parser.java,
src/org/htmlparser/StringNodeFactory.java,
src/org/htmlparser/Tag.java,
src/org/htmlparser/tests/lexerTests/TagTests.java,
src/org/htmlparser/tests/scannersTests/ScriptScannerTest.java,
src/org/htmlparser/util/LinkProcessor.java,
src/org/htmlparser/util/SpecialHashtable.java,
src/org/htmlparser/util/Translate.java,
src/org/htmlparser/nodes/TagNode.java,
src/org/htmlparser/tags/LinkTag.java:
Eliminate deprecated classes and methods.
Removed nodeDecorator package, StringNodeFactory, LinkProcessor, SpecialHashtable,
and methods for linkData, non-Ex Attributes and FindAllNodesThatAre.
2006-06-01 23:14 derrickoswald
* src/org/htmlparser/: Parser.java, lexer/Lexer.java,
util/NodeTreeWalker.java:
Fix Javadoc warnings.
2006-06-01 22:43 derrickoswald
* src/org/htmlparser/: http/ConnectionManager.java,
lexer/Page.java:
implement RFE #1394144 handle deflate encoding
InflaterInputStream needed an additional Inflater argument.
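A minimal sketch of why the extra Inflater argument can matter, using only java.util.zip. The commit does not say how the Inflater was configured; this example assumes the server sends a raw deflate stream (no zlib header), which needs an Inflater created with nowrap = true. The class and method names (DeflateSketch, rawDeflate, inflateRaw) are hypothetical and are not part of htmlparser.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.Deflater;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.Inflater;
import java.util.zip.InflaterInputStream;

final class DeflateSketch {

    /** Compresses text as a raw deflate stream (nowrap = true, no zlib header). */
    static byte[] rawDeflate(String text) {
        Deflater deflater = new Deflater(Deflater.DEFAULT_COMPRESSION, true);
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (DeflaterOutputStream out = new DeflaterOutputStream(sink, deflater)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            deflater.end(); // runs after the stream has been closed and finished
        }
        return sink.toByteArray();
    }

    /** Inflates a raw deflate stream by handing an explicit Inflater to InflaterInputStream. */
    static String inflateRaw(byte[] compressed) {
        // The Inflater Javadoc asks for one extra dummy byte of input in nowrap mode.
        byte[] padded = Arrays.copyOf(compressed, compressed.length + 1);
        Inflater inflater = new Inflater(true); // assumption: raw deflate, so nowrap = true
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (InputStream in = new InflaterInputStream(new ByteArrayInputStream(padded), inflater)) {
            byte[] buffer = new byte[4096];
            int n;
            while ((n = in.read(buffer)) != -1) {
                sink.write(buffer, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            inflater.end();
        }
        return new String(sink.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String html = "<html><body>deflate me</body></html>";
        System.out.println(inflateRaw(rawDeflate(html)).equals(html));
    }
}
```

The one-argument InflaterInputStream constructor uses a default Inflater that expects a zlib header, so a raw stream generally has to be read through the two-argument form shown here.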
2006-06-01 21:48 derrickoswald
* src/org/htmlparser/: http/ConnectionManager.java,
http/HttpHeader.java, Parser.java:
implement RFE #1436082 Follow redirections with cookie processing
Use ConnectionManager.setRedirectionProcessingEnabled(true).
Probably only useful if combined with ConnectionManager.setCookieProcessingEnabled(true).
2006-05-30 22:10 derrickoswald
* src/org/htmlparser/: tests/utilTests/NodeListTest.java,
Node.java, nodes/AbstractNode.java, nodes/RemarkNode.java,
nodes/TagNode.java, nodes/TextNode.java, tags/CompositeTag.java,
tags/ScriptTag.java, util/NodeList.java:
implement task #93148 toHtml(boolean verbatim)
To avoid printing generated end tags use toHtml(true).
2006-05-29 23:11 derrickoswald
* src/org/htmlparser/Parser.java:
Update javadoc for new Parser constructor behaviour.
2006-05-29 22:53 derrickoswald
* src/org/htmlparser/Parser.java:
Allow passing HTML in the Parser constructor.
So now it allows HTML, a URL or a file name.
2006-05-29 21:30 derrickoswald
* src/org/htmlparser/http/ConnectionManager.java:
Handle bad cookie names.
Traps cookie name problems, but ignores any following cookies.
2006-05-29 21:07 derrickoswald
* src/org/htmlparser/: beans/StringBean.java,
tests/utilTests/BeanTest.java:
fix bug#1496863 StringBean collapse() adds extra whitespace
Keep collapsing state machine state as member variable.
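The idea of keeping the collapsing state in a member variable can be sketched as below. This is not the StringBean code; the class name, the enum and the state names are hypothetical, and the sketch only assumes that text arrives in several chunks and that runs of whitespace, including runs spanning chunk boundaries, should become a single space.

```java
final class WhitespaceCollapser {

    private enum State { START, IN_TEXT, PENDING_SPACE }

    // Kept as a field so the state survives between calls to collapse().
    private State state = State.START;

    /** Appends chunk to out, collapsing whitespace runs and dropping leading whitespace. */
    void collapse(StringBuilder out, String chunk) {
        for (int i = 0; i < chunk.length(); i++) {
            char c = chunk.charAt(i);
            if (Character.isWhitespace(c)) {
                // A separator is owed only if real text has already been emitted.
                if (state == State.IN_TEXT) {
                    state = State.PENDING_SPACE;
                }
            } else {
                if (state == State.PENDING_SPACE) {
                    out.append(' ');
                }
                out.append(c);
                state = State.IN_TEXT;
            }
        }
    }

    public static void main(String[] args) {
        WhitespaceCollapser collapser = new WhitespaceCollapser();
        StringBuilder out = new StringBuilder();
        for (String chunk : new String[] {"  Hello ", "  ", " world  "}) {
            collapser.collapse(out, chunk);
        }
        System.out.println("[" + out + "]");
    }
}
```

If the state were a local variable reset on every call, each chunk would start fresh and whitespace at chunk boundaries could be emitted more than once.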
Integration Build 1.6 - 20060527
--------------------------------
2006-05-27 10:36 derrickoswald
* src/org/htmlparser/: scanners/ScriptScanner.java,
tests/scannersTests/ScriptScannerTest.java:
fix bug #1457371 Script tag consumes too much from document being parsed
The default for ScriptScanner.STRICT was set to true.
If you want the older, more lax, script parsing, set it to false.
2006-05-27 10:03 derrickoswald
* src/org/htmlparser/: nodes/RemarkNode.java,
tests/tagTests/FormTagTest.java:
fix bug #1488951 RemarkNode.toPlainTextString() incorrect behaviour
RemarkNode.toPlainTextString() now always returns an empty string.
If you want the remark text, use getText().
2006-05-27 10:02 derrickoswald
* src/org/htmlparser/: lexer/Lexer.java,
tests/FunctionalTests.java, tests/lexerTests/LexerTests.java,
tests/parserHelperTests/RemarkNodeParserTest.java:
fix bug #1345049 HTMLParser should not terminate a comment with --->
Add static STRICT_REMARKS to Lexer class which, when true, follows the specification for remarks.
2006-05-16 05:14 ian_macfarlane
* src/org/htmlparser/filters/: AndFilter.java, OrFilter.java:
Incorrect grammar in javadoc. Changed [it's] to [its].
2006-05-16 05:11 ian_macfarlane
* src/org/htmlparser/filters/XorFilter.java:
New class that does XOR logic (to round out our NOT, AND and OR filters).
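XOR logic over a set of filters can be sketched with java.util.function.Predicate standing in for NodeFilter. This is an illustration only, not the XorFilter source: the XorSketch class is hypothetical, and accepting a value when an odd number of predicates match is an assumption about how XOR extends beyond two operands.

```java
import java.util.function.Predicate;

final class XorSketch {

    /** Returns a predicate that accepts a value matched by an odd number of the given predicates. */
    @SafeVarargs
    static <T> Predicate<T> xor(Predicate<T>... predicates) {
        return value -> {
            int matches = 0;
            for (Predicate<T> predicate : predicates) {
                if (predicate.test(value)) {
                    matches++;
                }
            }
            // Assumption: parity semantics, which reduces to plain XOR for two predicates.
            return matches % 2 == 1;
        };
    }

    public static void main(String[] args) {
        Predicate<String> startsWithA = s -> s.startsWith("a");
        Predicate<String> endsWithZ = s -> s.endsWith("z");
        Predicate<String> either = xor(startsWithA, endsWithZ);
        System.out.println(either.test("ab")); // only the first predicate matches
        System.out.println(either.test("az")); // both predicates match
    }
}
```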
2006-05-16 03:58 ian_macfarlane
* src/org/htmlparser/filters/: AndFilter.java, OrFilter.java:
Added constructors to OrFilter/AndFilter that take an array of NodeFilters.
2006-04-24 18:12 derrickoswald
* src/org/htmlparser/Parser.java:
Fix incorrect example.
2006-04-23 07:59 derrickoswald
* src/org/htmlparser/tags/TableHeader.java:
Change copyright as per request by P.I.M. Schrama
2006-04-17 20:08 derrickoswald
* src/org/htmlparser/tests/: lexerTests/KitTest.java,
PerformanceTest.java:
Move non-junit test code to Request For Enhancement (RFE) as attachments.
2006-04-17 19:45 derrickoswald
* src/org/htmlparser/tests/: ParserTestCase.java,
PerformanceTest.java:
Fix unit tests.
2006-04-17 09:53 derrickoswald
* src/org/htmlparser/tests/: ParserTest.java,
lexerTests/LexerTests.java, tagTests/InputTagTest.java,
tagTests/TableTagTest.java,
utilTests/CharacterTranslationTest.java:
Fix unit tests. Move failing test cases to downloads on corresponding RFE artifacts.
2006-04-17 09:51 derrickoswald
* bin/: translate.cmd, beanybaby.cmd, filterbuilder.cmd, lexer.cmd,
linkextractor.cmd, parser.cmd, sitecapturer.cmd,
stringextractor.cmd, thumbelina.cmd:
Allow execution from directory name containing spaces on Windows.
2006-04-14 18:18 derrickoswald
* build.xml, src/org/htmlparser/Parser.java,
src/org/htmlparser/http/ConnectionManager.java,
src/org/htmlparser/lexer/Lexer.java,
src/org/htmlparser/util/NodeList.java:
Cleanup to isolate htmllexer jar build.
2006-04-11 08:03 derrickoswald
* src/org/htmlparser/tests/: AllTests.java, MemoryTest.java:
Move failing unit test to RFE as a download.
2006-04-10 17:38 derrickoswald
* src/org/htmlparser/lexer/Page.java:
Fix Bug #1467712 Page#getCharset never works
Use Content-Type header field instead of connection's getContentType method.
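Reading the charset from a raw Content-Type header value might look like the sketch below. It is not the Page implementation; CharsetSketch and charsetOf are hypothetical names, and the fallback value is supplied by the caller. With a java.net.URLConnection, the raw value would typically come from getHeaderField("Content-Type").

```java
final class CharsetSketch {

    /** Extracts the charset parameter from a Content-Type value, or returns fallback. */
    static String charsetOf(String contentType, String fallback) {
        if (contentType == null) {
            return fallback;
        }
        // parts[0] is the media type; the remaining parts are parameters.
        String[] parts = contentType.split(";");
        for (int i = 1; i < parts.length; i++) {
            String param = parts[i].trim();
            int eq = param.indexOf('=');
            if (eq > 0 && param.substring(0, eq).trim().equalsIgnoreCase("charset")) {
                String value = param.substring(eq + 1).trim();
                // Strip optional surrounding double quotes.
                if (value.length() >= 2 && value.startsWith("\"") && value.endsWith("\"")) {
                    value = value.substring(1, value.length() - 1);
                }
                if (!value.isEmpty()) {
                    return value;
                }
            }
        }
        return fallback;
    }

    public static void main(String[] args) {
        System.out.println(charsetOf("text/html; charset=UTF-8", "ISO-8859-1"));
        System.out.println(charsetOf("text/html", "ISO-8859-1"));
    }
}
```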
2006-04-08 09:33 derrickoswald
* src/org/htmlparser/tests/utilTests/CharacterTranslationTest.java:
Typo.
2006-04-06 20:58 derrickoswald
* src/org/htmlparser/: lexer/Page.java,
tests/lexerTests/PageTests.java:
Fix Bug #1461473 Relative links starting with ?
Added overloaded methods taking boolean 'strict' flag on URL manipulators.
Default is loose interpretation like most bro