NLP：CoreNLP自然语言分析工具.zip_corenlp下载资源-CSDN文库

共2000个文件

java：2144个

out：185个

txt：121个

版权申诉

自然语言处理

java

人工智能

nlp

开发语言

22 浏览量 2022-04-21 10:56:13 上传评论收藏 49.18MB ZIP 举报

自然语言处理（NLP）是计算机科学领域的一个分支，它涉及如何使计算机理解和处理人类语言。Stanford CoreNLP是斯坦福大学开发的一款强大的NLP工具包，它为研究人员、开发者和数据分析人员提供了丰富的功能，使得处理文本数据变得更加便捷。这款工具完全由Java编写，因此在跨平台兼容性和性能上具有优势，特别适合大型项目和高并发环境。 CoreNLP的核心功能包括： 1. **分词**：将连续的文本划分为单独的词汇单元，这是NLP的基石。CoreNLP使用了精确的分词算法，可以处理各种复杂的语言现象。 2. **词性标注**：识别每个单词的语法角色，如名词、动词、形容词等，这对于理解句子结构至关重要。 3. **命名实体识别**：识别文本中的专有名词，如人名、地名、组织名等，这对于信息提取和知识图谱构建非常有用。 4. **句法分析**：分析句子的结构，包括依存关系解析和共指消解，揭示出句子内部的语义关系。 5. **情感分析**：评估文本的情感倾向，例如正面、负面或中立，有助于理解和挖掘用户情绪。 6. **核心ference解决**：识别和链接文本中的同指实体，帮助理解篇章的连贯性。 7. **事件抽取**：从文本中识别和提取关键事件，如交易、任命、竞赛等，这对于新闻摘要和信息监测有重要价值。 8. **文本分类**：可以训练模型对文本进行分类，例如垃圾邮件过滤、新闻主题分类等。 9. **实体链接**：将文本中的实体与知识库中的实体对应起来，增强信息的理解和检索。使用CoreNLP时，开发者可以通过Java API或者命令行接口来调用这些功能。其灵活性允许用户根据需求选择不同的处理管道，或者自定义新的模块。此外，由于Java的强类型特性，CoreNLP提供了良好的错误检查和异常处理机制，降低了开发中的潜在问题。在实际应用中，CoreNLP广泛应用于信息提取、问答系统、机器翻译、舆情分析、智能客服等多个领域。同时，它的开源性质也促进了社区的活跃，不断有新的特性和优化被加入到工具包中，使其始终保持在NLP领域的前沿地位。总结来说，Stanford CoreNLP是自然语言处理领域的一个强大工具，它提供的多种功能可以帮助开发者快速高效地处理和理解文本数据，尤其对于那些基于Java的项目，它更是一个不可或缺的资源。通过深入学习和熟练运用CoreNLP，我们可以构建更加智能的应用，推动人工智能技术的发展。

资源推荐

资源详情

资源评论

收起资源包目录

NLP：CoreNLP自然语言分析工具.zip （2000个子文件）

naturalli.css 2KB

calendarview.css 1KB

corenlp-brat.css 983B

sutime.css 215B

corenlp-brat.html 8KB

index.html 2KB

overview.html 367B

overview.html 108B

PTBLexer.java 5.27MB

Morpha.java 4.34MB

CoreNLPProtos.java 3.05MB

SpanishLexer.java 455KB

FrenchLexer.java 395KB

SUTime.java 166KB

GetPatternsFromDataMultiClass.java 154KB

UniversalEnglishGrammaticalStructureTest.java 144KB

ProtobufAnnotationSerializer.java 142KB

RadicalMap.java 133KB

GraphicalModelProto.java 131KB

SUTimeITest.java 124KB

SeqClassifierFlags.java 119KB

ColumnDataClassifier.java 118KB

CRFClassifier.java 115KB

NERFeatureFactory.java 112KB

EnglishGrammaticalStructureTest.java 112KB

TokenSequenceParser.java 104KB

Counters.java 99KB

UniversalEnglishGrammaticalRelations.java 99KB

Tree.java 97KB

StringUtils.java 97KB

EnglishTreebankParserParams.java 96KB

EnglishGrammaticalRelations.java 92KB

ExhaustivePCFGParser.java 86KB

EnglishGrammaticalStructure.java 83KB

MaxentTagger.java 82KB

TokenSequenceMatcherITest.java 80KB

QNMinimizer.java 79KB

TregexTest.java 78KB

SieveCoreferenceSystem.java 74KB

UniversalEnglishGrammaticalStructure.java 74KB

IOUtils.java 73KB

DefaultTeXHyphenData.java 70KB

StanfordCoreNLPServer.java 69KB

AbstractSequenceClassifier.java 68KB

SemanticGraph.java 68KB

SequencePattern.java 66KB

SplittingGrammarExtractor.java 64KB

StanfordCoreNLP.java 62KB

ConcatVectorProto.java 62KB

QuantifiableEntityNormalizer.java 62KB

LexicalizedParser.java 62KB

ArrayMath.java 61KB

CoreAnnotations.java 61KB

ScorePhrasesLearnFeatWt.java 60KB

Mention.java 60KB

CMMClassifier.java 59KB

Relation.java 58KB

PTBTokenizerTest.java 58KB

Mention.java 57KB

EnglishPTBTreebankCorrector.java 56KB

InputPanel.java 55KB

ValueFunctions.java 54KB

Redwood.java 53KB

PTB2TextLexer.java 53KB

DependencyParser.java 53KB

Options.java 52KB

ArabicLexer.java 52KB

ExtractorFramesRare.java 52KB

SequenceMatcher.java 51KB

ProtobufAnnotationSerializerSlowITest.java 51KB

Sentence.java 50KB

Expressions.java 50KB

ConstantsAndVariables.java 49KB

SequenceMatchRules.java 48KB

LinearClassifier.java 48KB

QuestionToStatementTranslator.java 47KB

SemanticGraphUtils.java 47KB

ChineseTreebankParserParams.java 47KB

Document.java 46KB

GrammaticalStructure.java 46KB

UniversalChineseGrammaticalRelations.java 45KB

CoNLLDocumentReader.java 45KB

TokensRegexNERAnnotator.java 45KB

BasicRelationFeatureFactory.java 44KB

ChunkAnnotationUtils.java 44KB

ClauseSplitterSearchProblem.java 43KB

RelationTripleSegmenter.java 43KB

SemgrexTest.java 42KB

CORSFilter.java 42KB

LinearClassifierFactory.java 42KB

MachineReading.java 42KB

SUTimeMain.java 42KB

ConcatVectorTableProto.java 41KB

JodaTimeUtils.java 41KB

TimeFormatter.java 41KB

DirectedMultiGraphTest.java 41KB

CorefRules.java 41KB

GrammaticalStructureConversionUtils.java 41KB

PTBTokenizer.java 40KB

TregexPattern.java 40KB

共 2000 条

Stanford typed dependencies manual

Marie-Catherine de Marneffe and Christopher D. Manning

September 2008

Revised for the Stanford Parser v. 3.7.0+ in September 2016

Please note that this manual describes the original Stanford Dependencies representation. As of ver-

sion 3.5.2, the default representation output by the Stanford Parser and Stanford CoreNLP is the new

Universal Dependencies (UD) representation, and we no longer actively develop the original Stanford

Dependencies representation. For a description of the UD representation, take a look at the Universal De-

pendencies documentation at http:/www.universaldependencies.org and the discussion of the enhanced

and enhanced++ UD representations by Schuster and Manning (2016).

1 Introduction

The Stanford typed dependencies representation was designed to provide a simple description of the

grammatical relationships in a sentence that can easily be understood and effectively used by people

without linguistic expertise who want to extract textual relations. In particular, rather than the phrase

structure representations that have long dominated in the computational linguistic community, it repre-

sents all sentence relationships uniformly as typed dependency relations. That is, as triples of a relation

between pairs of words, such as “the subject of distributes is Bell.” Our experience is that this simple,

uniform representation is quite accessible to non-linguists thinking about tasks involving information

extraction from text and is effective in relation extraction applications.

Here is an example sentence:

Bell, based in Los Angeles, makes and distributes electronic, computer and building prod-

ucts.

For this sentence, the Stanford Dependencies (SD) representation is:

nsubj(makes-8, Bell-1)

nsubj(distributes-10, Bell-1)

vmod(Bell-1, based-3)

nn(Angeles-6, Los-5)

prep in(based-3, Angeles-6)

root(ROOT-0, makes-8)

conj and(makes-8, distributes-10)

amod(products-16, electronic-11)

conj and(electronic-11, computer-13)

amod(products-16, computer-13)

conj and(electronic-11, building-15)

Bell

based

partmod

distributes

nsubj

products

dobj

makes

nsubj

conj_and

dobj

Angeles

prep_in

Los

electronic

amod

building

amod

computer

amod

conj_andconj_and

Figure 1: Graphical representation of the Stanford Dependencies for the sentence: Bell, based in Los

Angeles, makes and distributes electronic, computer and building products.

amod(products-16, building-15)

dobj(makes-8, products-16)

dobj(distributes-10, products-16)

These dependencies map straightforwardly onto a directed graph representation, in which words in

the sentence are nodes in the graph and grammatical relations are edge labels. Figure 1 gives the graph

representation for the example sentence above.

Document overview: This manual provides documentation for the set of dependencies deﬁned for

English. There is also a Stanford Dependency representation available for Chinese, but it is not further

discussed here. Starting in 2014, there has been work to extend Stanford Dependencies to be generally

applicable cross-linguistically. Initial work appeared in de Marneffe et al. (2014), and the current guide-

lines for Universal Dependencies (UD) can be found at http://www.universaldependencies.org. For

SD, Section 2 of the manual deﬁnes the grammatical relations and the taxonomic hierarchy over them

appears in section 3. This is then followed by a description of the several variant dependency repre-

sentations available, aimed at different use cases (section 4), some details of the software available for

generating Stanford Dependencies (section 5), and references to further discussion and use of the SD

representation (section 6).

2 Deﬁnitions of the Stanford typed dependencies

The current representation contains approximately 50 grammatical relations (depending slightly on the

options discussed in section 4). The dependencies are all binary relations: a grammatical relation holds

between a governor (also known as a regent or a head) and a dependent. The grammatical relations are

deﬁned below, in alphabetical order according to the dependency’s abbreviated name (which appears in

the parser output). The deﬁnitions make use of the Penn Treebank part-of-speech tags and phrasal labels.

acomp: adjectival complement

An adjectival complement of a verb is an adjectival phrase which functions as the complement (like an

object of the verb).

She looks very beautiful

nsubj

acomp

advmod

advcl: adverbial clause modiﬁer

An adverbial clause modiﬁer of a VP or S is a clause modifying the verb (temporal clause, consequence,

conditional clause, purpose clause, etc.).

“The accident happened as the night was falling” advcl(happened, falling)

“If you know who did it, you should tell the teacher” advcl(tell, know)

“He talked to him in order to secure the account” advcl(talked, secure)

advmod: adverb modiﬁer

An adverb modiﬁer of a word is a (non-clausal) adverb or adverb-headed phrase that serves to modify

the meaning of the word.

“Genetically modiﬁed food” advmod(modiﬁed, genetically)

“less often” advmod(often, less)

agent: agent

An agent is the complement of a passive verb which is introduced by the preposition “by” and does the

action. This relation only appears in the collapsed dependencies, where it can replace prep by, where

appropriate. It does not appear in basic dependencies output.

“The man has been killed by the police” agent(killed, police)

“Effects caused by the protein are important” agent(caused, protein)

amod: adjectival modiﬁer

An adjectival modiﬁer of an NP is any adjectival phrase that serves to modify the meaning of the NP.

“Sam eats red meat” amod(meat, red)

“Sam took out a 3 million dollar loan” amod(loan, dollar)

“Sam took out a $ 3 million loan” amod(loan, $)

appos: appositional modiﬁer

An appositional modiﬁer of an NP is an NP immediately to the right of the ﬁrst NP that serves to deﬁne

or modify that NP. It includes parenthesized examples, as well as deﬁning abbreviations in one of these

structures.

Sam , my brother , arrived

appos

Bill ( John ’s cousin )

appos

The Australian Broadcasting Corporation ( ABC )

appos

aux: auxiliary

An auxiliary of a clause is a non-main verb of the clause, e.g., a modal auxiliary, or a form of “be”, “do”

or “have” in a periphrastic tense.

Reagan has died

aux

He should leave

aux

auxpass: passive auxiliary

A passive auxiliary of a clause is a non-main verb of the clause which contains the passive information.

“Kennedy has been killed” auxpass(killed, been)

aux(killed,has)

“Kennedy was/got killed” auxpass(killed, was/got)

cc: coordination

A coordination is the relation between an element of a conjunct and the coordinating conjunction word of

the conjunct. (Note: different dependency grammars have different treatments of coordination. We take

one conjunct of a conjunction (normally the ﬁrst) as the head of the conjunction.) A conjunction may

also appear at the beginning of a sentence. This is also called a cc, and dependent on the root predicate

of the sentence.

“Bill is big and honest” cc(big, and)

“They either ski or snowboard” cc(ski, or)

“And then we left.” cc(left, And)

ccomp: clausal complement

A clausal complement of a verb or adjective is a dependent clause with an internal subject which func-

tions like an object of the verb, or adjective. Clausal complements for nouns are limited to complement

clauses with a subset of nouns like “fact” or “report”. We analyze them the same (parallel to the analysis

of this class as “content clauses” in Huddleston and Pullum 2002). Such clausal complements are usually

ﬁnite (though there are occasional remnant English subjunctives).

“He says that you like to swim” ccomp(says, like)

“I am certain that he did it” ccomp(certain, did)

“I admire the fact that you are honest” ccomp(fact, honest)

conj: conjunct

A conjunct is the relation between two elements connected by a coordinating conjunction, such as “and”,

“or”, etc. We treat conjunctions asymmetrically: The head of the relation is the ﬁrst conjunct and other

conjunctions depend on it via the conj relation.

“Bill is big and honest” conj(big, honest)

“They either ski or snowboard” conj(ski, snowboard)

cop: copula

A copula is the relation between the complement of a copular verb and the copular verb. (We normally

take a copula as a dependent of its complement; see the discussion in section 4.)

“Bill is big” cop(big, is)

“Bill is an honest man” cop(man, is)

csubj: clausal subject

A clausal subject is a clausal syntactic subject of a clause, i.e., the subject is itself a clause. The governor

of this relation might not always be a verb: when the verb is a copular verb, the root of the clause is the

complement of the copular verb. In the two following examples, “what she said” is the subject.

“What she said makes sense” csubj(makes, said)

“What she said is not true” csubj(true, said)

csubjpass: clausal passive subject

A clausal passive subject is a clausal syntactic subject of a passive clause. In the example below, “that

she lied” is the subject.

“That she lied was suspected by everyone” csubjpass(suspected, lied)

dep: dependent

A dependency is labeled as dep when the system is unable to determine a more precise dependency

relation between two words. This may be because of a weird grammatical construction, a limitation in

the Stanford Dependency conversion software, a parser error, or because of an unresolved long distance

dependency.

“Then, as if to show that he could, . . . ” dep(show, if)

det: determiner

A determiner is the relation between the head of an NP and its determiner.

“The man is here” det(man, the)

“Which book do you prefer?” det(book, which)

discourse: discourse element

This is used for interjections and other discourse particles and elements (which are not clearly linked to

the structure of the sentence, except in an expressive way). We generally follow the guidelines of what

the Penn Treebanks count as an INTJ. They deﬁne this to include: interjections (oh, uh-huh, Welcome),

ﬁllers (um, ah), and discourse markers (well, like, actually, but not you know).

Iguazu is in Argentina :)

discourse

dobj: direct object

The direct object of a VP is the noun phrase which is the (accusative) object of the verb.

“She gave me a raise” dobj(gave, raise)

“They win the lottery” dobj(win, lottery)

评论收藏

内容反馈

版权申诉

方案互联

粉丝: 18
资源: 926

NLP：CoreNLP自然语言分析工具.zip

NLP：利用自然语言处理技术进行情感分析.zip

NLP.zip_NLP_nlp处理docx_python nlp_自然语言处理

NLP：深度学习自然语言处理工具.zip

CoreNLP一套Java核心自然语言处理工具，用于标记化、句子分词、NER分析、相互引用、情感分析等.zip

NLP：自然语言问答系统.zip

自然语言处理与NLP项目.zip

NLP，自然语言分析，小语种语料包

自然语言处理NLP课程资料合集-74份.zip

NLTK，NLP，自然语言处理，自然语言分析，语料包

情感分析资料，NLP，自然语言分析

nlp分析工具是一款基于NLP开源算法和模型库（jieba、spacy、paddlenlp）对文本数据进行向量化，然.zip

NLP，自然语言分析，影评语料包，英语

NLP，自然语言分析，演讲语料包

NLP，自然语言分析，语料包

NLP，自然语言学习资源

NLP，自然语言处理资源

NLP，自然语言处理，语料包

awesome-nlp, 专门用于自然语言处理的资源列表( 自然语言处理).zip

stanford-corenlp-full-2018-10-05.zip

stanford-corenlp.jar.zip_Stanford corenlp_jar_zip

stanford-corenlp-4.2.2.zip

stanford-corenlp-full-2015-12-09.zip

中科大自然语言处理考试试卷.zip

stanford-corenlp-4.5.6.zip

stanford-corenlp-4.3.2.zip

斯坦福NLP相关jar包2018

stanfordcorenlp 4.2.0 安装包

stanford-corenlp-3.9.2-models.jar

PyPI 官网下载 | stanford-corenlp-python-3.3.6-0.linux-x86_64.tar.gz

最新资源