Inductive Learning Algorithms and Representations for Text Categorization

Susan Dumais

Microsoft Research

One Microsoft Way

Redmond, WA 98052

sdumais@microsoft.com

John Platt

Microsoft Research

One Microsoft Way

Redmond, WA 98052

jplatt@microsoft.com

Mehran Sahami

Computer Science Department

Stanford University

Stanford, CA 94305-9010

sahami@cs.stanford.edu

David Heckerman

Microsoft Research

One Microsoft Way

Redmond, WA 98052

heckerma@microsoft.com

1. ABSTRACT

Text categorization – the assignment of natural language texts to one or more predefined categories based on their content – is an important component in many information organization and management tasks. We compare the effectiveness of five different automatic learning algorithms for text categorization in terms of learning speed, real-time classification speed, and classification accuracy. We also examine training set size, and alternative document representations. Very accurate text classifiers can be learned automatically from training examples. Linear Support Vector Machines (SVMs) are particularly promising because they are very accurate, quick to train, and quick to evaluate.

1.1 Keywords

Text categorization, classification, support vector machines, machine learning, information management.

2. INTRODUCTION

As the volume of information available on the Internet and corporate intranets continues to increase, there is growing interest in helping people better find, filter, and manage these resources. Text categorization – the assignment of natural language texts to one or more predefined categories based on their content – is an important component in many information organization and management tasks. Its most widespread application to date has been for assigning subject categories to documents to support text retrieval, routing and filtering.

Automatic text categorization can play an important role in a wide variety of more flexible, dynamic and personalized information management tasks as well: real-time sorting of email or files into folder hierarchies; topic identification to support topic-specific processing operations; structured search and/or browsing; or finding documents that match long-term standing interests or more dynamic task-based interests. Classification technologies should be able to support category structures that are very general, consistent across individuals, and relatively static (e.g., Dewey Decimal or Library of Congress classification systems, Medical Subject Headings (MeSH), or Yahoo!’s topic hierarchy), as well as those that are more dynamic and customized to individual interests or tasks (e.g., email about the CIKM conference).

In many contexts (Dewey, MeSH, Yahoo!, CyberPatrol), trained professionals are employed to categorize new items. This process is very time-consuming and costly, thus limiting its applicability. Consequently there is increased interest in developing technologies for automatic text categorization. Rule-based approaches similar to those used in expert systems are common (e.g., Hayes and

