没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
“A Comparative Study of Various Data Mining Techniques: Statistics, Decision Trees, and Neural Networks” 是一篇分析和比较不同数据挖掘技术的研究论文。该文献探讨了三种主要的数据挖掘方法——统计方法、决策树和神经网络——在不同应用场景中的优缺点、适用性和性能差异。以下是该论文的概述: 背景与动机: 数据挖掘的重要性:随着数据量的爆炸式增长,如何从大量数据中提取有价值的信息成为关键问题。数据挖掘技术广泛应用于各个领域,如金融、医疗、市场营销等,帮助发现隐藏的模式、预测未来趋势并支持决策。 方法选择的挑战:在数据挖掘中,有多种方法可供选择,但不同方法在处理不同类型的数据或问题时表现各异。研究人员和从业者需要了解这些技术的优劣,以便在特定的应用场景中选择最合适的方法。 研究内容: 比较对象:论文主要比较了三种数据挖掘技术: 统计方法:包括线性回归、逻辑回归、时间序列分析等。这些方法基于概率和统计学理论,适用于具有明确统计假设和模型的场景。 决策树:如C4.5、CART等。
资源推荐
资源详情
资源评论
International Journal of Computer Applications Technology and Research
Volume 5– Issue 3, 172 - 175, 2016, ISSN:- 2319–8656
www.ijcat.com 172
A Comparative Study of Various Data Mining
Techniques: Statistics, Decision Trees and Neural
Networks
Balar Khalid
Department of Computer Science
Hassan II University-FMPC
Casablanca, Morocco
Naji Abdelwahab
Department of Computer Science
Hassan II University-ENSET
Mohammedia, Morocco
Abstract: In this paper we focus on some techniques for solving data mining tasks such as: Statistics, Decision Trees and Neural
Networks. The new approach has succeed in defining some new criteria for the evaluation process, and it has obtained valuable results
based on what the technique is, the environment of using each techniques, the advantages and disadvantages of each technique, the
consequences of choosing any of these techniques to extract hidden predictive information from large databases, and the methods of
implementation of each technique. Finally, the paper has presented some valuable recommendations in this field.
Keywords: Data mining, Statistics, Logistic Regression, Decision Trees and Neural Networks.
1. INTRODUCTION
Extraction useful information from data is very far easier from
collecting them. Therefore many sophisticated techniques,
such as those developed in the multi- disciplinary field data
mining are applied to the analysis of the datasets. One of the
most difficult tasks in data mining is determining which of the
multitude of available data mining technique is best suited to a
given problem. Clearly, a more generalized approach to
information extraction would improve the accuracy and cost
effectiveness of using data mining techniques.
Therefore, this paper proposes a new direction based on
evaluation techniques for solving data mining tasks, by using
three techniques: Statistics, Decision Tree and Neural
Networks.
The aim of this new approach is to study those techniques and
their processes and to evaluate data mining techniques on the
basis of: the suitability to a given problem, the advantages and
disadvantages, and the consequences of choosing any
technique, [5].
2. DATA MINING TOOLS
Data mining, the extraction of hidden predictive information
from large databases, is a powerful new technology with great
potential to help companies focus on the most important
information in their data warehouses [6]. Data mining tools
predict future trends and behaviors allowing businesses to
make proactive knowledge driven decisions. Data mining
tools can answer business question that traditionally were too
time consuming to resolve.
They scour database for hidden patterns, finding predictive
information that experts may miss because it lies outside their
expectations.
3. SELECTED DATA MINING
TECHNIQUES
A large number of modeling techniques are labeled "data
mining" techniques [7]. This section provides a short review
of a selected number of these techniques. Our choice was
guided the focus on the most currently used models. The
review in this section only highlights some of the features of
different techniques and how they influence, and benefit from.
We do not present a complete exposition of the mathematical
details of the algorithms, or their implementations.
Although various different techniques are used for different
purposes those that are of interest in the present context [4].
Data mining techniques which are selected are Statistics,
Decision Tree and Neural Networks.
3.1 Statistical Techniques
By strict definition "statistics" or statistical techniques are not
data mining. They were being used long before the term data
mining was coined. However, statistical techniques are driven
by the data and are used to discover patterns and build
predictive models.
Today people have to deal with up to terabytes of data and
have to make sense of it and glean the important patterns from
it. Statistics can help greatly in this process by helping to
answer several important questions about their data: what
patterns are there in database?, what is the chance that an
event will occur?, which patterns are significant?, and what is
a high level summary of the data that gives some idea of what
is contained in database?
In statistics, prediction is usually synonymous with regression
of some form. There are a variety of different types of
regression in statistics but the basic idea is that a model is
created that maps values from predictors in such a way that
the lowest error occurs in making a prediction.
The simplest form of regression is Simple Linear Regression
that just contains one predictor and a prediction. The
relationship between the two can be mapped on a two
dimensional space and the records plotted for the prediction
values along the Y axis and the predictor values along the X
axis. The simple linear regression model then could be viewed
as the line that minimized the error rate between the actual
prediction value and the point on the line [2].
Adding more predictors to the linear equation can produce
more complicated lines that take more information into
资源评论
gllgool
- 粉丝: 183
- 资源: 7
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 建筑工程消防验收现场评定表.docx
- 交叉检查记录表(急救分中心).doc
- 交叉检查记录表(社区服务中心和乡镇卫生院).doc
- 居家适老化改造补贴实施细则产品功能表.docx
- 井田勘探探矿权出让收益评估报告( 模板).doc
- 髋关节功能丧失程度评定表.docx
- 买断式回购应急确认对话报价申请单.docx
- 每月质量安全调度会议纪要.docx
- 每月电梯安全调度会议纪要.docx
- 每周电梯质量安全排查报告.docx
- 每周电梯安全排查报告.docx
- 每月质量安全调度会议纪要表.docx
- 排水管网情况表.docx
- 聘请服务审批表(表格模板).docx
- 培训班次计划表.doc
- 密闭式输液表格、注意事项.docx
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功