没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
A Brief Tutorial on Database Queries, Data Mining, andOLAPLutz HamelDepartment of Computer Science and StatisticsUniversity of Rhode IslandTyler HallKingston, RI 02881Tel: (401) 480-9499Fax: (401) 874-4617Email: hamel@cs.uri.eduA Brief Tutorial on Database Queries, Data Mining, andOLAPLutz Hamel, University of Rhode Island, USAINTRODUCTIONModern, commercially available relational database systems now routinely includea cadre of data retrieval and analysis tools. Here we shed some light on theint
资源推荐
资源详情
资源评论
A Brief Tutorial on Database Queries, Data Mining, and
OLAP
Lutz Hamel
Department of Computer Science and Statistics
University of Rhode Island
Tyler Hall
Kingston, RI 02881
Tel: (401) 480-9499
Fax: (401) 874-4617
Email: hamel@cs.uri.edu
A Brief Tutorial on Database Queries, Data Mining, and
OLAP
Lutz Hamel, University of Rhode Island, USA
INTRODUCTION
Modern, commercially available relational database systems now routinely include
a cadre of data retrieval and analysis tools. Here we shed some light on the
interrelationships between the most common tools and components included in today’s
database systems: query language engines, data mining components, and on-line
analytical processing (OLAP) tools. We do so by pair-wise juxtaposition which will
underscore their differences and highlight their complementary value.
BACKGROUND
Today’s commercially available relational database systems now routinely include
tools such as SQL database query engines, data mining components, and OLAP (Craig,
Vivona, & Bercovitch, 1999; Oracle, 2001; Scalzo, 2003; Seidman, 2001). These tools
allow developers to construct high powered business intelligence (BI) applications which
are not only able to retrieve records efficiently but also support sophisticated analyses
such as customer classification and market segmentation. However, with powerful tools
so tightly integrated with the database technology understanding the differences between
these tools and their comparative advantages and disadvantages becomes critical for
effective application development. From the practitioner’s point of view questions like
the following often arise:
• Is running database queries against large tables considered data mining?
• Can data mining and OLAP be considered synonymous?
• Is OLAP simply a way to speed up certain SQL queries?
The issue is being complicated even further by the fact that data analysis tools are often
implemented in terms of data retrieval functionality. Consider the data mining models in
the Microsoft SQL server which are implemented through extensions to the SQL
database query language (e.g. predict join) (Seidman, 2001) or the proposed SQL
extensions to enable decision tree classifiers (Sattler & Dunemann, 2001). OLAP cube
definition is routinely accomplished via the data definition language (DDL) facilities of
SQL by specifying either a star or snowflake schema (Kimball, 1996).
MAIN THRUST OF THE CHAPTER
The following sections contain the pair wise comparisons between the tools and
components considered in this chapter.
Database Queries vs. Data Mining
Virtually all modern, commercial database systems are based on the relational
model formalized by Codd in the 60s and 70s (Codd, 1970) and the SQL language (Date,
2000) which allows the user to efficiently and effectively manipulate a database. In this
model a database table is a representation of a mathematical relation, that is, a set of
items that share certain characteristics or attributes. Here, each table column represents an
剩余14页未读,继续阅读
资源评论
weixin_38692162
- 粉丝: 4
- 资源: 904
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功