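Design and Implementation of an Association Rule Mining System Based on the Apriori Algorithm
Author:
Degree program: Computer Science and Technology
Degree type: Bachelor of Engineering
Supervisor (title):
Submission date:
Design and Implementation of an Association Rule Mining System Based on the Apriori Algorithm
Abstract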
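As the information age advances, the volume of information is growing geometrically.
People find it increasingly difficult to extract useful information from this mass of
data, and uncovering the patterns hidden behind it is harder still. Data mining is a
new technology for extracting useful information from large volumes of data, and
association rule mining is one data mining method. This paper describes in detail the
design and development of an association rule mining system based on the Apriori
algorithm. The system builds on the classical Apriori algorithm and converts the
transaction database into a bitmap matrix, which greatly improves search efficiency,
and it can mine frequent itemsets and association rules separately.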
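The paper is organized as follows. It first introduces the origin, definition, and
applications of data mining, and then presents the basic concepts of association rule
mining. Next, it analyzes the system requirements and proposes a design, followed by
the concrete implementation of the system. Finally, the system is tested by mining
herb pairs and herb groups from a database of traditional Chinese medicine
prescriptions, which verifies its correctness and practicality.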
Keywords:
Data mining; association rules; Apriori algorithm
The Design and Implementation of an Association Rule Mining
System Based on the Apriori Algorithm
Abstract
With the development of the information era, the volume of information is growing
geometrically. Extracting valuable information from it is becoming increasingly
difficult, and uncovering the patterns hidden in it is harder still. Data mining is a
new technology for extracting valuable information from large volumes of data, and
association rule mining is one data mining method. This paper describes in detail the
design and development of an association rule mining system based on the Apriori
algorithm. The system builds on the classical Apriori algorithm and converts the
Chinese medicine prescription database into a bitmap matrix, which greatly improves
search efficiency, and it can mine frequent itemsets and association rules
separately.
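The bitmap idea mentioned above can be illustrated with a minimal sketch. It is not the system's actual implementation: the function names (to_bitmaps, support_count, apriori, rules), the use of Python integers as bit columns, and the toy transactions in the usage note are all assumptions made for illustration. Each item is mapped to a bit column over the transactions, so the support of an itemset is the number of bits left after AND-ing its columns, which avoids rescanning the database for every candidate.

```python
from itertools import combinations

def to_bitmaps(transactions):
    """Map each item to an int whose bit t is set when transaction t contains it."""
    bitmaps = {}
    for t, items in enumerate(transactions):
        for item in items:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << t)
    return bitmaps

def support_count(itemset, bitmaps):
    """AND the bit columns of all items and count the transactions that remain."""
    items = iter(itemset)
    mask = bitmaps[next(items)]
    for item in items:
        mask &= bitmaps[item]
    return bin(mask).count("1")

def apriori(transactions, min_count):
    """Return {frozenset: support count} for every frequent itemset."""
    bitmaps = to_bitmaps(transactions)
    frequent = {}
    level = [frozenset([i]) for i in sorted(bitmaps)
             if support_count([i], bitmaps) >= min_count]
    k = 1
    while level:
        for itemset in level:
            frequent[itemset] = support_count(itemset, bitmaps)
        level_set = set(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        candidates = {a | b for a in level for b in level if len(a | b) == k + 1}
        # Prune step: every k-subset must be frequent, then check support.
        level = [c for c in candidates
                 if all(frozenset(s) in level_set for s in combinations(c, k))
                 and support_count(c, bitmaps) >= min_count]
        k += 1
    return frequent

def rules(frequent, min_conf):
    """Derive rules lhs -> rhs with confidence = count(itemset) / count(lhs)."""
    found = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                conf = count / frequent[lhs]
                if conf >= min_conf:
                    found.append((lhs, itemset - lhs, conf))
    return found
```

With the hypothetical transactions [{"a","b","c"}, {"a","b"}, {"a","c"}, {"b","c"}, {"a","b","c"}] and a minimum support count of 3, apriori returns the three single items and the three pairs, while the triple is dropped because it appears in only two transactions. Frequent itemsets and rules are produced by separate functions, mirroring the separation described in the abstract.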
The paper is organized as follows. First, it introduces the origin, definition,
and applications of data mining. Second, it presents the basic concepts of association
rule mining. Third, it analyzes the system requirements, proposes a design, and
implements the system. Finally, it tests the system by mining Chinese medicine
groups from a Chinese medicine prescription database, which demonstrates that the
system is correct and practical.
Key words:
Data mining; Apriori; Chinese traditional medicine