没有合适的资源?快使用搜索试试~ 我知道了~
人工智能-数据挖掘-Hadoop海量数据挖掘在宽带客户信息推送系统中的应用研究.pdf
1.该资源内容由用户上传,如若侵权请联系客服进行举报
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
版权申诉
0 下载量 107 浏览量
2022-06-25
12:42:55
上传
评论
收藏 2.27MB PDF 举报
温馨提示
试读
67页
人工智能-数据挖掘-Hadoop海量数据挖掘在宽带客户信息推送系统中的应用研究.pdf
资源推荐
资源详情
资源评论
ABSTRACT
II
ABSTRACT
With the rapid growth of various broadband access network users, as well as an
increasingly competitive market, each big telecommunication operators began
increasing emphasis business information promotion service for broadband users.
Due to the business information needed to promote the huge and multifarious, how to
handle the relevant data efficiently, and improve the efficiency of the push and
pertinence, becomes the effective ways to each big telecommunication operators to
consolidate market competitiveness. For example, studying how the fixed-line
broadband services for different user groups, to realize the differentiation of telecom
products recommended service. And applied to the telecom broadband customers
information push system, is one of the highlights in agro-scientific research in the
current telecom operators.
Through this article analysis and research the product promotion data and push
mode from the existing telecommunications push systems, to familiar with the
telecom recommend product features and deficiencies of the existing data push mode.
And then go into details of association rule data mining theory. And in-depth research
and analysis based on multidimensional and project compression for telecom
recommended product data characteristics of the Apriori algorithm; Then, in view of
the exponential growth products recommended amount of data, that leads to the
algorithm performance limitations of problem. Proposed based on cloud computing
MR_AprioriDpc parallel mining algorithm of Hadoop platform, and applied to
products recommended by the association rules mining. Finally, the mining results are
applied to broadband customer information push system, and design and implement
the relevant function module.
In this paper, the main innovation:
(1) Based on the telecom product recommendations, design with association
rules analysis, push the task function of custom and the custom push three custom
push function module, and in telecom broadband customers information push system
implementation;
万方数据
ABSTRACT
III
(2) Aiming at the shortcomings of the Apriori algorithm, in-depth research and
analysis based on multidimensional and project compression improved Apriori
algorithm; And according to its performance limitations of huge amounts of data
problem, put forward the Hadoop platform based on cloud computing parallelization
MR_AprioriDpc algorithm;
(3) Recommended products association rules mining model is established, and
recommend MR_AprioriDpc algorithm is applied to the telecom products in
association rules mining.
Keywords: broadband customers; information push; Hadoop; MapReduce; data
mining; AprioriDpc algorithm; MR_AprioriDpc algorithm
万方数据
目 录
IV
目 录
第 1 章 引言 ............................................................................................................... 1
1.1 研究背景及课题意义 ..................................................................................... 1
1.2 课题来源 ......................................................................................................... 2
1.3 国内外研究现状 ............................................................................................. 2
1.4 本文研究内容 ................................................................................................. 3
1.5 论文组织结构 ................................................................................................. 4
第 2 章 需求分析与相关知识 ................................................................................... 6
2.1 需求分析 ......................................................................................................... 6
2.2.1 推送模式分析 ....................................................................................... 6
2.2.2 推送数据分析 ....................................................................................... 6
2.2 电信推送系统数据库服务器 ....................................................................... 10
2.3 数据挖掘相关知识 ....................................................................................... 11
2.3.1 数据挖掘的定义 ................................................................................. 11
2.3.2 数据挖掘的过程 ................................................................................. 11
2.3.3 数据挖掘的方法 ................................................................................. 13
2.3.4 数据仓库 ............................................................................................. 14
2.3.5 多维数据模型 ..................................................................................... 14
2.4 关联规则理论 ............................................................................................... 15
2.4.1 关联规则基本概念 ............................................................................. 15
2.4.2 经典 Apriori 算法................................................................................ 16
2.5 本章小结 ....................................................................................................... 21
第 3 章 改进的 Apriori 算法 ................................................................................... 22
3.1 概况 ............................................................................................................... 22
3.2 基于多维项目压缩的 Apriori 改进算法...................................................... 23
3.2.1 算法思想 ............................................................................................. 23
3.2.2 算法伪代码 ......................................................................................... 25
万方数据
目 录
V
3.2.3 算法实例 ............................................................................................. 27
3.2.4 测试与分析 ......................................................................................... 28
3.3 本章小结 ....................................................................................................... 31
第 4 章 基于 Hadoop 的 Apriori 改进算法并行化 ................................................. 32
4.1 Hadoop 分布式架构 ....................................................................................... 32
4.1.1 Hadoop 概述 ......................................................................................... 32
4.1.2 Hadoop 特性 ......................................................................................... 33
4.2 MapReduce 分布式计算模型 ........................................................................ 34
4.2.1 MapReduce 概述 .................................................................................. 34
4.2.2 MapReduce 工作流程 .......................................................................... 36
4.2.3 MapReduce 中的关键技术 .................................................................. 37
4.3 HDFS 分布式文件系统 ................................................................................. 38
4.3.1 HDFS 系统架构概述 ........................................................................... 38
4.3.2 HDFS 的关键技术 ............................................................................... 39
4.4 并行 Apriori 改进算法(MR_AprioriDpc)设计 ....................................... 39
4.4.1 实现思路 ............................................................................................. 39
4.4.2 算法伪代码 ......................................................................................... 42
4.4.3 测试与分析 ......................................................................................... 44
4.5 本章小结 ....................................................................................................... 45
第 5 章 MR_AprioriDpc 算法的应用与实现 .......................................................... 47
5.1 功能需求 ....................................................................................................... 47
5.2 系统架构 ....................................................................................................... 48
5.2.1 系统体系结构 ..................................................................................... 48
5.2.2 系统功能模块 ..................................................................................... 49
5.2.3 系统主界面 ......................................................................................... 50
5.3 定制推送模块设计与实现 ........................................................................... 51
5.3.1 关联规则挖掘模型 ............................................................................. 51
5.3.2 数据仓库构建 ..................................................................................... 52
5.3.3 关联规则挖掘功能设计与实现 ......................................................... 54
万方数据
目 录
VI
5.3.4 推送任务定制功能设计与实现 ......................................................... 56
5.3.5 定制推送统计功能设计与实现 ......................................................... 60
5.4 本章小结 ....................................................................................................... 61
第 6 章 总结与展望 ................................................................................................. 62
6.1 总结 ............................................................................................................... 62
6.2 展望 ............................................................................................................... 62
致 谢 ......................................................................................................................... 63
参考文献 ..................................................................................................................... 64
万方数据
剩余66页未读,继续阅读
资源评论
programyp
- 粉丝: 89
- 资源: 1万+
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功