没有合适的资源?快使用搜索试试~ 我知道了~
学位论文-—微博舆情管理平台:数据分析系统的设计与实现.doc
2 下载量 61 浏览量
2023-07-08
21:10:00
上传
评论
收藏 623KB DOC 举报
温馨提示
试读
62页
学位论文-—微博舆情管理平台:数据分析系统的设计与实现.doc
资源推荐
资源详情
资源评论
北京交通大学毕业设计(论文)
毕业设计(论文)
中 文 题 目 : 微 博 舆 情 管 理 平
台
数据分析系统的设计与实现
英 文 题 目 : MicroBlog Public Opinion
Management Platform: The Design
and Implementation of the Data
Analysis System
1
中文摘要
随着网络技术应用的普及和发展,舆情的传播方式和传播速度都发生
了根本性变化, 网络舆情对人类的社会状态产生了全方位的影响,微博舆
情则是网络舆情的重要组成部分,它的特点有:直接性,突发性,偏差性,
丰富性和互动性。
本文以微博消息为研究对象,研究了微博消息传播的特点与模型,通
过对抓取数据的分析发现了微博传播的单向性,便捷性,背对脸等特点,
还有微博意见领袖在微博传播中的重要作用,微博热点的产生规律。根据
对数据分析的结果提出了趋势分析的算法。利用空间向量模型完成对微博
内容的结构数据化,利用 K-means 算法完成对微博消息的聚类分析,找到
所要分析的某类微博内容,进而在这类微博中找出微博消息意见领袖,提
出微博意见领袖影响力评估算法,WeiboRank 算法,并结合算法完成了微
博消息预警模块的实现,初步实现了微博舆情管理平台的数据预警分析功
能。
关键词:微博舆情 文本聚类 趋势分析
北京交通大学毕业设计(论文)
Abstract
Along with the universal application and rapid development of
network technology, the approaches that the net-mediated public
sentiment spread have been fundamentally changed. The net-mediated
public sentiment has exerted huge influence on the way that the society
operates. As the one of the most significant parts of the net-mediated
public sentiment, the public sentiment which is produced and spread by
the microblog has several important characters, such as directness,
immediacy, deviation, variability, interactivity.
Taking the microblog messages as our investigating subject, this
paper aimed to do research on the characteristics and models of
delivering messages between microblog users, Through the analysis of the
capture data found unidirectional, micro-blog communication
convenience, back on the face and other characteristics, and raised an
effective algorithm to sort these kinds of messages. Using the spatial
vector model, the K-means algorithm did cluster analysis on microblog
messages, and found out the opinion leaders among tremendous messages.
Then, an influential estimation algorithm of the microblog opinion leaders
was raised,WeiboRank algorithm. Together with the estimation
algorithm, we also achieved the early warning part and some basic data
warning analysis functions on the whole microblog-mediated public
sentiment platform.
Key words:microblog-mediated public sentiment, text clustering,
trend analysis
北京交通大学毕业设计(论文)
目 录
一、 概述......................................................................................................1
1.1 课题背景与研究意义.........................................................................1
1.1.1 课题背景...................................................................................1
1.1.2 研究现状...................................................................................3
1.1.3 研究意义...................................................................................3
1.2 论文结构..............................................................................................4
二、微博消息传播模型....................................................................................4
2.1 微博消息传播的特点..........................................................................4
2.2 微博用户状态......................................................................................6
2.3 微博意见领袖......................................................................................7
2.4 微博传播模型......................................................................................9
三、微博舆情管理平台的设计与实现..........................................................12
3.1 微博舆情管理平台的总体流程........................................................12
3.2 数据分析系统设计流程....................................................................13
四、微博舆情管理平台的实现......................................................................14
4.1 样本选取与数据来源........................................................................14
4.2 微博数据转化....................................................................................15
4.3 微博文本聚类....................................................................................17
4.3.1 文本聚类定义.........................................................................17
4.3.2 机器学习.................................................................................18
4.3.3K-means 算法...........................................................................19
4.4 微博意见领袖重要性评估................................................................21
4.4.1 PageRank 算法........................................................................21
4.4.2 WeiboRank 算法.....................................................................22
4.4.3 算法对比................................................................................23
4.5 微博舆情预警模块............................................................................25
4.5.1 微博舆情预警.........................................................................25
北京交通大学毕业设计(论文)
4.5.2 趋势分析模块.........................................................................26
4.6 趋势分析结果比较............................................................................29
五、结论与展望..............................................................................................31
5.1 系统不足............................................................................................31
5.2 未来展望............................................................................................32
5.2.1 改进预期.................................................................................32
5.2.2 新增功能.................................................................................32
5.3 结束语................................................................................................33
参考文献..........................................................................................................34
附录Ⅰ: 翻译原文......................................................................................35
Cluster Analysis:Basic Concepts and Algorithms............................................35
1Overview ........................................................................................................40
1.1.1What Is Cluster Analysis? ................................................................40
1.1.2 Different Types of Clusterings ........................................................41
1.1.3Different Types of Clusters ..............................................................44
2.Road Map ......................................................................................................47
• K-means .................................................................................................47
• Agglomerative Hierarchical Clustering .................................................48
• DBSCAN ...............................................................................................48
附录Ⅱ: 中文翻译......................................................................................48
聚类分析:基本概念及算法..........................................................................48
1 概述...............................................................................................................51
1.1.1 什么是聚类分析?.........................................................................51
1.1.2 不同类型的群集合.........................................................................52
1.1.3 簇的不同类型.................................................................................53
2.路线图...........................................................................................................56
•K-means 算法 .........................................................................................56
•凝聚层次聚类.........................................................................................56
剩余61页未读,继续阅读
资源评论
zzzzl333
- 粉丝: 698
- 资源: 7万+
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功