没有合适的资源?快使用搜索试试~ 我知道了~
时间序列异常检测研究综述
21 下载量 62 浏览量
2021-03-27
18:31:28
上传
评论 2
收藏 202KB PDF 举报
温馨提示
时间序列是一类重要的时间数据对象,可以很容易地从科学和金融应用中获得,并且时间序列的异常检测已成为当前的热门研究课题。 这项调查旨在提供有关异常检测研究的结构化和全面的概述。 在本文中,我们讨论了异常的定义,并根据每种技术采用的基本方法将现有技术分为不同的类别。 对于每个类别,我们都会确定该类别中该技术的优缺点。 然后,我们简要介绍一下最近的代表性方法。 此外,我们还指出了有关多元时间序列异常的一些关键问题。 最后,讨论了有关异常检测的一些建议,并总结了未来的研究趋势,有望对时间序列和其他相关领域的研究者有所帮助。
资源推荐
资源详情
资源评论
978-1-5090-6126-6/16/$31.00 2016 IEEE 426
A SURVEY OF RESEARCH ON ANOMALY DETECTION FOR TIME SERIES
HU-SHENG WU
Materiel Engineering College, Armed Police Force Engineering University, Xi'an 710086, China
E-MAIL: wuhusheng0421@163.com
Abstract:
Time series is an important class of temporal data objects
and it can be easily obtained from scientific and financial
applications, and anomaly detection for time series is
becoming a hot research topic recently. This survey tries to
provide a structured and comprehensive overview of the
research on anomaly detection. In this paper, we have
discussed the definition of anomaly and grouped existing
techniques into different categories based on the underlying
approach adopted by each technique. And for each category,
we identify the advantages and disadvantages of the
techniques in that category. Then, we provide a briefly
discussion on the representative methods recently.
Furthermore, we also point out some key issues about
multivariate time series anomaly. Finally, some suggestions
about anomaly detection are discussed and future research
trends are also summarized, which is hopefully beneficial to
the researchers of time series and other relative domains.
Keywords:
Time series; anomaly detection; big data; data mining;
multivariate time series
1. Introduction
Date have proliferated over with the development of
information technology. According to the latest research of
University of Southern California in 2016, since 1980s, the
scale of data has increased sharply and doubled every year
or even a few months. All of the global data information
has reached 295EB until 2007, 1.8ZB until 2011, and will
reach more than 40ZB in 2020. Therefore, big data era has
arrived. And how to abstract potential information from the
plentiful data needs to be solved urgently [1]. A big part of
these data is time series. The so-called time series is a
sequence records according with the chronological order.
Time series is so widely applied in many fields, such as
military, economy and scientific observations that it has
aroused great attention concern from researchers.
For normal time series data, although the number of
abnormal data is very small, it does not mean that the
abnormal data is not important. On the contrary, some
important data may be hided behind these few abnormal
data. In medical field, the abnormal heart beat should be
detected so that doctors can find the disease in time by the
ECG observation. In addition, the anomaly detection of
time series can be widely used in the engine condition
monitoring, network intrusion detection, anti-money
laundering, network public opinion monitoring, credit card
fraud, stock market analysis, improper tax behavior
monitoring, major construction projects inspection, natural
disaster analysis and so on. Obviously, the study of
abnormal detection of time series has important theoretical
value and practical significance.
2. Definition of time series anomaly detection
Time series widely exist in various large financial,
medical, engineering and social science database. There are
two features between time series and other data types.
Firstly, the time attribute, the records of each variable must
have time dimension and should be arranged in
chronological order, and some date types like market basket
data do not have such attributes. Secondly, the sequence
attribute, the record values are a continuous one in a certain
period of time and in certain laws. Time series can be
divided into univariate time series and multivariate time
series according to the number of variables, the specific
definition is as follows:
Definition 1 Time Series A sequence records
according with the chronological order
(1), (2), , ( ), , ( )
i i i i
S v v v t v n …… ……
is defined as the time series.
In which t (t=1,2,…,n)
is defined as time,
i
(
1,2, ,im
)
is defined as variable.
()
i
vt
is defined as the record of the
variable
i
on time
t
. When m=1,
S
is defined as
univariate time series (UTS). When m>1,
S
is defined as
multivariate time series (MTS).
At the same time, in order to make the anomaly
detection of MTS meaningful, MTS needs to meet the three
requirements as follows: (1) For the MTS data set, the
variable dimension of each sequence is identical, and the
variables which have the same meaning are one-to-one
match. (2) For one MTS sample, the recording time of each
资源评论
weixin_38667835
- 粉丝: 6
- 资源: 937
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- C#ASP.NET口腔门诊会员病历管理系统源码 门诊会员管理系统源码数据库 SQL2008源码类型 WebForm
- 灰狼优化算法(Grey Wolf Optimizer,GWO)是一种群智能优化算法
- JAVA的SpringBoot宠物医院管理系统源码数据库 MySQL源码类型 WebForm
- STM32Fxx英文参考手册 + Jlink下载教程
- 贪心算法 - 数据结构与算法
- C#ASP.NET网络进销存管理系统源码数据库 SQL2008源码类型 WebForm
- Liny 的浏览器为 HarmonyOS NEXT 而构建,旨在为各种性能水平的设备提供一个浏览器的轻量之选
- FLASH批量导入PSD文件
- 529f675667cf31af3454bd60644e631a.mp4
- 此文件夹包含用于分析和转换 .d.ts文件,目的是将 ArkUI 界面暴露给更多语言和运行时
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功