1
A Survey on Concept Drift Adaptation
JO
˜
AO GAMA, University of Porto, Portugal
INDR
˙
E
ˇ
ZLIOBAIT
˙
E, Aalto University, Finland
ALBERT BIFET, Yahoo! Research Barcelona, Spain
MYKOLA PECHENIZKIY, Eindhoven University of Technology, the Netherlands
ABDELHAMID BOUCHACHIA, Bournemouth University, UK
Concept drift primarily refers to an online supervised learning scenario when the relation between the in-
put data and the target variable changes over time. Assuming a general knowledge of supervised learning
in this paper we characterize adaptive learning process, categorize existing strategies for handling concept
drift, discuss the most representative, distinct and popular techniques and algorithms, discuss evaluation
methodology of adaptive algorithms, and present a set of illustrative applications. This introduction to the
concept drift adaptation presents the state of the art techniques and a collection of benchmarks for re-
searchers, industry analysts and practitioners. The survey aims at covering the different facets of concept
drift in an integrated way to reflect on the existing scattered state-of-the-art.
Categories and Subject Descriptors: I.2.6 [Artificial Intelligence]: Learning
General Terms: Design, Algorithms, Performance
Additional Key Words and Phrases: concept drift, change detection, adaptive learning
ACM Reference Format:
Gama, J.,
ˇ
Zliobait
˙
e, I., Bifet, A., Pechenizkiy, M., and Bouchachia, A. 2013. A Survey on Concept Drift Adap-
tation. ACM Comput. Surv. 1, 1, Article 1 (January 2013), 35 pages.
DOI = 10.1145/0000000.0000000 http://doi.acm.org/10.1145/0000000.0000000
1. INTRODUCTION
Our digital universe is rapidly growing. The volume of data generated in 2012 has
been estimated to surpass 2.8 zetabytes (2.8 trillion gigabytes) as reported in the IDC
survey [Gantz and Reinsel 2012]. Efficient and effective tools and analysis methods for
dealing with the ever-growing amount of data in different applications and fields are of
paramount need. Very often data comes in the form of streams rendering its analysis
and processing even more resource demanding.
Traditionally in data mining data is first collected and then processed in an offline
mode. For instance, predictive models are trained using historical data given as a set
of pairs (input, output). Models trained in such a way can be afterwards applied for
predicting the output for new unseen input data. However, streaming data can not be
processed similarly because data comes continuously over time and possibly is never-
ending. Accommodating such data in the machine’s main memory is impractical and
often infeasible. Hence, only an online processing is suitable. In this case, predictive
models can be trained either incrementally by continuous update or by retraining us-
ing recent batches of data.
In dynamically changing and non-stationary environments, the data distribution
can change over time yielding the phenomenon of concept drift [Schlimmer and
Granger 1986; Widmer and Kubat 1996]. The real concept drift
1
refers to changes in
the conditional distribution of the output (i.e., target variable) given the input (input
features), while the distribution of the input may stay unchanged. A typical example
of the real concept drift is a change in user’s interests when following an online news
stream. Whilst the distribution of the incoming news documents often remains the
same, the conditional distribution of the interesting (and thus not interesting) news
1
The term real refers to one particular type of concept drift. It doesn’t mean that other types of drift are not
concept drifts.
ACM Computing Surveys, Vol. 1, No. 1, Article 1, Publication date: January 2013.