Mining decision rules on data streams in the presence of concept drifts

Cheng Jung Tsai, Chien I. Lee, Wei Pang Yang

Research output: Contribution to journalArticle

23 Citations (Scopus)

Abstract

In a database, the concept of an example might change along with time, which is known as concept drift. When the concept drift occurs, the classification model built by using the old dataset is not suitable for predicting a new dataset. Therefore, the problem of concept drift has attracted a lot of attention in recent years. Although many algorithms have been proposed to solve this problem, they have not been able to provide users with a satisfactory solution to concept drift. That is, the current research about concept drift focuses only on updating the classification model. However, real life decision makers might be very interested in the rules of concept drift. For example, doctors desire to know the root causes behind variation in the causes and development of disease. In this paper, we propose a concept drift rule mining tree, called CDR-Tree, to accurately discover the underlying rule governing concept drift. The main contributions of this paper are: (a) we address the problem of mining concept-drifting rules which has not been considered in previously developed classification schemes; (b) we develop a method that can accurately mine rules governing concept drift; (c) we develop a method that should classification models be required, can efficiently and accurately generate such models via a simple extraction procedure rather than constructing them anew; and (d) we propose two strategies to reduce the complexity of concept-drifting rules mined by our CDR-Tree.

Original languageEnglish
Pages (from-to)1164-1178
Number of pages15
JournalExpert Systems with Applications
Volume36
Issue number2 PART 1
DOIs
Publication statusPublished - 2009 Jan 1

All Science Journal Classification (ASJC) codes

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Cite this

Tsai, Cheng Jung ; Lee, Chien I. ; Yang, Wei Pang. / Mining decision rules on data streams in the presence of concept drifts. In: Expert Systems with Applications. 2009 ; Vol. 36, No. 2 PART 1. pp. 1164-1178.
@article{035bd6eeb0144ec6a989ee40ef1d644c,
title = "Mining decision rules on data streams in the presence of concept drifts",
abstract = "In a database, the concept of an example might change along with time, which is known as concept drift. When the concept drift occurs, the classification model built by using the old dataset is not suitable for predicting a new dataset. Therefore, the problem of concept drift has attracted a lot of attention in recent years. Although many algorithms have been proposed to solve this problem, they have not been able to provide users with a satisfactory solution to concept drift. That is, the current research about concept drift focuses only on updating the classification model. However, real life decision makers might be very interested in the rules of concept drift. For example, doctors desire to know the root causes behind variation in the causes and development of disease. In this paper, we propose a concept drift rule mining tree, called CDR-Tree, to accurately discover the underlying rule governing concept drift. The main contributions of this paper are: (a) we address the problem of mining concept-drifting rules which has not been considered in previously developed classification schemes; (b) we develop a method that can accurately mine rules governing concept drift; (c) we develop a method that should classification models be required, can efficiently and accurately generate such models via a simple extraction procedure rather than constructing them anew; and (d) we propose two strategies to reduce the complexity of concept-drifting rules mined by our CDR-Tree.",
author = "Tsai, {Cheng Jung} and Lee, {Chien I.} and Yang, {Wei Pang}",
year = "2009",
month = "1",
day = "1",
doi = "10.1016/j.eswa.2007.11.034",
language = "English",
volume = "36",
pages = "1164--1178",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "2 PART 1",

}

Mining decision rules on data streams in the presence of concept drifts. / Tsai, Cheng Jung; Lee, Chien I.; Yang, Wei Pang.

In: Expert Systems with Applications, Vol. 36, No. 2 PART 1, 01.01.2009, p. 1164-1178.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Mining decision rules on data streams in the presence of concept drifts

AU - Tsai, Cheng Jung

AU - Lee, Chien I.

AU - Yang, Wei Pang

PY - 2009/1/1

Y1 - 2009/1/1

N2 - In a database, the concept of an example might change along with time, which is known as concept drift. When the concept drift occurs, the classification model built by using the old dataset is not suitable for predicting a new dataset. Therefore, the problem of concept drift has attracted a lot of attention in recent years. Although many algorithms have been proposed to solve this problem, they have not been able to provide users with a satisfactory solution to concept drift. That is, the current research about concept drift focuses only on updating the classification model. However, real life decision makers might be very interested in the rules of concept drift. For example, doctors desire to know the root causes behind variation in the causes and development of disease. In this paper, we propose a concept drift rule mining tree, called CDR-Tree, to accurately discover the underlying rule governing concept drift. The main contributions of this paper are: (a) we address the problem of mining concept-drifting rules which has not been considered in previously developed classification schemes; (b) we develop a method that can accurately mine rules governing concept drift; (c) we develop a method that should classification models be required, can efficiently and accurately generate such models via a simple extraction procedure rather than constructing them anew; and (d) we propose two strategies to reduce the complexity of concept-drifting rules mined by our CDR-Tree.

AB - In a database, the concept of an example might change along with time, which is known as concept drift. When the concept drift occurs, the classification model built by using the old dataset is not suitable for predicting a new dataset. Therefore, the problem of concept drift has attracted a lot of attention in recent years. Although many algorithms have been proposed to solve this problem, they have not been able to provide users with a satisfactory solution to concept drift. That is, the current research about concept drift focuses only on updating the classification model. However, real life decision makers might be very interested in the rules of concept drift. For example, doctors desire to know the root causes behind variation in the causes and development of disease. In this paper, we propose a concept drift rule mining tree, called CDR-Tree, to accurately discover the underlying rule governing concept drift. The main contributions of this paper are: (a) we address the problem of mining concept-drifting rules which has not been considered in previously developed classification schemes; (b) we develop a method that can accurately mine rules governing concept drift; (c) we develop a method that should classification models be required, can efficiently and accurately generate such models via a simple extraction procedure rather than constructing them anew; and (d) we propose two strategies to reduce the complexity of concept-drifting rules mined by our CDR-Tree.

UR - http://www.scopus.com/inward/record.url?scp=56349151977&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=56349151977&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2007.11.034

DO - 10.1016/j.eswa.2007.11.034

M3 - Article

AN - SCOPUS:56349151977

VL - 36

SP - 1164

EP - 1178

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 2 PART 1

ER -