An Algorithm to Mine Normalized Weighted Sequential Patterns Using a Prefix-projected Database

Authors

  • János Demetrovics Janos Demetrovics Institute for Computer Science and Control (SZTAKI) Hungarian Academy of Sciences
  • Vu Duc Thi Vu Duc Thi Information Technology Institute Vietnam National University (VNU) Hanoi, Vietnam
  • Tran Huy Duong Tran Huy Duong Institute of Information Technology Vietnam Academy of Science and Technology (VAST) Hanoi, Vietnam

DOI:

https://doi.org/10.55630/sjc.2015.9.105-122

Keywords:

Data Mining, Frequent Sequential Patterns, Weighted, Sequential Patterns

Abstract

Sequential pattern mining is an important subject in data mining with broad
applications in many different areas. However, previous sequential mining
algorithms mostly aimed to calculate the number of occurrences (the support)
without regard to the degree of importance of different data items.

In this paper, we propose to explore the search space of subsequences
with normalized weights. We are not only interested in the number
of occurrences of the sequences (supports of sequences), but also concerned
about importance of sequences (weights). When generating subsequence
candidates we use both the support and the weight of the candidates while
maintaining the downward closure property of these patterns which allows
to accelerate the process of candidate generation.

Downloads

Published

2016-04-18

Issue

Section

Articles