TITLE

FREQUENT ITEMSET MINING IN TRANSACTIONAL DATA STREAMS BASED ON QUALITY CONTROL AND RESOURCE ADAPTATION

AUTHOR(S)
Chandrika, J.; Kumar, K. R. Ananda
PUB. DATE
November 2012
SOURCE
International Journal of Data Mining & Knowledge Management Proc;Nov2012, Vol. 2 Issue 6, p1
SOURCE TYPE
Academic Journal
DOC. TYPE
Article
ABSTRACT
The increasing importance of data stream arising in a wide range of advanced applications has led to the extensive study of mining frequent patterns. Mining data streams poses many new challenges amongst which are the one-scan nature, the unbounded memory requirement and the high arrival rate of data streams. Further the usage of memory resources should be taken care of regardless of the amount of data generated in the stream. In this work we extend the ideas of existing proposals to ensure efficient resource utilization and quality control. The proposed algorithm RAQ-FIG (Resource Adaptive Quality Assuring Frequent Item Generation) accounts for the computational resources like memory available and dynamically adapts the rate of processing based on the available memory. It will compute the recent approximate frequent itemsets by using a single pass algorithm. The empirical results demonstrate the efficacy of the proposed approach for finding recent frequent itemsets from a data stream.
ACCESSION #
84322556

 

Related Articles

  • Frequent Contiguous Pattern Mining Algorithms for Biological Data Sequences. Rajasekaran, S.; Arockiam, L. // International Journal of Computer Applications;Jun2014, Vol. 95, p15 

    Transaction sequences in market-basket analysis have large set of alphabets with small length, whereas bio-sequences have small set of alphabets of long length with gap. There is the difference in pattern finding algorithms of these two sequences. The chances of repeatedly occurring small...

  • K-means Sequence Clustering Algorithm based on Top-k Maximal Frequent Sequence Patterns. Di Wu; Jiadong Ren // International Journal of Advancements in Computing Technology;Nov2012, Vol. 4 Issue 20, p405 

    As one of the most important problems in data mining, sequence clustering has become an active researching area in recent years. In this paper, K-means sequence clustering algorithm TMFK-means based on top-k maximal frequent sequence patterns is proposed. According to a novel algorithm...

  • The arules R-Package Ecosystem: Analyzing Interesting Patterns from Large Transaction Data Sets. Hahsler, Michael; Chelluboina, Sudheer; Hornik, Kurt; Buchta, Christian // Journal of Machine Learning Research;Jun2011, Vol. 12 Issue 6, p2021 

    This paper describes the ecosystem of R add-on packages developed around the infrastructure provided by the package arules. The packages provide comprehensive functionality for analyzing interesting patterns including frequent itemsets, association rules, frequent sequences and for building...

  • Discovering High-Order Periodic Patterns. Jiong Yang; Wei Wang; Yu, Philips S. // Knowledge & Information Systems;May2004, Vol. 6 Issue 3, p243 

    Discovery of periodic patterns in time series data has become an active research area with many applications. These patterns can be hierarchical in nature, where a higher-level pat-tern may consist of repetitions of lower-level patterns. Unfortunately, the presence of noise may prevent these...

  • Accuracy of generalized context patterns in the context based sequential patterns mining. Ziembiński, Radosław Z. // Control & Cybernetics;2011, Vol. 40 Issue 3, p585 

    A context pattern is a frequent subsequence mined from the context database containing set of sequences. This kind of sequential patterns and all elements inside them are described by additional sets of context attributes e.g. continuous ones. The contexts describe circumstances of transactions...

  • A Web Log Frequent Sequential Pattern Mining Algorithm Linked WAP-Tree. Zhengyu Zhu; Meiyu Zheng; Yihan Wu // Journal of Software (1796217X);Oct2015, Vol. 10 Issue 10, p1228 

    Web log frequent sequence pattern mining is an important field of Web log mining and of discovering interactive frequent sequence pattern between users and websites. It is easy to analyse users' access sequence patterns by utilizing these sequence patterns and it is meaningful to build an...

  • A technique for extracting behavioral sequence patterns from GPS recorded data. Vu, Thi; Lee, Yang; Bui, The // Computing;Mar2014, Vol. 96 Issue 3, p163 

    The mobile wireless market has been attracting many customers. Technically, the paradigm of anytime-anywhere connectivity raises previously unthinkable challenges, including the management of million of mobile customers, their profiles, the profiles-based selective information dissemination, and...

  • Choosing the Best Bayesian Classifier: An Empirical Study. Moran, Stuart; Hey, Yulan; Kecheng Liu // IAENG International Journal of Computer Science;Sep2009, Vol. 36 Issue 4, p322 

    It is often difficult for data miners to know which classifier will perform most effectively in any given dataset. Usually an understanding of learning algorithms is combined with detailed domain knowledge of the dataset at hand to lead to the choice of a classifier. We propose an empirical...

  • DATA MINING.  // Radiology Today;May2013, Vol. 14 Issue 5, p12 

    The article focuses on data mining that is a potentially paradigm-shifting technology which serves as a useful and efficient tool for improving patient care and streamlines business processes. Data mining depends upon important infrastructure and spadework before anything of value can be...

Share

Read the Article

Courtesy of THE LIBRARY OF VIRGINIA

Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics