Multi-objective genetic algorithms based automated clustering for fuzzy association rules mining

Alhajj, Reda; Kaya, Mehmet
December 2008
Journal of Intelligent Information Systems;Dec2008, Vol. 31 Issue 3, p243
Academic Journal
Researchers realized the importance of integrating fuzziness into association rules mining in databases with binary and quantitative attributes. However, most of the earlier algorithms proposed for fuzzy association rules mining either assume that fuzzy sets are given or employ a clustering algorithm, like CURE, to decide on fuzzy sets; for both cases the number of fuzzy sets is pre-specified. In this paper, we propose an automated method to decide on the number of fuzzy sets and for the autonomous mining of both fuzzy sets and fuzzy association rules. We achieve this by developing an automated clustering method based on multi-objective Genetic Algorithms (GA); the aim of the proposed approach is to automatically cluster values of a quantitative attribute in order to obtain large number of large itemsets in less time. We compare the proposed multi-objective GA based approach with two other approaches, namely: 1) CURE-based approach, which is known as one of the most efficient clustering algorithms; 2) Chien et al. clustering approach, which is an automatic interval partition method based on variation of density. Experimental results on 100 K transactions extracted from the adult data of USA census in year 2000 showed that the proposed automated clustering method exhibits good performance over both CURE-based approach and Chien et al.’s work in terms of runtime, number of large itemsets and number of association rules.


Related Articles

  • An Analysis of Particle Swarm Optimization with Data Clustering-Technique for Optimization in Data Mining. Khan, Amreen; Bawane, N. G.; Bodkhe, Sonali // International Journal on Computer Science & Engineering;2010, p2223 

    Data clustering is a popular approach for automatically finding classes, concepts, or groups of patterns. Clustering aims at representing large datasets by a fewer number of prototypes or clusters. It brings simplicity in modeling data and thus plays a central role in the process of knowledge...

  • Multicore Processing for Clustering Algorithms. Rao, Rekhansh; Nagwanshi, Kapil Kumar; Dubey, Sipi // International Journal of Computer Technology & Applications;2012, Vol. 3 Issue 2, p555 

    Data Mining algorithms such as classification and clustering are the future of computation, though multidimensional data-processing is required. People are using multicore processors with GPU's. Most of the programming languages doesn't provide multiprocessing facilities and hence wastage of...

  • Neural Networks in Multi-Relational Data Mining. RaviSankar, M.; PremChand, P. // International Journal of Research & Reviews in Ad hoc Networks;Mar2011, Vol. 1 Issue 1, p19 

    Neural networks are non-parametric, robust, and exhibit good learning and generalization capabilities in data-rich environments. Multi-relational data mining framework is based on the search for interesting patterns in the relational database. Multi-relational data mining algorithms search a...

  • Clustering Objects With Using PCM-Algorithm on the Basis of the Interval Type 2 Fuzzy Set and Genetic Algorithm.  // Bulletin of PNU;2010, Vol. 18 Issue 3, p53 

    For problems of statistic processing and teaching without teacher cluster analysis is often used. Among clustering methods the most popular is c-means method. Using fuzzy-set theory together with c-means method, namely, fuzzy c-means algorithm, gives good results. In this article the problem of...

  • Fuzzy K-mean Clustering Via Random Forest For Intrusiion Detection System. Bharti, Kusum; Jain, Shweta; Shukla, Sanyam // International Journal on Computer Science & Engineering;2010, Vol. 2 Issue 6, p2197 

    Due to continuous growth of the internet technology, there is need to establish security mechanism. So for achieving this objective various NIDS has been propsed. Datamining is one of the most effective techniques used for intrusion detection. This work evaluates the performance of unsupervised...

  • Gene Ontology-based Knowledge Discovery Through Fuzzy Cluster Analysis. Pal, Nikhil; Keller, James M.; Popescu, Mihail; Bezdek, James C.; Mitchell, Joyce A.; Huband, Jacalyn // Neural, Parallel & Scientific Computations;Sep/Dec2005, Vol. 13 Issue 3/4, p337 

    The article discusses the use of fuzzy cluster analysts for gene ontology-based knowledge discovery. Linear combinations of order statistics of gene product data were presented to both crisp and fuzzy clustering algorithms. It also demonstrated how fuzzy partition matrices generated by...

  • Genetic Fuzzy Data Mining With Divide-And-Conquer Strategy. Kannan, M.; Yasodha, P.; Srividhya, V. // International Journal of Computer Science Engineering & Technolo;Feb2011, Vol. 1 Issue 1, p30 

    Data mining is most commonly used in attempts to induce association rules from transaction data. Most previous studies focused on binary-valued transaction data. Transaction data in real-world applications, however, usually consist of quantitative values. This paper, thus, proposes a fuzzy...

  • MULTI-DENSITY DBSCAN USING REPRESENTATIVES: MDBSCAN-UR. Ahmed, Rwand; El-Zaza, Eman; Ashour, Wesam // Computing & Information Systems;Oct2011, Vol. 15 Issue 2, p1 

    DBSCAN is one of the most popular algorithms for cluster analysis. It can discover clusters with arbitrary shape and separate noises. But this algorithm cannot choose its parameter according to distributing of dataset. It simply uses the global uses minimum number of points (MinPts) parameter,...

  • An Efficient Approach for Text Clustering Based on Frequent Itemsets. Krishna, S. Murali; Bhavani, S. Durga // European Journal of Scientific Research;Jun2010, Vol. 42 Issue 3, p385 

    In recent times, the vast amount of textual information available in electronic form is growing at staggering rate. This increasing number of textual data has led to the task of mining useful or interesting frequent itemsets (words/terms) from very large text databases and still it seems to be...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics