Rough Set Approach to Multivariate Decision Trees Inducing

Dianhong Wang; Xingwen Liu; Liangxiao Jiang; Xiaoting Zhang; Yongguang Zhao
April 2012
Journal of Computers;Apr2012, Vol. 7 Issue 4, p870
Academic Journal
To address the heavy computation, large tree size, and over-fitting of multivariate decision tree (MDT) algorithms, we propose a novel rough-set-based multivariate decision tree (RSMDT) method. In this paper, the positive region degree of condition attributes with respect to decision attributes in rough set theory is used to select attributes for multivariate tests. A new concept, the extended generalization of one equivalence relation with respect to another, is introduced and used to construct the multivariate tests. We experimentally evaluate the RSMDT algorithm in terms of classification accuracy, tree size, and computing time, using all 36 UCI Machine Learning Repository data sets selected by the Weka platform, and compare it with C4.5, classification and regression trees (CART), classification and regression trees with linear combinations (CART-LC), Oblique Classifier 1 (OC1), and Quick Unbiased Efficient Statistical Trees (QUEST). The experimental results indicate that the RSMDT algorithm significantly outperforms the comparison algorithms, with improved classification accuracy, relatively small tree size, and shorter computing time.
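
The abstract does not spell out how the positive region degree is computed. As a rough illustration only, the sketch below uses the standard rough set definition, in which the degree is the fraction of objects whose condition-attribute equivalence class falls entirely inside a single decision class. The function names, the toy data, and the attribute names are assumptions made for this example and are not taken from the paper; the RSMDT selection procedure itself may differ.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(set)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].add(i)
    return list(blocks.values())


def positive_region(rows, cond_attrs, dec_attrs):
    """Objects whose condition class lies wholly within one decision class."""
    dec_blocks = partition(rows, dec_attrs)
    pos = set()
    for block in partition(rows, cond_attrs):
        if any(block <= d for d in dec_blocks):
            pos |= block
    return pos


def dependency_degree(rows, cond_attrs, dec_attrs):
    """Standard rough set measure: |POS_C(D)| / |U| (assumed definition)."""
    return len(positive_region(rows, cond_attrs, dec_attrs)) / len(rows)


# Hypothetical toy decision table, invented for illustration.
rows = [
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
    {"outlook": "rainy", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "no", "play": "no"},
]

for attrs in (["outlook"], ["windy"], ["outlook", "windy"]):
    print(attrs, dependency_degree(rows, attrs, ["play"]))
```

Under this definition, a set of condition attributes with a higher degree determines the decision for more objects, which is one plausible reason to prefer it when choosing attributes for a multivariate test.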


Related Articles

  • A Decision Tree Algorithm based on Rough Set Theory after Dimensionality Reduction. Shrivastava, Shailendra K.; Tantuway, Manisha // International Journal of Computer Applications;Mar2011, Vol. 17, p29 

    Decision tree technology has proven to be a valuable way of capturing human decision making within a computer. As ID3 select those attribute as splitting attributes which have different values whether it classify dataset properly or not. There is another drawback of ID3 which repeat sub tree...

  • An Improved Decision Tree Algorithm Based on the Attribute Set Dependency. Yihong Cao; Yuwan Gu; Huanhuan Cai; Yuqiang Sun // Information Technology Journal;2013, Vol. 12 Issue 22, p6641 

    The decision tree algorithm is the more popular areas of research in data mining and ID3 algorithm is the core algorithm of decision tree algorithm, through research and analysis of the ID3 algorithm, for its shortcoming of multi-value bias interrelated, difficult to remove noise and attribute...

  • AN INNOVATIVE ALGORITHM FOR FEATURE SELECTON BASED ON ROUGH SET WITH FUZZY C-MEANS CLUSTERING. SRIDEVI, T.; SHYAMALA, K.; MURUGAN, A. // Journal of Theoretical & Applied Information Technology;10/31/2014, Vol. 68 Issue 3, p514 

    Feature selection is a fundamental problem in data mining, especially for high level dimensional datasets. Feature selection is a process commonly used in machine learning, wherein subsets of the features from the original set of features are selected for application of a learning algorithm. The...

  • The research of decision tree learning algorithm in technology of data mining classification. Guang-xian Ji // Journal of Convergence Information Technology;Jun2012, Vol. 7 Issue 10, p216 

    Decision tree (decision tree), also known as decision tree is a kind of information theory-based, decision tree data structure based on this classification algorithm. Categorized the decision tree in the field of data mining has been studied for many years, and had a lot of algorithms, such as...

  • Univariate Decision Tree Induction using Maximum Margin Classification. Yıldız, Olcay Taner // Computer Journal;Mar2012, Vol. 55 Issue 3, p293 

    In many pattern recognition applications, first decision trees are used due to their simplicity and easily interpretable nature. In this paper, we propose a new decision tree learning algorithm called univariate margin tree where, for each continuous attribute, the best split is found using...

  • Classifying Cinnamomums using rough sets classifier based on interval-discretization. Ching-Hsue Cheng; Yao-Hsien Chen; Jing-Wei Liu // Plant Systematics & Evolution;Jun2009, Vol. 280 Issue 1/2, p89 

    Classification, which is the task of assigning objects to one of several predefined categories, is a pervasive problem that encompasses many diverse applications. Decision tree classifier, which is a simple yet widely used classification technique, employs training data to yield decision rules;...

  • A novel hybrid feature selection method based on rough set and improved harmony search. Inbarani, H.; Bagyamathi, M.; Azar, Ahmad // Neural Computing & Applications;Nov2015, Vol. 26 Issue 8, p1859 

    Feature selection is a process of selecting optimal features that produce the most prognostic outcome. It is one of the essential steps in knowledge discovery. The crisis is that not all features are important. Most of the features may be redundant, and the rest may be irrelevant and noisy. This...

  • Selección de atributos relevantes aplicando algoritmos que combinan conjuntos aproximados y optimización en colonias de hormigas. Rodríguez, Yanela; Fernández, Yumilka; Bello, Rafael; Caballero, Yailé // Revista Cubana de Ciencias Informáticas;ene-mar2014, Vol. 8 Issue 1, p140 

    Feature selection can be viewed as one of the most fundamental problems in the field of machine learning. An analysis on the methods of feature selection is done in this investigation; stressing those that use techniques of Ant Colony Optimization and the Rough Set Theory. Also, in this...

  • Question Classification based on Rough Set Attributes and Value Reduction. Li Peng; Zhang Kai-Hui // Information Technology Journal;May2011, Vol. 10 Issue 5, p1061 

    This study presents a method on automatic question classification through attribute and value reduction based on rough set theory. The core of the method is adopting statistical machine learning, with the assistance of a fair number of training corpus, attempts to automatically obtain useful and...

