TITLE

Improving prediction accuracy of tumor classification by reusing genes discarded during gene selection

AUTHOR(S)
Yang, Jack Y.; Guo-Zheng Li; Hao-Hua Meng; Qu Yang, Mary; Youping Deng
PUB. DATE
January 2008
SOURCE
BMC Genomics;2008 Supplement 1, Vol. 9, Special section p1
SOURCE TYPE
Academic Journal
DOC. TYPE
Article
ABSTRACT
Background: Since the high dimensionality of gene expression microarray data sets degrades the generalization performance of classifiers, feature selection, which selects relevant features and discards irrelevant and redundant features, has been widely used in the bioinformatics field. Multitask learning is a novel technique to improve prediction accuracy of tumor classification by using information contained in such discarded redundant features, but which features should be discarded or used as input or output remains an open issue. Results: We demonstrate a framework for automatically selecting features to be input, output, and discarded by using a genetic algorithm, and propose two algorithms: GA-MTL (Genetic algorithm based multi-task learning) and e-GA-MTL (an enhanced version of GA-MTL). Experimental results demonstrate that this framework is effective at selecting features for multitask learning, and that GA-MTL and e-GA-MTL perform better than other heuristic methods. Conclusions: Genetic algorithms are a powerful technique to select features for multi-task learning automatically; GA-MTL and e-GA-MTL are shown to to improve generalization performance of classifiers on microarray data sets.
ACCESSION #
35701835

 

Related Articles

  • LEAF: Leave-one-out Forward Selection Method for Informative Gene Discovery in DNA Microarray Data. Fukuta, Kentaro; Okada, Yoshifumi // IAENG International Journal of Computer Science;Jun2011, Vol. 38 Issue 2, p160 

    Preventing, diagnosing, and treating disease is greatly facilitated by the availability of biomarkers. Recent improvements in bioinformatics technology have facilitated large-scale screening of DNA microarrays for candidate biomarkers. Here we discuss a gene selection method, which is called...

  • Decision Tree Classifiers in Bioinformatics. Polaka, Inese // Computer Science (1407-7493);2010, Vol. 44, p119 

    No abstract available.

  • Dimension reduction with redundant gene elimination for tumor classification. Xue-Qiang Zeng; Guo-Zheng Li; Yang, Jack Y.; Yang, Mary Qu; Geng-Feng Wu // BMC Bioinformatics;2008 Supplement 6, Vol. 9, Special section p1 

    Background: Analysis of gene expression data for tumor classification is an important application of bioinformatics methods. But it is hard to analyse gene expression data from DNA microarray experiments by commonly used classifiers, because there are only a few observations but with thousands...

  • Exploiting big biology: Integrating large-scale biological data for function inference. Marcotte, Edward M.; Date, Shailesh V. // Briefings in Bioinformatics;Dec2001, Vol. 2 Issue 4, p51 

    The amount of data produced by molecular biologists is growing at an exponential rate. Some of the fastest growing sets of data are measurements of gene expression, comparable in quantity only to gene sequences and the vast biological literature. Both gene expression data and sequence data offer...

  • NIFTI: An evolutionary approach for finding number of clusters in microarray data. Jonnalagadda, Sudhakar; Srinivasan, Rajagopalan // BMC Bioinformatics;2009, Vol. 10, Special section p1 

    Background: Clustering techniques are routinely used in gene expression data analysis to organize the massive data. Clustering techniques arrange a large number of genes or assays into a few clusters while maximizing the intra-cluster similarity and inter-cluster separation. While clustering of...

  • SC2ATmd: a tool for integration of the figure of merit with cluster analysis for gene expression data. Olex, Amy L.; Fetrow, Jacquelyn S. // Bioinformatics;May2011, Vol. 27 Issue 9, p1330 

    Summary: Standard and Consensus Clustering Analysis Tool for Microarray Data (SC2ATmd) is a MATLAB-implemented application specifically designed for the exploration of microarray gene expression data via clustering. Implementation of two versions of the clustering validation method figure of...

  • Using a Genetic Algorithm and a Perceptron for Feature Selection and Supervised Class Learning in DNA Microarray Data. Michal Karzynski; Álvaro Mateos; Javier Herrero; Joaquín Dopazo // Artificial Intelligence Review;Oct2003, Vol. 20 Issue 1/2, p39 

    Class prediction and feature selection is key in the context of diagnostic applications of DNA microarrays. Microarray data is noisy and typically composed of a low number of samples and a large number of genes. Perceptrons can constitute an efficient tool for accurate classification of...

  • CLUSTERING IN CONJUNCTION WITH WRAPPER APPROACH TO SELECT DISCRIMINATORY GENES FOR MICROARRAY DATASET CLASSIFICATION. BALA, Rajni; AGRAWAL, Ramesh Kumar // Computing & Informatics;2012, Vol. 31 Issue 5, p921 

    With the advent of microarray technology, it is possible to measure gene expression levels of thousands of genes simultaneously. This helps us diagnose and classify some particular cancers directly using DNA microarray. Highdimensionality and small sample size of microarray datasets has made the...

  • Human Liver Cancer Classification using Microarray Gene Expression Data. Rajeswari, P.; Reena, G. Sophia // International Journal of Computer Applications;Nov2011, Vol. 34, p25 

    Cancer is one of the dreadful diseases, which causes considerable death rate in humans. Cancer is featured by an irregular, unmanageable growth that may demolish and attack neighboring healthy body tissues or somewhere else in the body. There are dissimilar techniques lives for the naming of...

Share

Read the Article

Courtesy of THE LIBRARY OF VIRGINIA

Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics