An Extension of PPLS-DA for Classification and Comparison to Ordinary PLS-DA

Telaar, Anna; Liland, Kristian Hovde; Repsilber, Dirk; Nürnberg, Gerd
February 2013
PLoS ONE;Feb2013, Vol. 8 Issue 2, p1
Academic Journal
Classification studies are widely applied, e.g. in biomedical research, to classify objects/patients into predefined groups. The goal is to find a classification function/rule which assigns each object/patient to a unique group with the greatest possible accuracy (i.e., the lowest classification error). Especially in gene expression experiments, many variables (genes) are often measured for only a few objects/patients. A suitable approach is the well-known method PLS-DA, which searches for a transformation to a lower-dimensional space. The resulting new components are linear combinations of the original variables. An advancement of PLS-DA leads to PPLS-DA, which introduces a so-called ‘power parameter’ that is optimized to maximize the correlation between the components and the group membership. We introduce an extension of PPLS-DA that optimizes this power parameter towards the final aim, namely a minimal classification error. We compare this new extension with the original PPLS-DA and with ordinary PLS-DA using simulated and experimental datasets. For the investigated datasets with weak linear dependency between features/variables, neither PPLS-DA nor the extensions show an improvement over PLS-DA. For simulated data with very weak linear dependency and a low proportion of differentially expressed genes, PPLS-DA does not improve on PLS-DA, but our extension shows a lower prediction error. In contrast, for the dataset with strong between-feature collinearity, a low proportion of differentially expressed genes, and a large total number of genes, the prediction error of PPLS-DA and the extensions is clearly lower than for PLS-DA. Moreover, we compare these prediction results with those of support vector machines with a linear kernel and of linear discriminant analysis.
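
The role of the power parameter can be illustrated with a minimal Python sketch. It is not the authors' implementation. The weight formula is an assumption taken from the usual powered-PLS formulation (each variable's weight combines its correlation with the group label and its standard deviation, each raised to a power governed by gamma), and the simulated data, the gamma grid, and the function names `powered_weights` and `score_label_correlation` are hypothetical. The sketch computes only the first component for two groups and selects gamma by the correlation criterion of the original PPLS-DA.

```python
import numpy as np

def powered_weights(X, y, gamma):
    """First-component loading weights for a power parameter gamma in (0, 1).

    Assumed form: w_j ~ sign(r_j) * |r_j|^(gamma/(1-gamma)) * s_j^((1-gamma)/gamma),
    where r_j is the correlation of variable j with y and s_j its standard deviation.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    sd = Xc.std(axis=0)
    corr = (Xc.T @ yc) / (len(y) * sd * yc.std())
    w = np.sign(corr) * np.abs(corr) ** (gamma / (1 - gamma)) * sd ** ((1 - gamma) / gamma)
    return w / np.linalg.norm(w)

def score_label_correlation(X, y, gamma):
    """Absolute correlation between the first component's scores and the labels."""
    scores = (X - X.mean(axis=0)) @ powered_weights(X, y, gamma)
    return abs(np.corrcoef(scores, y)[0, 1])

# Hypothetical data: 40 samples, 200 "genes", 5 of them differentially expressed.
rng = np.random.default_rng(0)
y = np.repeat([0.0, 1.0], 20)
X = rng.normal(size=(40, 200))
X[:, :5] += 1.5 * y[:, None]

# Correlation criterion: pick the gamma whose component correlates best with y.
gammas = np.linspace(0.05, 0.95, 19)
best_gamma = max(gammas, key=lambda g: score_label_correlation(X, y, g))
```

Under this assumed formula, gamma = 0.5 gives weights proportional to the covariance between each variable and the label, which is the ordinary PLS choice. Replacing `score_label_correlation` with an estimate of the classification error, to be minimized instead of maximized, would correspond to the kind of extension the abstract describes.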


Related Articles

  • Identification of Gene-Expression Signatures and Protein Markers for Breast Cancer Grading and Staging. Yao, Fang; Zhang, Chi; Du, Wei; Liu, Chao; Xu, Ying // PLoS ONE;9/16/2015, Vol. 10 Issue 9, p1 

    The grade of a cancer is a measure of the cancer's malignancy level, and the stage of a cancer refers to the size and the extent that the cancer has spread. Here we present a computational method for prediction of gene signatures and blood/urine protein markers for breast cancer grades and...

  • Comparison of Genome-Wide Association Methods in Analyses of Admixed Populations with Complex Familial Relationships. Kadri, Naveen K.; Guldbrandtsen, Bernt; Sørensen, Peter; Sahana, Goutam // PLoS ONE;Mar2014, Vol. 9 Issue 3, p1 

    Population structure is known to cause false-positive detection in association studies. We compared the power, precision, and type-I error rates of various association models in analyses of a simulated dataset with structure at the population (admixture from two populations; P) and family (K)...

  • Genetic biomarkers for brain hemisphere differentiation in Parkinson's Disease. Hourani, Mou'ath; Mendes, Alexandre; Berretta, Regina; Moscato, Pablo // AIP Conference Proceedings;11/2/2007, Vol. 952 Issue 1, p207 

    This work presents a study on the genetic profile of the left and right hemispheres of the brain of a mouse model of Parkinson's disease (PD). The goal is to characterize, in a genetic basis, PD as a disease that affects these two brain regions in different ways. Using the same whole-genome...

  • Identifying critical transitions of complex diseases based on a single sample. Liu, Rui; Yu, Xiangtian; Liu, Xiaoping; Xu, Dong; Aihara, Kazuyuki; Chen, Luonan // Bioinformatics;Jun2014, Vol. 30 Issue 11, p1579 

    Motivation: Unlike traditional diagnosis of an existing disease state, detecting the pre-disease state just before the serious deterioration of a disease is a challenging task, because the state of the system may show little apparent change or symptoms before this critical transition during...

  • Sequential interim analyses of survival data in DNA microarray experiments. Leha, Andreas; Beißbarth, Tim; Jung, Klaus // BMC Bioinformatics;2011, Vol. 12 Issue 1, p127 

    Background: Discovery of biomarkers that are correlated with therapy response and thus with survival is an important goal of medical research on severe diseases, e.g. cancer. Frequently, microarray studies are performed to identify genes of which the expression levels in pretherapeutic tissue...

  • Revisiting the technical validation of tumour biomarker assays: how to open a Pandora's box. Marchiò, Caterina; Dowsett, Mitch; Reis-Filho, Jorge S. // BMC Medicine;2011, Vol. 9 Issue 1, p41 

    A tumour biomarker is a characteristic that is objectively measured and evaluated in tumour samples as an indicator of normal biological processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention. The development of a biomarker contemplates distinct phases,...

  • Finding minimum gene subsets with heuristic breadth-first search algorithm for robust tumor classification. Shu-Lin Wang; Xue-Ling Li; Jianwen Fang // BMC Bioinformatics;2012, Vol. 13 Issue 1, p178 

    Background: Previous studies on tumor classification based on gene expression profiles suggest that gene selection plays a key role in improving the classification performance. Moreover, finding important tumor-related genes with the highest accuracy is a very important task because these genes...

  • Multi-resolution independent component analysis for high-performance tumor classification and biomarker discovery. Henry Han; Xiao-Li Li // BMC Bioinformatics;2011 Supplement 1, Vol. 12 Issue Suppl 1, p1 

    Background: Although high-throughput microarray based molecular diagnostic technologies show a great promise in cancer diagnosis, it is still far from a clinical application due to its low and instable sensitivities and specificities in cancer molecular pattern recognition. In fact,...

  • Breast cancer classification: linking molecular mechanisms to disease prognosis. Taherian-Fard, Atefeh; Srihari, Sriganesh; Ragan, Mark A. // Briefings in Bioinformatics;May2015, Vol. 16 Issue 3, p461 

    Breast cancer was traditionally perceived as a single disease; however, recent advances in gene expression and genomic profiling have revealed that breast cancer is in fact a collection of diseases exhibiting distinct anatomical features, responses to treatment and survival outcomes....

