Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery

Abu-Jamous, Basel; Fa, Rui; Roberts, David J.; Nandi, Asoke K.
February 2013
PLoS ONE;Feb2013, Vol. 8 Issue 2, p1
Academic Journal
Clustering analysis has a growing role in the study of co-expressed genes for gene discovery. Conventional binary and fuzzy clustering do not reflect the biological reality that some genes may be irrelevant to a problem and should not be assigned to any cluster, while other genes may participate in several biological functions and should belong to multiple clusters simultaneously. Also, these algorithms cannot generate tight clusters that focus on their cores or wide clusters that overlap and contain all possibly relevant genes. In this paper, a new clustering paradigm is proposed in which all three eventualities are possible: a gene may be assigned exclusively to a single cluster, assigned to multiple clusters, or not assigned to any cluster. These possibilities are realised through the primary novelty of the paradigm, the introduction of tunable binarization techniques. Results from multiple clustering experiments are aggregated to generate one fuzzy consensus partition matrix (CoPaM), which is then binarized to obtain the final binary partitions. This is referred to as Binarization of Consensus Partition Matrices (Bi-CoPaM). The method has been tested with a set of synthetic datasets and a set of five real yeast cell-cycle datasets. The results demonstrate its validity in generating relevant tight, wide, and complementary clusters that can meet the requirements of different gene discovery studies.
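
The aggregation and binarization steps described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. It assumes the partitions from the individual clustering runs are already aligned, so that cluster k refers to the same cluster in every run; it builds the consensus matrix by simple averaging; and it uses a single membership threshold as a stand-in for the tunable binarization techniques, which the abstract does not detail. The function names and the toy data are hypothetical.

```python
def consensus_partition_matrix(partitions):
    """Average several aligned binary partition matrices (clusters x genes).

    Assumption: every run uses the same cluster ordering. Entry [k][g] of
    the result is the fraction of runs that put gene g in cluster k.
    """
    n_runs = len(partitions)
    n_clusters = len(partitions[0])
    n_genes = len(partitions[0][0])
    return [
        [sum(p[k][g] for p in partitions) / n_runs for g in range(n_genes)]
        for k in range(n_clusters)
    ]


def binarize_by_threshold(copam, threshold):
    """Assign gene g to cluster k when its consensus membership >= threshold.

    Illustrative binarization only: a high threshold gives tight clusters and
    may leave genes unassigned; a low threshold gives wide clusters that may
    overlap.
    """
    return [[1 if u >= threshold else 0 for u in row] for row in copam]


# Three hypothetical clustering runs over 4 genes and 2 clusters.
runs = [
    [[1, 1, 0, 0], [0, 0, 1, 1]],
    [[1, 0, 0, 0], [0, 1, 1, 1]],
    [[1, 1, 0, 1], [0, 0, 1, 0]],
]

copam = consensus_partition_matrix(runs)
tight = binarize_by_threshold(copam, 1.0)  # only unanimous assignments survive
wide = binarize_by_threshold(copam, 0.3)   # any gene seen in a cluster at least once

print(tight)  # [[1, 0, 0, 0], [0, 0, 1, 0]]
print(wide)   # [[1, 1, 0, 1], [0, 1, 1, 1]]
```

In this toy example the tight setting leaves the second and fourth genes unassigned, while the wide setting places both of them in the two clusters at once, which mirrors the three eventualities the abstract describes.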


Related Articles

  • Biclustering of time-lagged gene expression data using real number. Liu, F.; Wang, L. B. // Journal of Biomedical Science & Engineering;Feb2010, Vol. 3 Issue 2, p217 

    Analysis of gene expression data can help to find time-lagged co-regulation of gene clusters. However, existing methods solve the problem only when the data are discrete numbers. In this paper, we propose an efficient algorithm to identify time-lagged co-regulated gene cluster...

  • Fuzzy Types Clustering for Microarray Data. Seo Young Kim; Tai Myong Choi; Jong Sung Bae // International Journal of Computational Intelligence;2006, Vol. 2 Issue 1, p12 

    The main goal of microarray experiments is to quantify the expression of every object on a slide as precisely as possible, with a further goal of clustering the objects. Recently, many studies have discussed clustering issues involving similar patterns of gene expression. This paper presents an...

  • PathVar: analysis of gene and protein expression variance in cellular pathways using microarray data. Glaab, Enrico; Schneider, Reinhard // Bioinformatics;Feb2012, Vol. 28 Issue 3, p446 

    Summary: Finding significant differences between the expression levels of genes or proteins across diverse biological conditions is one of the primary goals in the analysis of functional genomics data. However, existing methods for identifying differentially expressed genes or sets of genes by...

  • Bioinformatic analysis of microarray data reveals several key genes related to heart failure. Z.-G. ZHANG; H. CAO; G. LIU; H.-M. FAN; Z.-M. LIU // European Review for Medical & Pharmacological Sciences;Sep2013, Vol. 17 Issue 18, p2441 

    OBJECTIVES: Heart failure is a major public health problem worldwide. However, the molecular mechanism is still unclear. This study aims to discover differentially expressed genes (DEGs) between non-ischemic or ischemic heart failure samples and healthy control, which may be used for diagnosis...

  • Inferring Regulatory Networks from Expression Data Using Tree-Based Methods. Huynh-Thu, Vân Anh; Irrthum, Alexandre; Wehenkel, Louis; Geurts, Pierre // PLoS ONE;2010, Vol. 5 Issue 9, p1 

    One of the pressing open problems of computational systems biology is the elucidation of the topology of genetic regulatory networks (GRNs) using high throughput genomic data, in particular microarray gene expression data. The Dialogue for Reverse Engineering Assessments and Methods (DREAM)...

  • From Knockouts to Networks: Establishing Direct Cause-Effect Relationships through Graph Analysis. Pinna, Andrea; Soranzo, Nicola; de la Fuente, Alberto // PLoS ONE;2010, Vol. 5 Issue 10, p1 

    Background: Reverse-engineering gene networks from expression profiles is a difficult problem for which a multitude of techniques have been developed over the last decade. The yearly organized DREAM challenges allow for a fair evaluation and unbiased comparison of these methods. Results: We...

  • A model of molecular interactions on short oligonucleotide microarrays. Zhang, Li; Miles, Michael F; Aldape, Kenneth D // Nature Biotechnology;Jul2003, Vol. 21 Issue 7, p818 

    High-density short oligonucleotide microarrays have become a widely used tool for measuring gene expression on a large scale. However, details of the mechanism of binding on microarrays remain unclear. Short oligonucleotide probes currently synthesized on microarrays are often...

  • Signal Oscillation Is Another Reason for Variability in Microarray-Based Gene Expression Quantification. Singh, Raghvendra // PLoS ONE;Jan2013, Vol. 8 Issue 1, Special section p1 

    Microarrays have been widely used for various biological applications, such as gene expression profiling, determination of SNPs, and disease profiling. However, quantification and analysis of microarray data have been a challenge. Previously, by taking into account translational and rotational...

  • Differential co-expression framework to quantify goodness of biclusters and compare biclustering algorithms. Burton Kuan Hui Chia; Karuturi, R. Krishna Murthy // Algorithms for Molecular Biology;2010, Vol. 5, p23 

    Background: Biclustering is an important analysis procedure to understand the biological mechanisms from microarray gene expression data. Several algorithms have been proposed to identify biclusters, but very little effort was made to compare the performance of different algorithms on real...

