Approaches to reduce false positives and false negatives in the analysis of microarray data: applications in type 1 diabetes research

Jian Wu; Lenchik, Nataliya I.; Gerling, Ivan C.
January 2008
BMC Genomics;2008 Supplement 2, Vol. 9, Special section p1
Academic Journal
Background: As studies of molecular biology system attempt to achieve a comprehensive understanding of a particular system, Type 1 errors may be a significant problem. However, few investigators are inclined to accept the increase in Type 2 errors (false positives) that may result when less stringent statistical cut-off values are used. To address this dilemma, we developed an analysis strategy that used a stringent statistical analysis to create a list of differentially expressed genes that served as "bait" to "fish out" other genes with similar patterns of expression. Results: Comparing two strains of mice (NOD and C57Bl/6), we identified 93 genes with statistically significant differences in their patterns of expression. Hierarchical clustering identified an additional 39 genes with similar patterns of expression differences between the two strains. Pathway analysis was then employed: 1) identify the central genes and define biological processes that may be regulated by the genes identified, and 2) identify genes on the lists that could not be connected to each other in pathways (potential false positives). For networks created by both gene lists, the most connected (central) genes were interferon gamma (IFN-?) and tumor necrosis factor alpha (TNF-a). These two cytokines are relevant to the biological differences between the two strains of mice. Furthermore, the network created by the list of 39 genes also suggested other biological differences between the strains. Conclusion: Taken together, these data demonstrate how stringent statistical analysis, combined with hierarchical clustering and pathway analysis may offer deeper insight into the biological processes reflected from a set of expression array data. This approach allows us to 'recapture" false negative genes that otherwise would have been missed by the statistical analysis.


Related Articles

  • Exploiting big biology: Integrating large-scale biological data for function inference. Marcotte, Edward M.; Date, Shailesh V. // Briefings in Bioinformatics;Dec2001, Vol. 2 Issue 4, p51 

    The amount of data produced by molecular biologists is growing at an exponential rate. Some of the fastest growing sets of data are measurements of gene expression, comparable in quantity only to gene sequences and the vast biological literature. Both gene expression data and sequence data offer...

  • Immune effects of dietary anethole on Eimeria acervulina infection. Duk Kyung Kim; Hyun S. Lillehoj; Sung Hyen Lee; Seung Ik Jang; Myeong Seon Park; Wongi Min; Lillehoj, Erik P.; Bravos, David // Poultry Science;Oct2013, Vol. 92 Issue 10, p2625 

    The effects of anethole on in vitro and in vivo parameters of chicken immunity during experimental avian coccidiosis were evaluated. Anethole reduced the viability of invasive Eimeria acervulina sporozoites after 2 or 4 h of treatment in vitro by 45 and 42%, respectively, and stimulated 6.0-fold...

  • PU.1 induces apoptosis in myeloma cells through direct transactivation of TRAIL. Ueno, S.; Tatetsu, H.; Hata, H.; Iino, T.; Niiro, H.; Akashi, K.; Tenen, D. G.; Mitsuya, H.; Okuno, Y. // Oncogene;11/19/2009, Vol. 28 Issue 46, p4116 

    We earlier reported that PU.1 was downregulated in myeloma cell lines and myeloma cells in a subset of myeloma patients, and that conditional PU.1 expression in PU.1-negative myeloma cell lines, U266 and KMS12PE, induced growth arrest and apoptosis. To elucidate the molecular mechanisms of the...

  • Expression diversity and evolutionary dynamics of rice duplicate genes. Won Cheol Yim; Byung-Moo Lee; Cheol Seong Jang // Molecular Genetics & Genomics;May2009, Vol. 281 Issue 5, p483 

    Duplicate genes are believed to be a major source of new gene functions over evolutionary time. In order to evaluate the evolutionary dynamics of rice duplicate genes, formed principally by paleoployploidization prior to the speciation of the Poaceae family, we have employed a public microarray...

  • Quality determination and the repair of poor quality spots in array experiments. Tom, Brian D. M.; Gilks, Walter R.; Brooke-Powell, Elizabeth T.; Ajioka, James W. // BMC Bioinformatics;2005, Vol. 6, p234 

    Background: A common feature of microarray experiments is the occurence of missing gene expression data. These missing values occur for a variety of reasons, in particular, because of the filtering of poor quality spots and the removal of undefined values when a logarithmic transformation is...

  • Lymphotoxin pathway directs thymic Aire expression. Chin, Robert K.; Lo, James C.; Kim, Oliver; Blink, Sarah E.; Christiansen, Peter A.; Peterson, Pärt; Yang Wang; Ware, Carl; Yang-Xin Fu // Nature Immunology;Nov2003, Vol. 4 Issue 11, p1121 

    The autoimmune regulator Aire is a key mediator of central tolerance for peripherally restricted antigens. Its absence in human patients results in autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy. The cellular signals that regulate Aire expression are undefined. We show here that...

  • Effects of acanthoic acid on TNF- alpha gene expression and haptoglobin synthesis. Kang, H-S.; Song, H. K.; Lee, J-J.; Pyun, K-H.; Choi, I. // Mediators of Inflammation;Aug98, Vol. 7 Issue 4, p257 

    TUMOUR necrosis factor- alpha (TNF- alpha ) is a major proinflammatory cytokine inducing the synthesis and release of many inflammatory mediators. It is involved in immune regulation, autoim mune diseases, and inflammation. Our previous study demonstrated that acanthoic acid, (-)-pimara-9(11),...

  • A novel parametric approach to mine gene regulatory relationship from microarray datasets. Wanlin Liu; Dong Li; Qijun Liu; Yunping Zhu; Fuchu He // BMC Bioinformatics;Jan2010 Supplement 11, Vol. 11, p1 

    Background: Microarray has been widely used to measure the gene expression level on the genome scale in the current decade. Many algorithms have been developed to reconstruct gene regulatory networks based on microarray data. Unfortunately, most of these models and algorithms focus on global...

  • Discovering gene expression patterns in time course microarray experiments by ANOVA SCA. María José Nueda; Ana Conesa; Johan A. Westerhuis; Huub C. J. Hoefsloot; Age K. Smilde; Manuel Talón; Alberto Ferrer // Bioinformatics;Jul2007, Vol. 23 Issue 14, p1792 

    Motivation: Designed microarray experiments are used to investigate the effects that controlled experimental factors have on gene expression and learn about the transcriptional responses associated with external variables. In these datasets, signals of interest coexist with varying sources of...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics