A statistical framework for consolidating "sibling" probe sets for Affymetrix GeneChip data

Hua Li; Dongxiao Zhu; Cook, Malcolm
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: Affymetrix GeneChip typically contains multiple probe sets per gene, defined as sibling probe sets in this study. These probe sets may or may not behave similar across treatments. The most appropriate way of consolidating sibling probe sets suitable for analysis is an open problem. We propose the Analysis of Variance (ANOVA) framework to decide which sibling probe sets can be consolidated. Results: The ANOVA model allows us to separate the sibling probe sets into two types: those behave similarly across treatments and those behave differently across treatments. We found that consolidation of sibling probe sets of the former type results in large increase in the number of differentially expressed genes under various statistical criteria. The approach to selecting sibling probe sets suitable for consolidating is implemented in R language and freely available from http:// research.stowers-institute.org/hul/affy/. Conclusion: Our ANOVA analysis of sibling probe sets provides a statistical framework for selecting sibling probe sets for consolidation. Consolidating sibling probe sets by pooling data from each greatly improves the estimates of a gene expression level and results in identification of more biologically relevant genes. Sibling probe sets that do not qualify for consolidation may represent annotation errors or other artifacts, or may correspond to differentially processed transcripts of the same gene that require further analysis.


Related Articles

  • Interpretation of multiple probe sets mapping to the same gene in Affymetrix GeneChips. Stalteri, Maria A; Harrison, Andrew P // BMC Bioinformatics;2007, Vol. 8, p13 

    Background: Affymetrix GeneChip technology enables the parallel observations of tens of thousands of genes. It is important that the probe set annotations are reliable so that biological inferences can be made about genes which undergo differential expression. Probe sets representing the same...

  • Novel definition files for human GeneChips based on GeneAnnot. Ferrari, Francesco; Bortoluzzi, Stefania; Coppe, Alessandro; Sirota, Alexandra; Safran, Marilyn; Shmoish, Michael; Ferrari, Sergio; Lancet, Doron; Danieli, Gian Antonio; Bicciato, Silvio // BMC Bioinformatics;2007 Supplement 2, Vol. 8, p446 

    Background: Improvements in genome sequence annotation revealed discrepancies in the original probeset/gene assignment in Affymetrix microarray and the existence of differences between annotations and effective alignments of probes and transcription products. In the current generation of...

  • Expression Profiling of a Heterogeneous Population of ncRNAs Employing a Mixed DNA/LNA Microarray. Skreka, Konstantinia; Zywicki, Marek; Karbiener, Michael; Hüttenhofer, Alexander; Scheideler, Marcel; Rederstorff, Mathieu // Journal of Nucleic Acids;2012, p1 

    Mammalian transcriptomes mainly consist of non protein coding RNAs. These ncRNAs play various roles in all cells and are involved inmultiple regulation pathways. More recently, ncRNAs have also been described as valuable diagnostic tools. While RNAseq approaches progressively replace...

  • Improved precision and accuracy for microarrays using updated probe set definitions. Sandberg, Rickard; Larsson, Ola // BMC Bioinformatics;2007, Vol. 8, p48 

    Background: Microarrays enable high throughput detection of transcript expression levels. Different investigators have recently introduced updated probe set definitions to more accurately map probes to our current knowledge of genes and transcripts. Results: We demonstrate that updated probe set...

  • Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays. Lu, Jun; Lee, Joseph C; Salit, Marc L; Cam, Margaret C // BMC Bioinformatics;2007, Vol. 8, p108 

    Background: Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous...

  • Transcriptome analysis of grain development in hexaploid wheat. Yongfang Wan; Poole, Rebecca L.; Huttly, Alison K.; Toscano-Underwood, Claudia; Feeney, Kevin; Welham, Sue; Gooding, Mike J.; Mills, Clare; Edwards, Keith J.; Shewry, Peter R.; Mitchell, Rowan A. C. // BMC Genomics;2008, Vol. 9, Special section p1 

    Background: Hexaploid wheat is one of the most important cereal crops for human nutrition. Molecular understanding of the biology of the developing grain will assist the improvement of yield and quality traits for different environments. High quality transcriptomics is a powerful method to...

  • An open-source oligomicroarray standard for human and mouse. Wright, Matthew A.; Church, George M. // Nature Biotechnology;Nov2002, Vol. 20 Issue 11, p1082 

    Discusses an open-source DNA microarray standard for human and mouse transcriptomes. Advantages of open-source standards; Design of the probe standard and probe selection algorithm; Comparison of the human probes selected using the design criteria with those selected by Operon Technologies.

  • DNA Pooling: a tool for large-scale association studies. Sham, Pak; Bader, Joel S.; Craig, Ian; O'Donovan, Michael; Owen, Michael // Nature Reviews Genetics;Nov2002, Vol. 3 Issue 11, p862 

    DNA pooling is a practical way to reduce the cost of large-scale association studies to identify susceptibility loci for common diseases. Pooling allows allele frequencies in groups of individuals to be measured using far fewer PCR reactions and genotyping assays than are used when genotyping...

  • Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data. Hui Yu; Feng Wang; Kang Tu; Lu Xie; Yuan-Yuan Li; Yi-Xue Li // BMC Bioinformatics;2007, Vol. 8, p194 

    Background: The wide use of Affymetrix microarray in broadened fields of biological research has made the probeset annotation an important issue. Standard Affymetrix probeset annotation is at gene level, i.e. a probeset is precisely linked to a gene, and probeset intensity is interpreted as gene...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics