Feature extraction and signal processing for nylon DNA microarrays

Lopez, F.; Rougemont, J.; Loriod, B.; Bourgeois, A.; Loï, L.; Bertucci, F.; Hingamp, P.; Houlgatte, R.; Granjeaud, S.
January 2004
BMC Genomics;2004, Vol. 5, p38
Academic Journal
Background: High-density DNA microarrays require automatic feature extraction methodologies and softwares. These can be a potential source of non-reproducibility of gene expression measurements. Variation in feature location or in signal integration methodology may be a significant contribution to the observed variance in gene expression levels. Results: We explore sources of variability in feature extraction from DNA microarrays on Nylon membrane with radioactive detection. We introduce a mathematical model of the signal emission and derive methods for correcting biases such as overshining, saturation or variation in probe amount. We also provide a quality metric which can be used qualitatively to flag weak or untrusted signals or quantitatively to modulate the weight of each experiment or gene in higher level analyses (clustering or discriminant analysis). Conclusions: Our novel feature extraction methodology, based on a mathematical model of the radioactive emission, reduces variability due to saturation, neighbourhood effects and variable probe amount. Furthermore, we provide a fully automatic feature extraction software, BZScan, which implements the algorithms described in this paper.


Related Articles

  • Novel implementation of conditional co-regulation by graph theory to derive co-expressed genes from microarray data. Rawat, Arun; Seifert, Georg J.; Youping Deng // BMC Bioinformatics;2008 Supplement 9, Vol. 9, Special section p1 

    Background: Most existing transcriptional databases like Comprehensive Systems-Biology Database (CSB.DB) and Arabidopsis Microarray Database and Analysis Toolbox (GENEVESTIGATOR) help to seek a shared biological role (similar pathways and biosynthetic cycles) based on correlation. These utilize...

  • ChromSorter PC: A database of chromosomal regions associated with human prostate cancer. Etim, Ann; Guohui Zhou; Xinyu Wen; Hang Liu; Ruotti, Victor; Twigger, Simon; Weihong Jin; Matysiak, Brian; Mathis, Jedidiah; Tonellato, Peter J.; Datta, Milton W. // BMC Genomics;2004, Vol. 5, p27 

    Background: Our increasing use of genetic and genomic strategies to understand human prostate cancer means that we need access to simplified and integrated information present in the associated biomedical literature. In particular, microarray gene expression studies and associated genetic...

  • Microarray and EST database estimates of mRNA expression levels differ: The protein length versus expression curve for C. elegans. Munoz, Enrique T.; Bogarad, Leonard D.; Deem, Michael W. // BMC Genomics;2004, Vol. 5, p1 

    Background: Various methods for estimating protein expression levels are known. The level of correlation between these methods is only fair, and systematic biases in each of the methods cannot be ruled out. We here investigate systematic biases in the estimation of gene expression rates from...

  • Cross-platform comparison of microarray data using order restricted inference. Klinglmueller, Florian; Tuechler, Thomas; Posch, Martin // Bioinformatics;Apr2011, Vol. 27 Issue 7, p953 

    Motivation: Titration experiments measuring the gene expression from two different tissues, along with total RNA mixtures of the pure samples, are frequently used for quality evaluation of microarray technologies. Such a design implies that the true mRNA expression of each gene, is either...

  • A comparative analysis of data generated using two different target preparation methods for hybridization to high-density oligonucleotide microarrays. Gold, David; Coombes, Kevin; Medhane, Dina; Ramaswamy, Anitha; Zhenlin Ju; Strong, Louise; Ja Seok Koo; Kapoor, Mini // BMC Genomics;2004, Vol. 5, p2 

    Background: To generate specific transcript profiles, one must isolate homogenous cell populations using techniques that often yield small amounts of RNA, requiring researchers to employ RNA amplification methods. The data generated by using these methods must be extensively evaluated to...

  • Wavelet-based feature extraction for DNA microarray classification. Sarhan, Ahmad // Artificial Intelligence Review;Mar2013, Vol. 39 Issue 3, p237 

    Complementary DNA (cDNA) microarray-based tumor gene expression profiles have been successfully used for cancer diagnosis. The main difficulty in processing cDNA microarrays is the ultra-high dimensionality of the microarrays. In this paper, we approach the dimensionality reduction using a novel...

  • A Hybrid Both Filter and Wrapper Feature Selection Method for Microarray Classification. Li-Yeh Chuang; Chao-Hsuan Ke; Cheng-Hong Yang // International MultiConference of Engineers & Computer Scientists;2008, p146 

    Gene expression data is widely used in disease analysis and cancer diagnosis. However, since gene expression data could contain thousands of genes simultaneously, successful microarray classification is rather difficult. Feature selection is an important pre-treatment for any classification...

  • Dimension reduction with redundant gene elimination for tumor classification. Xue-Qiang Zeng; Guo-Zheng Li; Yang, Jack Y.; Yang, Mary Qu; Geng-Feng Wu // BMC Bioinformatics;2008 Supplement 6, Vol. 9, Special section p1 

    Background: Analysis of gene expression data for tumor classification is an important application of bioinformatics methods. But it is hard to analyse gene expression data from DNA microarray experiments by commonly used classifiers, because there are only a few observations but with thousands...

  • ANALYSING OF DNA MICROARRAY DATA USING PRINCIPLE COMPONENT ANALYSIS (PCA). SABBAR, BAYAN M.; SULYMAN, MENA R. // Journal of Theoretical & Applied Information Technology;12/10/2014, Vol. 70 Issue 1, p130 

    In DNA analysis, Microarray technology is a new tool that can automate the diagnostic task and improve the accuracy of the traditional diagnostic techniques. With microarrays, it is possible to examine the expression of thousands of genes in the same time. In this paper is used the Principle...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics