An entropy test for single-locus genetic association analysis

Ruiz-Marín, Manuel; Matilla-García, Mariano; García Cordoba, José Antonio; Susillo-González, Juan Luis; Romo-Astorga, Alejandro; González-Pérez, Antonio; Ruiz, Agustín; Gayán, Javier
January 2010
BMC Genetics;2010, Vol. 11, p19
Academic Journal
Background: The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests of genetic association, and especially for the identification of rare effects Results: We introduce a new genetic association test based on symbolic dynamics and symbolic entropy. Using a freely available software, we have applied this entropy test, and a conventional test, to simulated and real datasets, to illustrate the method and estimate type I error and power. We have also compared this new entropy test to the Fisher exact test for assessment of association with low-frequency SNPs. The entropy test is generally more powerful than the conventional test, and can be significantly more powerful when the genotypic test is applied to low allele-frequency markers. We have also shown that both the Fisher and Entropy methods are optimal to test for association with lowfrequency SNPs (MAF around 1-5%), and both are conservative for very rare SNPs (MAF<1%) Conclusions: We have developed a new, simple, consistent and powerful test to detect genetic association of biallelic/ SNP markers in case-control data, by using symbolic dynamics and symbolic entropy as a measure of gene dependence. We also provide a standard asymptotic distribution of this test statistic. Given that the test is based on entropy measures, it avoids smoothed nonparametric estimation. The entropy test is generally as good or even more powerful than the conventional and Fisher tests. Furthermore, the entropy test is more computationally efficient than the Fisher's Exact test, especially for large number of markers. Therefore, this entropy-based test has the advantage of being optimal for most SNPs, regardless of their allele frequency (Minor Allele Frequency (MAF) between 1-50%). This property is quite beneficial, since many researchers tend to discard low allele-frequency SNPs from their analysis. Now they can apply the same statistical test of association to all SNPs in a single analysis., which can be especially helpful to detect rare effects.


Related Articles

  • Contribution of variant alleles of ABCB11 to susceptibility to intrahepatic cholestasis of pregnancy. Dixon, P. H.; van Mil, S. W. C.; Chambers, J.; Strautnieks, S.; Thompson, R. J.; Lammert, F.; Kubitz, R.; Keitel, V.; Glantz, A.; Mattsson, L.-Å.; Marschall, H.-U.; Molokhia, M.; Moore, G. E.; Linton, K. J.; Williamson, C. // Gut;Apr2009, Vol. 58 Issue 4, p537 

    Background: Intrahepatic cholestasis of pregnancy (ICP) has a complex aetiology with a significant genetic component. ABCB11 encodes the bile salt export pump (BSEP); mutations cause a spectrum of cholestatic disease, and are implicated in the aetiology of ICP. Methods: ABCB11 variation in ICP...

  • Demonstrating an Interactive Genetic Drift Exercise. Carter, Ashley J.R. // Journal of College Science Teaching;Mar/Apr2002, Vol. 31 Issue 6, p408 

    Discusses a hands-on demonstration of the phenomenon of genetic drift in populations. Background on genetic drift; Procedure for showing the effect of population size on gene frequencies; Questions about common misconceptions of genetic drift and selection.

  • Effect of misspecification of gene frequency on the two-point LOD score. Pal, Deb K; Durner, Martina; Greenberg, David A // European Journal of Human Genetics;Nov2001, Vol. 9 Issue 11, p855 

    In this study, we used computer simulation of simple and complex models to ask: (1) What is the penalty in evidence for linkage when the assumed gene frequency is far from the true gene frequency? (2) If the assumed model for gene frequency and inheritance are misspecified in the analysis, can...

  • Host movement and the genetic structure of populations of parasitic nematodes. Blouin, Michael S.; Yowell, Charles A. // Genetics;Nov95, Vol. 141 Issue 3, p1007 

    Compares population genetic structures of five species of parasitic nematodes: Ostertagia ostertagi and Haemonchus placei from cattle, H. contortus and Teladorsagia circumcincta from sheep and Mazamastrongylus odocoilei from white-tailed deer. Patterns consistent with gene flow among...

  • A Diffusion Approximation for Selection and Drift in a Subdivided Population. Cherry, Joshua L.; Wakeley, John // Genetics;Jan2003, Vol. 163 Issue 1, p421 

    The population-genetic consequences of population structure are of great interest and have been studied extensively. An area of particular interest is the interaction among population structure, natural selection, and genetic drift. At first glance, different results in this area give very...

  • In Brief.  // Nature Reviews Genetics;Oct2010, Vol. 11 Issue 10, p671 

    The article offers news brief related to genetics including the integration of rare and common alleles in diverse human populations, the cyclic gene expression in Arabidopsis thaliana which determines its competence for root branching, and the exome data which can be used to identify adaptive...

  • Poster presentations: 1. Clinical Genetics/Counselling.  // Journal of Medical Genetics;Sep2002 Supplement, Vol. 39, pS35 

    Discusses the abstract of the research paper entitled 'Awareness of the contribution of genetic factors to aetiology amongst individuals with bipolar disorder,' by Judy Tocher and D. Craufurd and presented during the British Human Genetics Conference at the University of York in England in...

  • Analysis of 6 short tandem repeat loci in Navarre (Northern Spain). Iriondo, Mikel; de la Rua, Concepcion // Human Biology;Feb99, Vol. 71 Issue 1, p43 

    Performs a genetic study on a sample of 146 autochthonous individuals from the province of Navarre in Northern Spain, to test for six short tandem repeat (STR) systems. Allele frequencies obtained from the Navarre population for the six STR systems; Comparison of the allele frequency...

  • Stability of the FMRI CGG repeat in a Basque sample. Arrieta, I.; Gil, A. // Human Biology;Feb99, Vol. 71 Issue 1, p55 

    Examines the FMR1 gene stability in normal individuals of Basque origin from the Biscay province of France. FMR1 CGG repeat distribution and AGG interspersion; FRAXAC1 and DXS548 repeat distribution and haplotype analysis; Allele association between FMR1 CGG and microsatellite markers.


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics