Choice of population structure informative principal components for adjustment in a case-control study

Peloso, Gina M.; Lunetta, Kathryn L.
January 2011
BMC Genetics;2011, Vol. 12 Issue 1, p64
Academic Journal
Background: There are many ways to perform adjustment for population structure. It remains unclear what the optimal approach is and whether the optimal approach varies by the type of samples and substructure present. The simplest and most straightforward approach is to adjust for the continuous principal components (PCs) that capture ancestry. Through simulation, we explored the issue of which ancestry informative PCs should be adjusted for in an association model to control for the confounding nature of population structure while maintaining maximum power. A thorough examination of selecting PCs for adjustment in a case-control study across the possible structure scenarios that could occur in a genome-wide association study has not been previously reported. Results: We found that when the SNP and phenotype frequencies do not vary over the sub-populations, all methods of selection provided similar power and appropriate Type I error for association. When the SNP is not structured and the phenotype has large structure, then selection methods that do not select PCs for inclusion as covariates generally provide the most power. When there is a structured SNP and a non-structured phenotype, selection methods that include PCs in the model have greater power. When both the SNP and the phenotype are structured, all methods of selection have similar power. Conclusions: Standard practice is to include a fixed number of PCs in genome-wide association studies. Based on our findings, we conclude that if power is not a concern, then selecting the same set of top PCs for adjustment for all SNPs in logistic regression is a strategy that achieves appropriate Type I error. However, standard practice is not optimal in all scenarios and to optimize power for structured SNPs in the presence of unstructured phenotypes, PCs that are associated with the tested SNP should be included in the logistic model.


Related Articles


    The relationship of phenotypic morphological and behavioural variation was investigated for the ascoglossan (=sacoglossan) opisthobranch Placida dendritica. Morphological attributes were documented for individual slugs collected from three algal host species along the central Oregon coast, USA,...

  • Efficient techniques for genotype-phenotype correlational analysis. Saha, Subrata; Rajasekaran, Sanguthevar; Bi, Jinbo; Pathak, Sudipta // BMC Medical Informatics & Decision Making;2013, Vol. 13 Issue 1, p1 

    Background: Single Nucleotide Polymorphisms (SNPs) are sequence variations found in individuals at some specific points in the genomic sequence. As SNPs are highly conserved throughout evolution and within a population, the map of SNPs serves as an excellent genotypic marker. Conventional SNPs...

  • Candidate Gene Analyses of Skeletal Variation in Malocclusion. da Fontoura, C. S. G.; Miller, S. F.; Wehby, G. L.; Amendt, B. A.; Holton, N. E.; Southard, T. E.; Allareddy, V.; Moreno Uribe, L. M. // Journal of Dental Research;Jul2015, Vol. 94 Issue 7, p913 

    This study evaluated associations between craniofacial candidate genes and skeletal variation in patients with malocclusion. Lateral cephalometric radiographs of 269 untreated adults with skeletal classes I, II, and III malocclusion were digitized with 14 landmarks. Two-dimensional coordinates...

  • Extensions to gene set enrichment. Zhen Jiang // Bioinformatics;Feb2007, Vol. 23 Issue 3, p306 

    Motivation: Gene Set Enrichment Analysis (GSEA) has been developed recently to capture changes in the expression of pre-defined sets of genes. We propose number of extensions to GSEA, including the use of different statistics to describe the association between genes and phenotypes of interest....

  • Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits Zaitlen, Noah; Kraft, Peter; Patterson, Nick; Pasaniuc, Bogdan; Bhatia, Gaurav; Pollack, Samuela; Price, Alkes L. // PLoS Genetics;May2013, Vol. 9 Issue 5, p1 

    Important knowledge about the determinants of complex human phenotypes can be obtained from the estimation of heritability, the fraction of phenotypic variation in a population that is determined by genetic factors. Here, we make use of extensive phenotype data in Iceland, long-range phased...

  • Toward the Human Genotope. Huggins, Peter; Pachter, Lior; Sturmfels, Bernd // Bulletin of Mathematical Biology;Nov2007, Vol. 69 Issue 8, p2723 

    The human genotope is the convex hull of all allele frequency vectors that can be obtained from the genotypes present in the human population. In this paper, we take a few initial steps toward a description of this object, which may be fundamental for future population based genetics studies....

  • PCA and Clustering Reveal Alternate mtDNA Phylogeny of N and M Clades. Alexe, G.; Vijaya Satya, R.; Seiler, M.; Platt, D.; Bhanot, T.; Hui, S.; Tanaka, M.; Levine, A.; Bhanot, G. // Journal of Molecular Evolution;Nov2008, Vol. 67 Issue 5, p465 

    Phylogenetic trees based on mtDNA polymorphisms are often used to infer the history of recent human migrations. However, there is no consensus on which method to use. Most methods make strong assumptions which may bias the choice of polymorphisms and result in computational complexity which...

  • Application of principal component analysis to pharmacogenomic studies in Canada. Visscher, H.; Ross, C. J. D.; DubĂ©, M. -P.; Brown, A. M. K.; Phillips, M. S.; Carleton, B. C.; Hayden, M. R. // Pharmacogenomics Journal;Dec2009, Vol. 9 Issue 6, p362 

    Ethnicity can confound results in pharmacogenomic studies. Allele frequencies of loci that influence drug metabolism can vary substantially between different ethnicities and underlying ancestral genetic differences can lead to spurious findings in pharmacogenomic association studies. We...

  • Effect of population stratification analysis on false-positive rates for common and rare variants. Hua He; Xue Zhang; Lili Ding; Baye, Tesfaye M.; Kurowski, Brad G.; Martin, Lisa J. // BMC Proceedings;2011 Supplement 9, Vol. 5 Issue Suppl 9, p1 

    Principal components analysis (PCA) has been successfully used to correct for population stratification in genomewide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics