On the identification of potential regulatory variants within genome wide association candidate SNP sets

Chih-yu Chen; I-Shou Chang; Hsiung, Chao A.; Wasserman, Wyeth W.
July 2014
BMC Medical Genomics;2014, Vol. 7 Issue 1, p2
Academic Journal
Background Genome wide association studies (GWAS) are a population-scale approach to the identification of segments of the genome in which genetic variations may contribute to disease risk. Current methods focus on the discovery of single nucleotide polymorphisms (SNPs) associated with disease traits. As there are many SNPs within identified risk loci, and the majority of these are situated within non-coding regions, a key challenge is to identify and prioritize variants affecting regulatory sequences that are likely to contribute to the phenotype assessed. Methods We focused investigation on SNPs within lung and breast cancer GWAS loci that reached genome-wide significance for potential roles in gene regulation with a specific focus on SNPs likely to disrupt transcription factor binding sites. Within risk loci, the regulatory potential of sub-regions was classified using relevant open chromatin and epigenetic high throughput sequencing data sets from the ENCODE project in available cancer and normal cell lines. Furthermore, transcription factor affinity altering variants were predicted by comparison of position weight matrix scores between disease and reference alleles. Lastly, ChIP-seq data of transcription associated factors and topological domains were included as binding evidence and potential gene target inference. Results The sets of SNPs, including both the disease-associated markers and those in high linkage disequilibrium with them, were significantly over-represented in regulatory sequences of cancer and/or normal cells; however, over-representation was generally not restricted to disease-relevant tissue specific regions. The calculated regulatory potential, allelic binding affinity scores and ChIP-seq binding evidence were the three criteria used to prioritize candidates. Fitting all three criteria, we highlighted breast cancer susceptibility SNPs and a borderline lung cancer relevant SNP located in cancer-specific enhancers overlapping multiple distinct transcription associated factor ChIP-seq binding sites. Conclusion Incorporating high throughput sequencing epigenetic and transcription factor data sets from both cancer and normal cells into cancer genetic studies reveals potential functional SNPs and informs subsequent characterization efforts.


Related Articles

  • Dynamic, Large-Scale Profiling of Transcription Factor Activity from Live Cells in 3D Culture. Weiss, Michael S.; Bernabé, Beatriz Peñalver; Bellis, Abigail D.; Broadbelt, Linda J.; Jeruss, Jacqueline S.; Shea, Lonnie D. // PLoS ONE;2010, Vol. 5 Issue 11, p1 

    Background: Extracellular activation of signal transduction pathways and their downstream target transcription factors (TFs) are critical regulators of cellular processes and tissue development. The intracellular signaling network is complex, and techniques that quantify the activities of...

  • Evolutionary Consequences, Constraints and Potential of Polyploidy in Plants. Weiss-Schneeweiss, H.; Emadzade, K.; Jang, T.-S.; Schneeweiss, G.M. // Cytogenetic & Genome Research;Jul2013, Vol. 140 Issue 2-4, p137 

    Polyploidy, the possession of more than 2 complete genomes, is a major force in plant evolution known to affect the genetic and genomic constitution and the phenotype of an organism, which will have consequences for its ecology and geography as well as for lineage diversification and speciation....

  • Effects of DNA mass on multiple displacement whole genome amplification and genotyping performance. Bergen, Andrew W; Ying Qi; Haque, Kashif A; Welch, Robert A; Chanock, Stephen J // BMC Biotechnology;2005, Vol. 5, p24 

    Background: Whole genome amplification (WGA) promises to eliminate practical molecular genetic analysis limitations associated with genomic DNA (gDNA) quantity. We evaluated the performance of multiple displacement amplification (MDA) WGA using gDNA extracted from lymphoblastoid cell lines (N =...

  • A system for studying evolution of life-like virtual organisms. Neyfakh, Alex A.; Baranova, Natalya N.; Mizrokhi, Lev J. // Biology Direct;2006, Vol. 1, p23 

    Background: Fitness landscapes, the dependences of fitness on the genotype, are of critical importance for the evolution of living beings. Unfortunately, fitness landscapes that are relevant to the evolution of complex biological functions are very poorly known. As a result, the existing theory...

  • ENNET: inferring large gene regulatory networks from expression data using gradient boosting. Sławek, Janusz; Arodź, Tomasz // BMC Systems Biology;2013, Vol. 7 Issue 1, p2 

    Background The regulation of gene expression by transcription factors is a key determinant of cellular phenotypes. Deciphering genome-wide networks that capture which transcription factors regulate which genes is one of the major efforts towards understanding and accurate modeling of living...

  • A Functional Module of Yeast Mediator That Governs the Dynamic Range of Heat-Shock Gene Expression. Singh, Harpreet; Erkine, Alexander M.; Kremer, Selena B.; Duttweiler, Harry M.; Davis, Donnie A.; Iqbal, Jabed; Gross, Rachel R.; Gross, David S. // Genetics;Apr2006, Vol. 172 Issue 4, p2169 

    We report the results of a genetic screen designed to identify transcriptional coregulators of yeast heat-shock factor (HSF). This sequence-specific activator is required to stimulate both basal and induced transcription; however, the identity of factors that collaborate with HSF in governing...

  • Benefits of noise. Skipper, Magdalena // Nature Reviews Genetics;Feb2007, Vol. 8 Issue 2, p92 

    The article examines the influence of transcription factors on the gene expression variability among cells. The article reveals that variation in the sequence of TATA box directly affects the frequency and magnitude of gene expression and transcription activation of individual cells. The...

  • Lef1 Haploinsufficient Mice Display a Low Turnover and Low Bone Mass Phenotype in a Gender- and Age-Specific Manner. Noh, Tommy; Gabet, Yankel; Cogan, Jon; Yunfan Shi; Tank, Archana; Sasaki, Tomoyo; Criswell, Braden; Dixon, Alexis; Lee, Christopher; Tam, Joseph; Kohler, Thomas; Segev, Eran; Kockeritz, Lisa; Woodgett, James; Müller, Ralph; Yang Chai; Smith, Elisheva; Bab, Itai; Frenkel, Baruch // PLoS ONE;2009, Vol. 4 Issue 5, p1 

    We investigated the role of Lef1, one of the four transcription factors that transmit Wnt signaling to the genome, in the regulation of bone mass. Microcomputed tomographic analysis of 13- and 17-week-old mice revealed significantly reduced trabecular bone mass in Lef1+/- females compared to...

  • Characterization of Transcription Factors Following Expression Profiling of Medicago truncatula- Botrytis spp. Interactions. Villegas-Fernández, Ángel; Krajinski, Franziska; Schlereth, Armin; Madrid, Eva; Rubiales, Diego // Plant Molecular Biology Reporter;Oct2014, Vol. 32 Issue 5, p1030 

    Medicago truncatula may be used as a model plant to study the pathosystem Vicia faba/ Botrytis spp. This study aims to investigate the profiling of the transcription factors (TFs) of M. truncatula involved in the interaction with Botrytis cinerea and Botrytis fabae. The response of two genotypes...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics