Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel

Khatkar, Mehar S.; Nicholas, Frank W.; Collins, Andrew R.; Zenger, Kyall R.; Cavanagh, Julie A. L.; Barris, Wes; Schnabel, Robert D.; Taylor, Jeremy F.; Raadsma, Herman W.
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. Results: We computed three LD estimates, D', Dvol and r², for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r² is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r² than for D' and Dvol. The value of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r²) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r²) are required. The bias due to small samples sizes increases with inter-marker interval. LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. Conclusion: For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r² > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r² > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r² a minimum sample of 75 is adequate.


Related Articles

  • Two-stage genome-wide association study identifies integrin beta 5 as having potential role in bull fertility. Feugang, Jean M.; Kaya, Abdullah; Page, Grier P.; Lang Chen; Mehta, Tapan; Hirani, Kashif; Nazareth, Lynne; Topper, Einko; Gibbs, Richard; Memili, Erdogan // BMC Genomics;2009, Vol. 10, Special section p1 

    Background: Fertility is one of the most critical factors controlling biological and financial performance of animal production systems and genetic improvement of lines. The objective of this study was to identify molecular defects in the sperm that are responsible for uncompensable fertility in...

  • A linkage map of the Atlantic salmon (Salmo salar) based on EST-derived SNP markers. Moen, Thomas; Hayes, Ben; Baranski, Matthew; Berg, Paul R.; Kjøglum, Sissel; Koop, Ben F.; Davidson, Willie S.; Omholt, Stig W.; Lien, Sigbjørn // BMC Genomics;2008, Vol. 9, Special section p1 

    Background: The Atlantic salmon is a species of commercial and ecological significance. Like other salmonids, the species displays residual tetrasomy and a large difference in recombination rate between sexes. Linkage maps with full genome coverage, containing both type I and type II markers,...

  • Genome-wide detection of copy number variations using high-density SNP genotyping platforms in Holsteins. Li Jiang; Jicai Jiang; Jie Yang; Xuan Liu; Jiying Wang; Haifei Wang; Xiangdong Ding; Jianfeng Liu; Qin Zhang // BMC Genomics;2013, Vol. 14 Issue 1, p1 

    Background: Copy number variations (CNVs) are widespread in the human or animal genome and are a significant source of genetic variation, which has been demonstrated to play an important role in phenotypic diversity. Advances in technology have allowed for identification of a large number of...

  • Selection of haplotype variables from a high-density marker map for genomic prediction. Cuyabano, Beatriz C. D.; Guosheng Su; Lund, Mogens S. // Genetics Selection Evolution;Aug2015, Vol. 47 Issue 1, p1 

    Background: Using haplotype blocks as predictors rather than individual single nucleotide polymorphisms (SNPs) may improve genomic predictions, since haplotypes are in stronger linkage disequilibrium with the quantitative trait loci than are individual SNPs. It has also been hypothesized that an...

  • A robust linkage map of the porcine autosomes based on gene-associated SNPs. Vingborg, Rikke K. K.; Gregersen, Vivi R.; Bujie Zhan; Panitz, Frank; Høj, Anette; Sørensen, Kirsten K.; Madsen, Lone B.; Larsen, Knud; Hornshøj, Henrik; Xuefei Wang; Bendixen, Christian // BMC Genomics;2009, Vol. 10, Special section p1 

    Background: Genetic linkage maps are necessary for mapping of mendelian traits and quantitative trait loci (QTLs). To identify the actual genes, which control these traits, a map based on gene-associated single nucleotide polymorphism (SNP) markers is highly valuable. In this study, the SNPs...

  • Characteristics of linkage disequilibrium in North American Holsteins. Bohmanova, Jarmila; Sargolzaei, Mehdi; Schenkel, Flavio S. // BMC Genomics;2010, Vol. 11, p421 

    Background: Effectiveness of genomic selection and fine mapping is determined by the level of linkage disequilibrium (LD) across the genome. Knowledge of the range of genome-wide LD, defined as a non-random association of alleles at different loci, can provide an insight into the optimal density...

  • Precision-mapping and statistical validation of quantitative trait loci by machine learning. Bedo, Justin; Wenzl, Peter; Kowalczyk, Adam; Kilian, Andrzej // BMC Genetics;2008, Vol. 9, Special section p1 

    Background: We introduce a QTL-mapping algorithm based on Statistical Machine Learning (SML) that is conceptually quite different to existing methods as there is a strong focus on generalisation ability. Our approach combines ridge regression, recursive feature elimination, and estimation of...

  • Genome-Wide Estimates of Coancestry, Inbreeding and Effective Population Size in the Spanish Holstein Population. Rodríguez-Ramilo, Silvia Teresa; Fernández, Jesús; Toro, Miguel Angel; Hernández, Delfino; Villanueva, Beatriz // PLoS ONE;Apr2015, Vol. 10 Issue 4, p1 

    Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a...

  • Association Between Gene Polymorphism of Bovine Growth Hormone and Milk Traits in the Iranian Holstein Bulls. Sadeghi, M.; Shahr-e-Babak, M. Moradi; Rahimi, G.; Javaremi, A. Nejati // Asian Journal of Animal Sciences;2010, Vol. 4 Issue 3, p107 

    No abstract available.


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics