Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly

Saini, Navinder; Shultz, Jeffry; Lightfoot, David A.
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: Many of the world's most important food crops have either polyploid genomes or homeologous regions derived from segmental shuffling following polyploid formation. The soybean (Glycine max) genome has been shown to be composed of approximately four thousand short interspersed homeologous regions with 1, 2 or 4 copies per haploid genome by RFLP analysis, microsatellite anchors to BACs and by contigs formed from BAC fingerprints. Despite these similar regions,, the genome has been sequenced by whole genome shotgun sequence (WGS). Here the aim was to use BAC end sequences (BES) derived from three minimum tile paths (MTP) to examine the extent and homogeneity of polyploid-like regions within contigs and the extent of correlation between the polyploid-like regions inferred from fingerprinting and the polyploid-like sequences inferred from WGS matches. Results: Results show that when sequence divergence was 1-10%, the copy number of homeologous regions could be identified from sequence variation in WGS reads overlapping BES. Homeolog sequence variants (HSVs) were single nucleotide polymorphisms (SNPs; 89%) and single nucleotide indels (SNIs 10%). Larger indels were rare but present (1%). Simulations that had predicted fingerprints of homeologous regions could be separated when divergence exceeded 2% were shown to be false. We show that a 5-10% sequence divergence is necessary to separate homeologs by fingerprinting. BES compared to WGS traces showed polyploid-like regions with less than 1% sequence divergence exist at 2.3% of the locations assayed. Conclusion: The use of HSVs like SNPs and SNIs to characterize BACs wil improve contig building methods. The implications for bioinformatic and functional annotation of polyploid and paleopolyploid genomes show that a combined approach of BAC fingerprint based physical maps, WGS sequence and HSV-based partitioning of BAC clones from homeologous regions to separate contigs will allow reliable de-convolution and positioning of sequence scaffolds (see BES_scaffolds section of SoyGD). This approach will assist genome annotation for paleopolyploid and true polyploid genomes such as soybean and many important cereal and fruit crops.


Related Articles

  • Multiple origins of allopolyploid Aegilops triuncialis. Vanichanon, A.; Blake, N.K.; Sherman, J.D.; Talbert, L.E. // Theoretical & Applied Genetics;Mar2003, Vol. 106 Issue 5, p804 

    Polyploidization is a key component of plant evolution. The number of independent origins of polyploid species traditionally has been underestimated. The objective of this study was to ascertain the number of origins of a tetraploid Aegilops species. We screened 84 primer sets to identify...

  • Transferability of wheat microsatellites to diploid Triticeae species carrying the A, B and D genomes. Sourdille, P.; Tavaud, M.; Charmet, G.; Bernard, M. // Theoretical & Applied Genetics;Aug2001, Vol. 103 Issue 2/3, p346 

    Hexaploid wheat (Triticum aestivum L em Thell) is derived from a complex hybridization procedure involving three diploid species carrying the A, B and D genomes. In this study, we evaluated the ability of microsatellite sequences from T. aestivum to be revealed on different ancestral diploid...

  • Polyploidization-induced genome variation in triticale. Xue-Feng Ma; Peng Fang; Gustafson, J. Perry // Genome;Oct2004, Vol. 47 Issue 5, p839 

    Polyploidization-induced genome variation in triticale (× Triticosecale Wittmack) was investigated using both AFLP and RFLP analyses. The AFLP analyses were implemented with both EcoRI–MseI (E–M) and PstI–MseI (P–M) primer combinations, which, because of their...

  • Three TERT genes in Nicotiana tabacum. Sýkorová, Eva; Fulnečková, Jana; Mokroš, Petr; Fajkus, Jiří; Fojtová, Miloslava; Peška, Vratislav // Chromosome Research;May2012, Vol. 20 Issue 4, p381 

    Telomerase is essential for proper functioning of telomeres in eukaryotes. We cloned and characterised genes for the protein subunit of telomerase (TERT) in the allotetraploid Nicotiana tabacum (tobacco) and its diploid progenitor species Nicotiana sylvestris and Nicotiana tomentosiformis with...

  • Accessing complex crop genomes with next-generation sequencing. Edwards, David; Batley, Jacqueline; Snowdon, Rod // Theoretical & Applied Genetics;Jan2013, Vol. 126 Issue 1, p1 

    Many important crop species have genomes originating from ancestral or recent polyploidisation events. Multiple homoeologous gene copies, chromosomal rearrangements and amplification of repetitive DNA within large and complex crop genomes can considerably complicate genome analysis and gene...

  • Complete Genome Sequence of the Soybean Symbiont Bradyrhizobium japonicum Strain USDA6T. Kaneko, Takakazu; Maita, Hiroko; Hirakawa, Hideki; Uchiike, Nobukazu; Minamisawa, Kiwamu; Watanabe, Akiko; Sato, Shusei // Genes;Dec2011, Vol. 2 Issue 4, p763 

    The complete nucleotide sequence of the genome of the soybean symbiont Bradyrhizobium japonicum strain USDA6T was determined. The genome of USDA6 T is a single circular chromosome of 9,207,384 bp. The genome size is similar to that of the genome of another soybean symbiont, B. japonicum USDA110...

  • Progress towards a reference genome for sunflower. Kane, N.C.; Gill, N.; King, M.G.; Bowers, J.E.; Berges, H.; Gouzy, J.; Bachlava, E.; Langlade, N.B.; Lai, Z.; Stewart, M.; Burke, J.M.; Vincourt, P.; Knapp, S.J.; Rieseberg, L.H. // Botany;Jul2011, Vol. 89 Issue 7, p429 

    The Compositae is one of the largest and most economically important families of flowering plants and includes a diverse array of food crops, horticultural crops, medicinals, and noxious weeds. Despite its size and economic importance, there is no reference genome sequence for the Compositae,...

  • The origin of polyploid genomes of bluegrasses Poa L. and Gene flow between northern pacific and sub-Antarctic Islands. Rodionov, A.; Nosov, N.; Kim, E.; Machs, E.; Punina, E.; Probatova, N. // Russian Journal of Genetics;Dec2010, Vol. 46 Issue 12, p1407 

    The involvement of present-day diploid bluegrass species in the formation of polyploid genomes was investigated using comparison of sequences of internal transcribed spacers ITS1 and ITS2, and the 5.8S rRNA sequence. It was demonstrated that highly polyploid New Zealand bluegrasses, P. cita (2 n...

  • Isolating promoters of multigene family members from the polyploid sugarcane genome by PCR-based walking in BAC DNA. Damaj, Mona B.; Beremand, Phillip D.; Buenrostro-Nava, Marco T.; Ivy, John; Kumpatla, Siva P.; Jifon, John; Beyene, Getu; Yu, Qingyi; Thomas, Terry L.; Mirkov, T. Erik // Genome;Oct2010, Vol. 53 Issue 10, p840 

    The availability of a wider range of promoters for regulated expression in valuable transgenic crops would benefit functional genomics studies and current biotechnology programs aimed at improved productivity. Polymerase chain reaction (PCR)-based genome walking techniques are commonly used to...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics