A sequence based synteny map between soybean and Arabidopsis thaliana

Shultz, Jeffry L; Ray, Jeffery D; Lightfoot, David A
January 2007
BMC Genomics;2007, Vol. 8, p1
Academic Journal
Background: Soybean (Glycine max, L. Merr.) is one of the world's most important crops; however, its complete genomic sequence has yet to be determined. Nonetheless, a large body of sequence information exists, particularly in the form of expressed sequence tags (ESTs). Herein, we report the use of the model organism Arabidopsis thaliana (thale cress), for which the entire genomic sequence is available, as a framework to align thousands of short soybean sequences.

Results: A series of JAVA-based programs was created to process and compare 341,619 soybean DNA sequences against A. thaliana chromosomal DNA. A. thaliana DNA was probed for short, exact matches (15 bp) to each soybean sequence, and then checked for the number of additional 7 bp matches in the adjacent 400 bp region. The positions of these matches were used to order soybean sequences in relation to the A. thaliana genome.

Conclusion: Reported associations between soybean sequences and A. thaliana were within a 95% confidence interval of e-30 to e-100. In addition, the clustering of soybean ESTs based on A. thaliana sequence was accurate enough to identify potential single nucleotide polymorphisms (SNPs) within the soybean sequence clusters. An EST, bacterial artificial chromosome (BAC) end sequence and marker amplicon sequence synteny map of soybean and A. thaliana is presented. In addition, all JAVA programs used to create this map are available upon request and on the web.
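The matching step described in the Results (a 15 bp exact seed, then a count of additional 7 bp matches in the adjacent 400 bp) can be illustrated with a minimal Java sketch. This is not the authors' program: the class name `SeedExtendSketch`, its methods, and the `Hit` type are hypothetical. The choices to search 400 bp on both sides of the seed and to count distinct 7-mers are assumptions, since the abstract does not specify them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of a seed-and-support search; not the published code.
class SeedExtendSketch {
    static final int SEED = 15;    // exact-match length, from the abstract
    static final int SHORT = 7;    // supporting-match length, from the abstract
    static final int WINDOW = 400; // adjacent region size, from the abstract

    // One candidate placement of a query (soybean) sequence on the reference.
    static final class Hit {
        final int refPos, queryPos, support;
        Hit(int refPos, int queryPos, int support) {
            this.refPos = refPos; this.queryPos = queryPos; this.support = support;
        }
    }

    // Index every SEED-length substring of the reference by start position.
    static Map<String, List<Integer>> indexSeeds(String ref) {
        Map<String, List<Integer>> index = new HashMap<>();
        for (int i = 0; i + SEED <= ref.length(); i++) {
            index.computeIfAbsent(ref.substring(i, i + SEED),
                                  k -> new ArrayList<>()).add(i);
        }
        return index;
    }

    // Count distinct query 7-mers, outside the seed, that also occur in the
    // reference within WINDOW bp of the seed (assumption: both sides).
    static int countSupport(String ref, int refPos, String query, int queryPos) {
        int start = Math.max(0, refPos - WINDOW);
        int end = Math.min(ref.length(), refPos + SEED + WINDOW);
        Set<String> refKmers = new HashSet<>();
        for (int i = start; i + SHORT <= end; i++) {
            if (i >= refPos && i + SHORT <= refPos + SEED) continue; // inside seed
            refKmers.add(ref.substring(i, i + SHORT));
        }
        Set<String> queryKmers = new HashSet<>();
        for (int j = 0; j + SHORT <= query.length(); j++) {
            if (j >= queryPos && j + SHORT <= queryPos + SEED) continue; // inside seed
            queryKmers.add(query.substring(j, j + SHORT));
        }
        queryKmers.retainAll(refKmers);
        return queryKmers.size();
    }

    // Find every exact 15 bp seed match and score it by its 7-mer support.
    static List<Hit> findHits(String ref, Map<String, List<Integer>> index,
                              String query) {
        List<Hit> hits = new ArrayList<>();
        for (int q = 0; q + SEED <= query.length(); q++) {
            List<Integer> positions = index.get(query.substring(q, q + SEED));
            if (positions == null) continue;
            for (int p : positions) {
                hits.add(new Hit(p, q, countSupport(ref, p, query, q)));
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Toy data only: a 30 bp query embedded in a poly-A "chromosome".
        String query = "ACGTTGCAAGCTTAGGCTACCGATTCGGAT";
        String ref = "A".repeat(50) + query + "A".repeat(50);
        for (Hit h : findHits(ref, indexSeeds(ref), query)) {
            System.out.println("ref=" + h.refPos + " query=" + h.queryPos
                               + " support=" + h.support);
        }
    }
}
```

Sorting the resulting hits by `refPos` would give an ordering of query sequences along the reference, which is the role the abstract assigns to match positions. A production version would also need to handle reverse-complement matches and chromosome-scale memory use, neither of which is attempted here.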


Related Articles

  • Genome-wide mapping of Arabidopsis thaliana origins of DNA replication and their associated epigenetic marks. Costas, Celina; de la Paz Sanchez, Maria; Stroud, Hume; Yanchun Yu; Oliveros, Juan Carlos; Suhua Feng; Benguria, Alberto; López-Vidriero, Irene; Zhang, Xiaoyu; Solano, Roberto; Jacobsen, Steven E.; Gutierrez, Crisanto // Nature Structural & Molecular Biology;Mar2011, Vol. 18 Issue 3, p395 

    Genome integrity requires faithful chromosome duplication. Origins of replication, the genomic sites at which DNA replication initiates, are scattered throughout the genome. Their mapping at a genomic scale in multicellular organisms has been challenging. In this study we profiled origins in...

  • GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences. Cumbie, Jason S.; Kimbrel, Jeffrey A.; Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H. // PLoS ONE;2011, Vol. 6 Issue 10, p1 

    GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without...

  • 2b-RAD: a simple and flexible method for genome-wide genotyping. Wang, Shi; Meyer, Eli; McKay, John K; Matz, Mikhail V // Nature Methods;Aug2012, Vol. 9 Issue 8, p808 

    We describe 2b-RAD, a streamlined restriction site-associated DNA (RAD) genotyping method based on sequencing the uniform fragments produced by type IIB restriction endonucleases. Well-studied accessions of Arabidopsis thaliana were genotyped to validate the method's accuracy and to demonstrate...

  • Extensive Natural Epigenetic Variation at a De Novo Originated Gene. Silveira, Amanda Bortolini; Trontin, Charlotte; Cortijo, Sandra; Barau, Joan; Bem, Luiz Eduardo Vieira Del; Loudet, Olivier; Colot, Vincent; Vincentz, Michel // PLoS Genetics;Apr2013, Vol. 9 Issue 4, Special section p1 

    Epigenetic variation, such as heritable changes of DNA methylation, can affect gene expression and thus phenotypes, but examples of natural epimutations are few and little is known about their stability and frequency in nature. Here, we report that the gene Qua-Quine Starch (QQS) of Arabidopsis...

  • QTL Analysis Using SNP Markers Developed by Next-Generation Sequencing for Identification of Candidate Genes Controlling 4-Methylthio-3-Butenyl Glucosinolate Contents in Roots of Radish, Raphanus sativus L. Zou, Zhongwei; Ishida, Masahiko; Feng Li; Kakizaki, Tomohiro; Suzuki, Sho; Kitashiba, Hiroyasu; Nishio, Takeshi // PLoS ONE;Jan2013, Vol. 8 Issue 1, Special section p1 

    SNP markers for QTL analysis of 4-MTB-GSL contents in radish roots were developed by determining nucleotide sequences of bulked PCR products using a next-generation sequencer. DNA fragments were amplified from two radish lines by multiplex PCR with six primer pairs, and those amplified by 2,880...

  • Conserved non-coding sequences are associated with rates of mRNA decay in Arabidopsis. Spangler, Jacob B.; Feltus, Frank Alex // Frontiers in Plant Science;May2013, Vol. 4, p1 

    Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding...

  • Patterns of Substitution Rate Variation at Many Nuclear Loci in Two Species Trios in the Brassicaceae Partitioned with ANOVA. Braverman, John; Hamilton, Matthew; Johnson, Brent // Journal of Molecular Evolution;Oct2016, Vol. 83 Issue 3/4, p97 

    There are marked variations among loci and among lineages in rates of nucleotide substitution. The generation time hypothesis (GTH) is a neutral explanation for substitution rate heterogeneity that has genomewide application, predicting that species with shorter generation times accumulate DNA...

  • A plant-transformation-competent BIBAC library from the Arabidopsis thaliana Landsberg ecotype for functional and comparative genomics. Chang, Y.-L.; Henriquez, X.; Preuss, D.; Copenhaver, G.; Zhang, H.-B. // Theoretical & Applied Genetics;Jan2003, Vol. 106 Issue 2, p269 

    The genome of the model plant Arabidopsis thaliana has been sequenced to near completion. To facilitate experimental determination of the function of every gene in the species, we constructed a large-insert library from the Landsberg ecotype using a plant-transformation-competent binary BAC...

  • Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. Campbell, Matthew A; Haas, Brian J; Hamilton, John P; Mount, Stephen M; Buell, C Robin // BMC Genomics;2006, Vol. 7, p327 

    Background: Recently, genomic sequencing efforts were finished for Oryza sativa (cultivated rice) and Arabidopsis thaliana (Arabidopsis). Additionally, these two plant species have extensive cDNA and expressed sequence tag (EST) libraries. We employed the Program to Assemble Spliced Alignments...

