Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations

de Crécy-Lagard, Valérie; El Yacoubi, Basma; de la Garza, Rocio Díaz; Noiriel, Alexandre; Hanson, Andrew D
January 2007
BMC Genomics;2007, Vol. 8, p245
Academic Journal
Background: Folate synthesis and salvage pathways are relatively well known from classical biochemistry and genetics but they have not been subjected to comparative genomic analysis. The availability of genome sequences from hundreds of diverse bacteria, and from Arabidopsis thaliana, enabled such an analysis using the SEED database and its tools. This study reports the results of the analysis and integrates them with new and existing experimental data.

Results: Based on sequence similarity and the clustering, fusion, and phylogenetic distribution of genes, several functional predictions emerged from this analysis. For bacteria, these included the existence of novel GTP cyclohydrolase I and folylpolyglutamate synthase gene families, and of a trifunctional p-aminobenzoate synthesis gene. For plants and bacteria, the predictions comprised the identities of a 'missing' folate synthesis gene (folQ) and of a folate transporter, and the absence from plants of a folate salvage enzyme. Genetic and biochemical tests bore out these predictions.

Conclusion: For bacteria, these results demonstrate that much can be learnt from comparative genomics, even for well-explored primary metabolic pathways. For plants, the findings particularly illustrate the potential for rapid functional assignment of unknown genes that have prokaryotic homologs, by analyzing which genes are associated with the latter. More generally, our data indicate how combined genomic analysis of both plants and prokaryotes can be more powerful than isolated examination of either group alone.


Related Articles

  • TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation. Bernard, Virginie; Brunaud, Véronique; Lecharny, Alain // BMC Genomics;2010, Vol. 11, p166 

    Background: The TATA-box and TATA-variants are regulatory elements involved in the formation of a transcription initiation complex. Both have been conserved throughout evolution in a restricted region close to the Transcription Start Site (TSS). However, less than half of the genes in model...

  • Intragenomic Matching Reveals a Huge Potential for miRNA-Mediated Regulation in Plants. Lindow, Morten; Jacobsen, Anders; Nygaard, Sanne; Yuan Mang; Krogh, Anders // PLoS Computational Biology;Nov2007, Vol. 3 Issue 11, pe238 

    microRNAs (miRNAs) are important post-transcriptional regulators, but the extent of this regulation is uncertain, both with regard to the number of miRNA genes and their targets. Using an algorithm based on intragenomic matching of potential miRNAs and their targets coupled with support vector...

  • Genome-Wide Association Mapping in Arabidopsis Identifies Previously Known Flowering Time and Pathogen Resistance Genes. Aranzana, María José; Kim, Sung; Zhao, Keyan; Bakker, Erica; Horton, Matthew; Jakob, Katrin; Lister, Clare; Molitor, John; Shindo, Chikako; Tang, Chunlao; Toomajian, Christopher; Traw, Brian; Zheng, Honggang; Bergelson, Joy; Dean, Caroline; Marjoram, Paul; Nordborg, Magnus; Doebley, John // PLoS Genetics;Nov2005, Vol. 1 Issue 5, p531 

    There is currently tremendous interest in the possibility of using genome-wide association mapping to identify genes responsible for natural variation, particularly for human disease susceptibility. The model plant Arabidopsis thaliana is in many ways an ideal candidate for such studies,...

  • Selection for the compactness of highly expressed genes in Gallus gallus. Rao, You S.; Wang, Zhang F.; Chai, Xue W.; Wu, Guo Z.; Ming Zhou; Nie, Qing H.; Zhang, Xi Q. // Biology Direct;2010, Vol. 5, p35 

    Background: Coding sequence (CDS) length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between...

  • Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction. Qian Liu; Aaron J. Mackey; David S. Roos; Fernando C. N. Pereira // Bioinformatics;Mar2008, Vol. 24 Issue 5, p597 

    Motivation: The increasing diversity and variable quality of evidence relevant to gene annotation argues for a probabilistic framework that automatically integrates such evidence to yield candidate gene models. Results: Evigan is an automated gene annotation program for eukaryotic genomes,...

  • Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation. Sherstnev, Alexander; Duc, Céline; Cole, Christian; Zacharaki, Vasiliki; Hornyik, Csaba; Ozsolak, Fatih; Milos, Patrice M; Barton, Geoffrey J; Simpson, Gordon G // Nature Structural & Molecular Biology;Jul2012, Vol. 19 Issue 8, p845 

    It has recently been shown that RNA 3′-end formation plays a more widespread role in controlling gene expression than previously thought. To examine the impact of regulated 3′-end formation genome-wide, we applied direct RNA sequencing to A. thaliana. Here we show the authentic...

  • Genome-wide mapping with biallelic markers in Arabidopsis thaliana. Cho, Raymond J.; Mindrinos, Michael; Richards, Daniel R.; Sapolsky, Ronald J.; Anderson, Mary; Drenkard, Eliana; Dewdney, Julia; Reuber, T. Lynne; Stammers, Melanie; Federspiel, Nancy; Theologis, Athanasios; Yang, Wei-Hsien; Hubbell, Earl; Au, Melinda; Chung, Edward Y.; Lashkari, Deval; Lemieux, Bertrand; Dean, Caroline; Lipshutz, Robert J. // Nature Genetics;Oct99, Vol. 23 Issue 2, p203 

    Single-nucleotide polymorphisms, as well as small insertions and deletions (here referred to collectively as simple nucleotide polymorphisms, or SNPs), comprise the largest set of sequence variants in most organisms. Positional cloning based on SNPs may accelerate the identification of human...

  • Comparison of a Brassica oleracea Genetic Map With the Genome of Arabidopsis thaliana. Lukens, Lewis; Fei Zou; Lydiate, Derek; Parkin, Isobel; Osborn, Tom // Genetics;May2003, Vol. 164 Issue 1, p359 

    Compares Brassica oleracea genetic map with the genome of Arabidopsis thaliana. Use of explicit criteria to distinguish orthologous from paralogous loci; Development of a conservative algorithm to identify collinear loci between genomes and a permutation test; Identification of several...

  • Quantifying the Variation in the Effective Population Size Within a Genome. Gossmann, Toni I.; Woolfit, Megan; Eyre-Walker, Adam // Genetics;Dec2011, Vol. 189 Issue 4, p1389 

    The effective population size (Ne) is one of the most fundamental parameters in population genetics. It is thought to vary across the genome as a consequence of differences in the rate of recombination and the density of selected sites due to the processes of genetic hitchhiking and background...

