Phylogenomics of plant genomes: a methodology for genome-wide searches for orthologs in plants

Conte, Matthieu G.; Gaillard, Sylvain; Droc, Gaetan; Perin, Christophe
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: Gene ortholog identification is now a major objective for mining the increasing amount of sequence data generated by complete or partial genome sequencing projects. Comparative and functional genomics urgently need a method for ortholog detection to reduce errors in gene function inference and to aid in the identification of conserved or divergent genetic pathways among several species. As gene functions change during evolution, reconstructing the evolutionary history of genes should be a more accurate way to differentiate orthologs from paralogs. Phylogenomics takes into account phylogenetic information from high-throughput genome annotation and is the most straightforward way to infer orthologs. However, procedures for automatic detection of orthologs are still scarce and suffer from several limitations. Results: We developed a procedure for ortholog prediction between Oryza sativa and Arabidopsis thaliana. First, we established an efficient method to cluster the full A. thaliana and O. sativa proteomes into gene families. Then, we developed an optimized phylogenomics pipeline for ortholog inference. We validated the full procedure using test sets of orthologs and paralogs to demonstrate that our method outperforms pairwise methods for ortholog prediction. Conclusion: Our procedure achieved a high level of accuracy in predicting ortholog and paralog relationships. Phylogenomic predictions for all validated gene families in both species were easily achieved, and we can conclude that our methodology outperforms similarity-based methods.
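
The idea of separating orthologs from paralogs by reconstructing gene history can be illustrated with a small sketch. This is not the authors' pipeline: it applies a simple species-overlap rule to a hand-written toy gene tree, where a node whose two subtrees share a species is treated as a duplication and any other node as a speciation. The gene names, the `Os_`/`At_` prefix convention, and the functions `species_of` and `classify_pairs` are all hypothetical and exist only for this illustration.

```python
from itertools import product


def species_of(gene):
    # Assumption: the species code is the part of the name before "_".
    return gene.split("_")[0]


def classify_pairs(tree):
    """Return (leaves, orthologs, paralogs) for a rooted binary gene tree.

    A tree is either a leaf (a gene name string) or a 2-tuple of subtrees.
    Gene pairs are stored as frozensets so that order does not matter.
    """
    if isinstance(tree, str):
        return [tree], set(), set()

    left, right = tree
    l_leaves, l_orth, l_para = classify_pairs(left)
    r_leaves, r_orth, r_para = classify_pairs(right)
    orthologs = l_orth | r_orth
    paralogs = l_para | r_para

    # Species-overlap rule: shared species below both children
    # marks this node as a duplication, otherwise as a speciation.
    l_species = {species_of(g) for g in l_leaves}
    r_species = {species_of(g) for g in r_leaves}
    is_duplication = bool(l_species & r_species)

    # Every pair of genes that first meets at this node inherits its label.
    target = paralogs if is_duplication else orthologs
    for a, b in product(l_leaves, r_leaves):
        target.add(frozenset((a, b)))

    return l_leaves + r_leaves, orthologs, paralogs


# Toy family: an ancient duplication (root) gave subfamilies A and B,
# and a later rice-specific duplication split Os_geneA1 / Os_geneA2.
toy_tree = ((("Os_geneA1", "Os_geneA2"), "At_geneA"), ("Os_geneB", "At_geneB"))
_, orthologs, paralogs = classify_pairs(toy_tree)

for pair in sorted(sorted(p) for p in orthologs):
    print("ortholog pair:", pair)
```

In this toy tree, `At_geneA` is orthologous to both rice copies in subfamily A, while `At_geneA` and `Os_geneB` are paralogs because they diverged at the root duplication. A pairwise best-hit comparison has no access to that tree structure, which is the kind of case a tree-based approach is meant to resolve.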


Related Articles

  • Isolation and characterization of conserved non-coding sequences among rice (Oryza sativa L.) paralogous regions. Xianran Li; Lubin Tan; Liguo Wang; Songnian Hu; Chuanqing Sun // Molecular Genetics & Genomics;Jan2009, Vol. 281 Issue 1, p11

    Segmental duplication is particularly frequent within plant genomes and the ability of the original single-copy gene to gain a new function for the change of regulatory elements is one of the prominent consequences of duplication. Thus, it is important to study the pattern of conserved...

  • The shrunken genome of Arabidopsis thaliana. Oyama, Ryan K.; Clauss, Maria J.; Formanová, Nataša; Kroymann, Jürgen; Schmid, Karl J.; Vogel, Heiko; Weniger, Kerstin; Windsor, Aaron J.; Mitchell-Olds, Thomas // Plant Systematics & Evolution;Jul2008, Vol. 273 Issue 3/4, p257 

    This paper examines macro and micro-level patterns of genome size evolution in the Brassicaceae. A phylogeny of 25 relatives of Arabidopsis thaliana was reconstructed using four molecular markers under both parsimony and Bayesian methods. Reconstruction of genome size (C value) evolution as a...

  • Characterization of paralogous protein families in rice. Haining Lin; Shu Ouyang; Egan, Amy; Nobuta, Kan; Haas, Brian J.; Wei Zhu; Xun Gu; Silva, Joana C.; Meyers, Blake C.; Buell, C. Robin // BMC Plant Biology;2008, Vol. 8, Special section p1 

    Background: High gene numbers in plant genomes reflect polyploidy and major gene duplication events. Oryza sativa, cultivated rice, is a diploid monocotyledonous species with a ~390 Mb genome that has undergone segmental duplication of a substantial portion of its genome. This, coupled with...

  • Phylogenetic analyses provide the first insights into the evolution of OVATE family proteins in land plants. Liu, Di; Sun, Wei; Yuan, Yaowu; Zhang, Ning; Hayward, Alice; Liu, Yongliang; Wang, Ying // Annals of Botany;Jun2014, Vol. 113 Issue 7, p1219 

    Background and Aims The OVATE gene encodes a nuclear-localized regulatory protein belonging to a distinct family of plant-specific proteins known as the OVATE family proteins (OFPs). OVATE was first identified as a key regulator of fruit shape in tomato, with nonsense mutants displaying...

  • Isolation and characterisation of an HpSHP gene from Hosta plantaginea. Rao, Guodong; Wang, Ying; Zhang, Dandan; Liu, Di; Li, Fenglan; Lu, Hai // Molecular Biology Reports;Jun2012, Vol. 39 Issue 6, p6887 

    Based on genetic and molecular analyses, the ABC model has been proposed to explain the genetic control of floral development. C-class MADS-box genes play crucial roles in Arabidopsis thaliana development by regulating the organ identities of stamens and gynoecium. The present research reports...

  • Context sequence for transcription factors surrounding start codon in model crops. Jaiswal, Mohenish; Rangan, Latha // Current Science (00113891);7/25/2007, Vol. 93 Issue 2, p215 

    The context of consensus sequences surrounding start codons was determined for two model crops, viz. Arabidopsis thaliana and Oryza sativa that have well-characterized transcription factor databases. Arabidopsis exhibits AT-richness, whereas Oryza exhibits GC-richness upstream and downstream of...

  • Genome Size Variation among Accessions of Arabidopsis thaliana. Schmuths, Heike; Meister, Armin; Horres, Ralf; Bachmann, Konrad // Annals of Botany;Mar2004, Vol. 93 Issue 3, p317

    Background and Aims Estimates of the amount of nuclear DNA of Arabidopsis thaliana, known to be among the lowest within angiosperms, vary considerably. This study aimed to determine genome size of a range of accessions from throughout the entire Eurasian range of the species...

  • Analysis of a Triple Testcross Design With Recombinant Inbred Lines Reveals a Significant Role of Epistasis in Heterosis for Biomass-Related Traits in Arabidopsis. Kusterer, Barbara; Muminovic, Jasmina; Utz, H. Friedrich; Piepho, Hans-Peter; Barth, Susanne; Heckenberger, Martin; Meyer, Rhonda C.; Altmann, Thomas; Melchinger, Albrecht E. // Genetics;Apr2007, Vol. 175 Issue 4, p2009

    Primary causes of heterosis are still unknown. Our goal was to investigate the extent and underlying genetic causes of heterosis for five biomass-related traits in Arabidopsis thaliana. We (i) investigated the relative contribution of dominance and epistatic effects to heterosis in the hybrid...

  • A plant cocktail amenable for PCR-based genetic analysis in Arabidopsis thaliana. Kotchoni, Simeon; Gachomo, Emma; Jimenez-Lopez, Jose // Molecular Biology Reports;Nov2011, Vol. 38 Issue 8, p5281 

    Extraction of plant genomic DNA for subsequent genetic studies is often lengthy and difficult. In addition, it requires the use of toxic reagents that remove secondary plant products, which otherwise interfere with the polymerase chain reaction steps. Here we describe a simple cost-efficient...

