Function prediction from networks of local evolutionary similarity in protein structure

Erdin, Serkan; Venner, Eric; Lisewski, Andreas Martin; Lichtarge, Olivier
April 2013
BMC Bioinformatics;2013, Vol. 14 Issue S3, p1
Academic Journal
Background: Annotating protein function with both high accuracy and sensitivity remains a major challenge in structural genomics. One proven computational strategy has been to group a few key functional amino acids into templates and search for these templates in other protein structures, so as to transfer function when a match is found. To this end, we previously developed Evolutionary Trace Annotation (ETA) and showed that diffusing known annotations over a network of template matches on a structural genomic scale improved predictions of function. In order to further increase sensitivity, we now let each protein contribute multiple templates rather than just one, and also let the template size vary. Results: Retrospective benchmarks in 605 Structural Genomics enzymes showed that multiple templates increased sensitivity by up to 14% when combined with single template predictions even as they maintained the accuracy over 91%. Diffusing function globally on networks of single and multiple template matches marginally increased the area under the ROC curve over 0.97, but in a subset of proteins that could not be annotated by ETA, the network approach recovered annotations for the most confident 20-23 of 91 cases with 100% accuracy. Conclusions: We improve the accuracy and sensitivity of predictions by using multiple templates per protein structure when constructing networks of ETA matches and diffusing annotations.


Related Articles

  • Detection, Characterization and Evolution of Internal Repeats in Chitinases of Known 3-D Structure. Sivaji, Manigandan; Sadasivam, Vinoth; Narayanasamy, Jayabalan; Samuel, Selvaraj; Fan, Chuanzhu // PLoS ONE;Mar2014, Vol. 9 Issue 3, p1 

    Chitinase proteins have evolved and diversified almost in all organisms ranging from prokaryotes to eukaryotes. During evolution, internal repeats may appear in amino acid sequences of proteins which alter the structural and functional features. Here we deciphered the internal repeats from...

  • Strength in numbers. Hughes, Austin L. // Nature;6/20/2002, Vol. 417 Issue 6891, p795 

    Discusses the ways of recording the evolution of life. Use of archeological evidence; Utilization of molecular-biology data; Interpretation of genetic markers.

  • Molecular genetic study of acute intermittent porphyria in Russia: Mutation analysis and functional polymorphism search in porphobilinogen deaminase gene. Surin, V. L.; Luchinina, Yu. A.; Selivanova, D. S.; Pustovoit, Ya. S.; Karpova, I. V.; Pivnik, A. V.; Luk'ianenko, A. V.; Kravchenko, S. K. // Russian Journal of Genetics;Apr2010, Vol. 46 Issue 4, p476 

    Acute intermittent porphyria (AIP) is an autosomal dominant hereditary disease, caused by partial deficiency of porphobilinogen deaminase (PBGD), one of the key enzymes of heme biosynthesis. This study describes molecular genetics of AIP in Russia. Mutation analysis of PBGD gene in 70 unrelated...

  • The Reb1-homologue Ydr026c/Nsi1 is required for efficient RNA polymerase I termination in yeast. Reiter, Alarich; Hamperl, Stephan; Seitz, Hannah; Merkl, Philipp; Perez-Fernandez, Jorge; Williams, Lydia; Gerber, Jochen; Németh, Attila; Léger, Isabelle; Gadal, Olivier; Milkereit, Philipp; Griesenbeck, Joachim; Tschochner, Herbert // EMBO Journal;8/15/2012, Vol. 31 Issue 16, p3480 

    Several DNA cis-elements and trans-acting factors were described to be involved in transcription termination and to release the elongating RNA polymerases from their templates. Different models for the molecular mechanism of transcription termination have been suggested for eukaryotic RNA...

  • Transcriptional regulation of secondary growth in Arabidopsis thaliana. Sookyung Oh; Sunchung Park; Kyung-Hwan Han // Journal of Experimental Botany;Dec2003, Vol. 54 Issue 393, p2709 

    Despite its economic and environmental significance, understanding the molecular biology of secondary growth (i.e. wood formation) in tree species has been lagging behind that of primary growth, primarily due to the inherent difficulties of tree biology. In recent years, Arabidopsis has been...

  • Alternating evolutionary pressure in a genetic algorithm facilitates protein model selection. Offman, Marc N.; Tournier, Alexander L.; Bates, Paul A. // BMC Structural Biology;2008, Vol. 8, Special section p1 

    Background: Automatic protein modelling pipelines are becoming ever more accurate; this has come hand in hand with an increasingly complicated interplay between all components involved. Nevertheless, there are still potential improvements to be made in template selection, refinement and protein...

  • Protein function annotation with Structurally Aligned Local Sites of Activity (SALSAs). Wang, Zhouxi; Yin, Pengcheng; Lee, Joslynn S.; Parasuram, Ramya; Somarowthu, Srinivas; Ondrechen, Mary Jo // BMC Bioinformatics;2013, Vol. 14 Issue S3, p1 

    Background: The prediction of biochemical function from the 3D structure of a protein has proved to be much more difficult than was originally foreseen. A reliable method to test the likelihood of putative annotations and to predict function from structure would add tremendous value to...

  • Discriminative Motif Discovery via Simulated Evolution and Random Under-Sampling. Song, Tao; Gu, Hong // PLoS ONE;Feb2014, Vol. 9 Issue 2, p1 

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons...

  • Random Addition Concatenation Analysis: A Novel Approach to the Exploration of Phylogenomic Signal Reveals Strong Agreement between Core and Shell Genomic Partitions in the Cyanobacteria. Narechania, Apurva; Baker, Richard H.; Sit, Ryan; Kolokotronis, Sergios-Orestis; DeSalle, Rob; Planet, Paul J. // Genome Biology & Evolution;Sep2012, Vol. 4 Issue 9, p30 

    Recent whole-genome approaches to microbial phylogeny have emphasized partitioning genes into functional classes, often focusing on differences between a stable core of genes and a variable shell. To rigorously address the effects of partitioning and combining genes in genome-level analyses, we...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics