A comprehensive analysis of non-sequential alignments between all protein structures

Abyzov, Alexej; Ilyin, Valentin A.
January 2007
BMC Structural Biology;2007, Vol. 7, p78
Academic Journal
Background: The majority of relations between proteins can be represented as a conventional sequential alignment. Nevertheless, unusual non-sequential alignments with different connectivity of the aligned fragments in compared proteins have been reported by many researchers. It is interesting to understand those non-sequential alignments; are they unique, sporadic cases or they occur frequently; do they belong to a few specific folds or spread among many different folds, as a common feature of protein structure. We present here a comprehensive large-scale study of nonsequential alignments between available protein structures in Protein Data Bank. Results: The study has been conducted on a non-redundant set of 8,865 protein structures aligned with the aid of the TOPOFIT method. It has been estimated that between 17.4% and 35.2% of all alignments are non-sequential depending on variations in the parameters. Analysis of the data revealed that non-sequential relations between proteins do occur systematically and in large quantities. Various sizes and numbers of non-sequential fragments have been observed with all possible complexities of fragment rearrangements found for alignments consisting of up to 12 fragments. It has been found that non-sequential alignments are not limited to proteins of any particular fold and are present in more than two hundred of them. Moreover, many of them are found between proteins with different fold assignments. It has been shown that protein structure symmetry does not explain non-sequential alignments. Therefore, compelling evidences have been provided that non-sequential alignments between proteins are systematic and widespread across the protein universe. Conclusion: The phenomenon of the widespread occurrence of non-sequential alignments between proteins might represent a missing rule of protein structure organization. More detailed study of this phenomenon will enhance our understanding of protein stability, folding, and evolution.


Related Articles

  • Protein coalitions in a core mammalian biochemical network linked by rapidly evolving proteins. Ainali, Chrysanthi; Simon, Michelle; Freilich, Shiri; Espinosa, Octavio; Hazelwood, Lee; Tsoka, Sophia; Ouzounis, Christos A.; Hancock, John M. // BMC Evolutionary Biology;2011, Vol. 11 Issue 1, p142 

    Background: Cellular ATP levels are generated by glucose-stimulated mitochondrial metabolism and determine metabolic responses, such as glucose-stimulated insulin secretion (GSIS) from the β-cells of pancreatic islets. We describe an analysis of the evolutionary processes affecting the core...

  • Accessibility and partner number of protein residues, their relationship and a webserver, ContPlot for their display. Pal, Arumay; Bahadur, Ranjit Prasad; Ray, Partha Sarathi; Chakrabarti, Pinak // BMC Bioinformatics;2009, Vol. 10, Special section p1 

    Background: Depending on chemical features residues have preferred locations -- interior or exterior -- in protein structures, which also determine how many other residues are found around them. The close packing of residues is the hallmark of protein interior and protein-protein interaction...

  • WildSpan: mining structured motifs from protein sequences. Chen-Ming Hsu; Chien-Yu Chen; Baw-Jhiune Liu // Algorithms for Molecular Biology;2011, Vol. 6 Issue 1, p6 

    Background: Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually...

  • Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins. Marsico, Annalisa; Henschel, Andreas; Winter, Christof; Tuukkanen, Anne; Vassilev, Boris; Scheubert, Kerstin; Schroeder, Michael // BMC Bioinformatics;2010, Vol. 11, p204 

    Background: A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably...

  • APDB: a novel measure for benchmarking sequence alignment methods without reference alignments. O. O'Sullivan; M. Zehnder; D. Higgins; P. Bucher; A. Grosdidier; C. Notredame // Bioinformatics;Jan2009 Supplement, Vol. 19, p215 

    Motivation: We describe APDB, a novel measure for evaluating the quality of a protein sequence alignment, given two or more PDB structures. This evaluation does not require a reference alignment or a structure superposition. APDB is designed to efficiently and objectively benchmark multiple...

  • BETASCAN: Probable β-amyloids Identified by Pairwise Probabilistic Analysis. Bryan Jr., Allen W.; Menke, Matthew; Cowen, Lenore J.; Lindquist, Susan L.; Berger, Bonnie // PLoS Computational Biology;Mar2009, Vol. 5 Issue 3, p1 

    Amyloids and prion proteins are clinically and biologically important β-structures, whose supersecondary structures are difficult to determine by standard experimental or computational means. In addition, significant conformational heterogeneity is known or suspected to exist in many amyloid...

  • Predicting Binding Sites.  // BioWorld Today;3/21/2011, Vol. 22 Issue 54, Special section p1 

    This article focuses on a study by researchers from the University of Chicago and the Academia Sinica in Taiwan which examined a way to predict from the amino acid sequence which parts of a protein form binding sites with other proteins.

  • Drug delivery coalesces. Dove, Alan // Nature Biotechnology;Mar2000, Vol. 18 Issue 3, p249 

    Reports that scientists in the U.S. have developed a protein delivery system that responds to exogenous treatment with a small molecule. Creation of chimeric proteins that contain the coding sequence for a conditional aggregation domain (CAD) fused to a proinsulin; Accumulation of CAD fusions...

  • The Complex Nature of Protein Interactions. Gwynne, Peter // Genomics & Proteomics;Apr2003, Vol. 3 Issue 3, p37 

    Discusses the use of in silico methods to predict protein-protein reactions. Use of computers and software to predict the interactions on the basis of amino acid sequence information; Advantages of computational methods over other approaches in monitoring protein-protein interactions;...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics