Accessibility and partner number of protein residues, their relationship and a webserver, ContPlot for their display

Pal, Arumay; Bahadur, Ranjit Prasad; Ray, Partha Sarathi; Chakrabarti, Pinak
January 2009
BMC Bioinformatics;2009, Vol. 10, Special section p1
Academic Journal
Background: Depending on chemical features residues have preferred locations -- interior or exterior -- in protein structures, which also determine how many other residues are found around them. The close packing of residues is the hallmark of protein interior and protein-protein interaction sites. Results: The average values of accessible surface area (ASA) and partner number (PN, the number of other residues within a distance of 4.5 Ã… from any atom of a given residue) of different residues have been determined and a webserver, ContPlot has been designed to display these values (relative to the average values) along the protein sequence. This would be useful to visually identify residues that are densely packed, or those involved in protein-protein interactions. The skewness observed in the distribution of PNs is indicative of the hydrophobic or hydrophilic nature of the residue. The variation of ASA with PN can be analytically expressed in terms of a cubic equation. These equations (one for each residue) can be used to estimate the ASA of a polypeptide chain using the PNs of the individual residues in the structure. Conclusion: The atom-based PNs (obtained by counting surrounding atoms) are highly correlated to the residue-based PN, indicating that the latter can adequately capture the atomic details of packing. The average values of ASA and PN associated with each residue should be useful in protein structure prediction or fold-recognition algorithm. ContPlot would provide a handy tool to assess the importance of a residue in the protein structure or interaction site.


Related Articles

  • Computational structural and functional characterization of protein family: Key for the hidden mystery. Umang, Malhotra; Astha, Jaiswal; Aastha, Chhabra; Neha, Atale; Vibha, Rani // Journal of Pharmacy Research;2012, Vol. 5 Issue 7, p3643 

    With the accretion of raw biological data in the form of unannotated protein sequences, biologists are seeking solutions to unravel the functions of proteins which have not yet been characterized experimentally. The numerous bioinformatics tools available can be used to gain an insight into...

  • Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space. Yaniv Loewenstein; Elon Portugaly; Menachem Fromer; Michal Linial // Bioinformatics;Jul2008, Vol. 24 Issue 13, pi41 

    Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets....

  • Information on the secondary structure improves the quality of protein sequence alignment. Litvinov, I. I.; Lobanov, M. Yu.; Mironov, A. A.; Finkelshtein, A. V.; Roytberg, M. A. // Molecular Biology;May2006, Vol. 40 Issue 3, p474 

    The most popular algorithms employed in the pairwise alignment of protein primary structures (Smith-Watermann (SW) algorithm, FASTA, BLAST, etc.) only analyze the amino acid sequence. The SW algorithm is the most accurate, yielding alignments that agree best with superimpositions of the...

  • Protein coalitions in a core mammalian biochemical network linked by rapidly evolving proteins. Ainali, Chrysanthi; Simon, Michelle; Freilich, Shiri; Espinosa, Octavio; Hazelwood, Lee; Tsoka, Sophia; Ouzounis, Christos A.; Hancock, John M. // BMC Evolutionary Biology;2011, Vol. 11 Issue 1, p142 

    Background: Cellular ATP levels are generated by glucose-stimulated mitochondrial metabolism and determine metabolic responses, such as glucose-stimulated insulin secretion (GSIS) from the β-cells of pancreatic islets. We describe an analysis of the evolutionary processes affecting the core...

  • Classification of protein sequences by means of irredundant patterns. Comin, Matteo; Verzotto, Davide // BMC Bioinformatics;2010 Supplement 1, Vol. 11, Special section p1 

    Background: The classification of protein sequences using string algorithms provides valuable insights for protein function prediction. Several methods, based on a variety of different patterns, have been previously proposed. Almost all string-based approaches discover patterns that are not...

  • DIALIGN-T: An improved algorithm for segment-based multiple sequence alignment. Subramanian, Amarendran R.; Weyer-Menkhoff, Jan; l Kaufmann, Michae; Morgenstern, Burkhard // BMC Bioinformatics;2005, Vol. 6, p66 

    Background: We present a complete re-implementation of the segment-based approach to multiple protein alignment that contains a number of improvements compared to the previous version 2.2 of DIALIGN. This previous version is superior to Needleman-Wunsch-based multi-alignment programs on locally...

  • Editorial Hot Topic: Protein Folding & Misfolding (Guest Editor: Carlos Henrique I. Ramos)]. Ramos, Carlos Henrique I. // Protein & Peptide Letters;Apr2005, Vol. 12 Issue 3, p211 

    The three-dimensional structure of a protein depends on its amino acid sequence, as do the stability and the folding mechanism. The first milestones in the study of protein folding took place at the beginning of the 20th century: in 1911, Chick and Martin (J. Physiol. 43:1) showed that proteins...

  • A Comprehensive Benchmark Study of Multiple Sequence Alignment Methods: Current Challenges and Future Perspectives. Thompson, Julie D.; Linard, Benjamin; Lecompte, Odile; Poch, Olivier // PLoS ONE;2011, Vol. 6 Issue 3, p1 

    Multiple comparison or alignmentof protein sequences has become a fundamental tool in many different domains in modern molecular biology, from evolutionary studies to prediction of 2D/3D structure, molecular function and intermolecular interactions etc. By placing the sequence in the framework...

  • Length Variations amongst Protein Domain Superfamilies and Consequences on Structure and Function. Sandhya, Sankaran; Rani, Saane Sudha; Pankaj, Barah; Govind, Madabosse Kande; Offmann, Bernard; Srinivasan, Narayanaswamy; Sowdhamini, Ramanathan // PLoS ONE;2009, Vol. 4 Issue 3, p1 

    Background: Related protein domains of a superfamily can be specified by proteins of diverse lengths. The structural and functional implications of indels in a domain scaffold have been examined. Methodology: In this study, domain superfamilies with large length variations (more than 30%...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics