Improving predicted protein loop structure ranking using a Pareto-optimality consensus method

Yaohang Li; Rata, Ionel; See-wing Chiu; Jakobsson, Eric
January 2010
BMC Structural Biology;2010, Vol. 10, p22
Academic Journal
Background: Accurate protein loop structure models are important for understanding the functions of many proteins. Identifying the native or near-native models by distinguishing them from misfolded ones is a critical step in protein loop structure prediction.
Results: We have developed a Pareto Optimal Consensus (POC) method, a consensus model ranking approach that integrates multiple knowledge- or physics-based scoring functions. Identifying the best-quality models in a model set involves: 1) identifying the models at the Pareto optimal front with respect to a set of scoring functions, and 2) ranking them based on their fuzzy dominance relationship to the rest of the models. We apply the POC method to a large number of decoy sets for loops of 4 to 12 residues in length, using a functional space composed of several carefully selected scoring functions: Rosetta, DOPE, DDFIRE, OPLS-AA, and a triplet backbone dihedral potential developed in our lab. Our computational results show that the sets of Pareto-optimal decoys, which typically comprise ∼20% or less of the decoys in a set, cover the best or near-best decoys in more than 99% of the loop targets. Compared to the individual scoring function yielding the best selection accuracy in the decoy sets, the POC method yields 23%, 37%, and 64% fewer false positives in distinguishing the native conformation, identifying a near-native model (RMSD < 0.5 Å from the native) as top-ranked, and selecting at least one near-native model among the top 5 ranked models, respectively. The POC method is similarly effective on decoy sets from membrane protein loops. Furthermore, the POC method outperforms other commonly used consensus strategies for model ranking, such as rank-by-number, rank-by-rank, rank-by-vote, and regression-based methods.
Conclusions: By integrating multiple knowledge- and physics-based scoring functions based on Pareto optimality and fuzzy dominance, the POC method is effective in distinguishing the best loop models from the other ones within a loop model set.
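
The two-step selection described in the abstract can be illustrated with a minimal Python sketch. It assumes that every scoring function is energy-like (lower is better) and that each decoy is represented by a tuple of scores. The function names `dominates`, `pareto_front`, and `rank_front` are hypothetical. The abstract does not define the fuzzy dominance measure, so the ranking step below uses a simple stand-in (the average fraction of scoring functions on which a front member beats each other decoy) and should not be read as the authors' formulation.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b on every score and strictly better on one.

    Assumption: lower score means better model (energy-like scores).
    """
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Indices of decoys that no other decoy dominates (step 1)."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


def soft_dominance(i: int, scores: List[Sequence[float]]) -> float:
    """Stand-in for fuzzy dominance (an assumption, not the paper's definition).

    For each other decoy, take the fraction of scoring functions on which
    decoy i is strictly better, then average over all other decoys.
    """
    others = [t for j, t in enumerate(scores) if j != i]
    if not others:
        return 0.0
    fractions = [
        sum(x < y for x, y in zip(scores[i], t)) / len(t) for t in others
    ]
    return sum(fractions) / len(fractions)


def rank_front(scores: List[Sequence[float]]) -> List[int]:
    """Pareto-front decoys ordered by decreasing soft dominance (step 2)."""
    front = pareto_front(scores)
    return sorted(front, key=lambda i: -soft_dominance(i, scores))


# Toy decoy set with two hypothetical scoring functions per decoy.
decoys = [(1.0, 5.0), (2.0, 2.0), (5.0, 1.0), (3.0, 3.0), (6.0, 6.0)]
front = pareto_front(decoys)
ranking = rank_front(decoys)
```

In this toy set, decoys 3 and 4 are dominated by decoy 1 and drop out of the front. Among the remaining decoys, the balanced decoy 1 ranks first under the stand-in measure because it beats the other decoys on more scoring functions on average.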


Related Articles

  • Evolutionary biochemistry: revealing the historical and physical causes of protein properties. Harms, Michael J.; Thornton, Joseph W. // Nature Reviews Genetics;Aug2013, Vol. 14 Issue 8, p559 

    The repertoire of proteins and nucleic acids in the living world is determined by evolution; their properties are determined by the laws of physics and chemistry. Explanations of these two kinds of causality - the purviews of evolutionary biology and biochemistry, respectively - are typically...

  • Buffering the entropic cost of hydrophobic collapse in protein chains. Fernández, Ariel // Journal of Chemical Physics;12/8/2004, Vol. 121 Issue 22, p11501 

    Direct inspection of high-resolution protein structures reveals that backbone dehydration promotes extra conformational freedom in the peptide bond, especially when the residue is not involved in secondary structure. The results imply a buffering effect that lowers the entropic cost of...

  • degradation.  // Hutchinson Dictionary of Scientific Biography;2005, p1 

    Breaking down of compounds into simpler molecules; for example, the action of enzymes brings about the degradation of proteins to amino acids.

  • Prospects for Atomic Resolution and Neutron Crystallography in Drug Design. Coates, L.; Myles, D. A. A. // Current Drug Targets;Feb2004, Vol. 5 Issue 2, p173 

    The number of protein crystal structures being refined to atomic resolution is increasing each year as well as the size of proteins being studied. There are currently 346 structures in the protein data bank which have been refined to or beyond atomic resolution. The benefits of atomic resolution...

  • Three-way decomposition of a complete 3D 15N-NOESY-HSQC. Gutmanas, Aleksandras; Jarvoll, Patrik; Orekhov, Vladislav Yu.; Billeter, Martin // Journal of Biomolecular NMR;Nov2002, Vol. 24 Issue 3, p191 

    Three-way decomposition is applied for the structural analysis of a complete three-dimensional 15N-NOESY-HSQC of the 128 residues long protein azurin. The procedure presented includes decomposition using the software MUNIN, providing an initial characterization of the complete spectrum by 355...

  • Chemical shift correlation via RFDR: Elimination of resonance offset effects. Heise, Bert; Leppert, Jörg; Ohlenschläger, Oliver; Görlach, Matthias; Ramachandran, Ramadurai // Journal of Biomolecular NMR;Nov2002, Vol. 24 Issue 3, p237 

    It is shown that it is possible to effectively execute RFDR experiments with adiabatic inversion pulses and obtain resonance offset compensation that is superior to what can be achieved by conventional rectangular pulses. Employing 40-μs tanh/tan adiabatic pulses at a power level of ∼38...

  • A spin-state-selective experiment for measuring heteronuclear one-bond and homonuclear two-bond couplings from an HSQC-type spectrum. Permi, Perttu // Journal of Biomolecular NMR;Jan2002, Vol. 22 Issue 1, p27 

    Recently, a set of selective 1D experiments with spin-state-selective excitation for CH spin systems was introduced by Parella and Belloc (J. Magn. Reson., 148, 78–87 (2001)). We have expanded and generalized this concept further, and demonstrated that a very simple experiment utilizing...

  • Exact sequence analysis for three-dimensional hydrophobic-polar lattice proteins. Schiemann, Reinhard; Bachmann, Michael; Janke, Wolfhard // Journal of Chemical Physics;3/15/2005, Vol. 122 Issue 11, p114705 

    We have exactly enumerated all sequences and conformations of hydrophobic-polar (HP) proteins with chains of up to 19 monomers on the simple cubic lattice. For two variants of the HP model, where only two types of monomers are distinguished, we determined and statistically analyzed designing...

  • Protein phase diagrams: The physics behind their elliptic shape. Lesch, Harald; Hecht, Christoph; Friedrich, Josef // Journal of Chemical Physics;12/22/2004, Vol. 121 Issue 24, p12671 

    We relate the condition for the elliptic shape of the phase diagram of proteins to the degree of correlation in the fluctuations of the changes of enthalpy and volume at the denaturing-refolding transition. Since this degree cannot be larger than 1, hyperbolically shaped diagrams are not likely...

