Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

Marsico, Annalisa; Henschel, Andreas; Winter, Christof; Tuukkanen, Anne; Vassilev, Boris; Scheubert, Kerstin; Schroeder, Michael
January 2010
BMC Bioinformatics;2010, Vol. 11, p204
Academic Journal
Background: A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results: We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to a-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94%) appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1) a dimer interface motif found in voltage-gated chloride channels, (2) a proton transfer motif found in heme-copper oxidases, and (3) a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions: Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.


Related Articles

  • Prediction of the translocon-mediated membrane insertion free energies of protein sequences. Yungki Park; Volkhard Helms // Bioinformatics;May2008, Vol. 24 Issue 10, p1271 

    Motivation: Helical membrane proteins (HMPs) play crucial roles in a variety of cellular processes. Unlike water-soluble proteins, HMPs need not only to fold but also get inserted into the membrane to be fully functional. This process of membrane insertion is mediated by the translocon complex....

  • Identification and Expression of Nine Oak Aquaporin Genes in the Primary Root Axis of Two Oak Species, Quercus petraea and Quercus robur. Rasheed-Depardieu, Claire; Parent, Claire; Crèvecoeur, Michèle; Parelle, Julien; Tatin-Froux, Fabienne; Provost, Grégoire Le; Capelli, Nicolas // PLoS ONE;Dec2012, Vol. 7 Issue 12, p1 

    Aquaporins (AQPs) belong to the Major Intrinsic Protein family that conducts water and other small solutes across biological membranes. This study aimed to identify and characterize AQP genes in the primary root axis of two oak species, Quercus petraea and Quercus robur. Nine putative AQP genes...

  • Eukaryote-wide sequence analysis of mitochondrial &beta-barrel outer membrane proteins. Imai, Kenichiro; Fujita, Naoya; Gromiha, M. Michael; Horton, Paul // BMC Genomics;2011, Vol. 12 Issue 1, p79 

    Background: The outer membranes of mitochondria are thought to be homologous to the outer membranes of Gram negative bacteria, which contain 100's of distinct families of β-barrel membrane proteins (BOMPs) often forming channels for transport of nutrients or drugs. However, only four families...

  • Expression and Molecular Characterization of Spherical Particles Derived from the Genome of the Hyperthermophilic Euryarchaeote Pyrococcus furiosus. Namba, Kazunori; Hagiwara, Kyoji; Tanaka, Hideaki; Nakaishi, Yuichiro; Khoon Tee Chong; Yamashita, Eiki; Armah, George Enyimah; Ono, Yasuko; Ishino, Yoshizumi; Omura, Toshihiro; Tsukihara, Tomitake; Nakagawa, Atsushi // Journal of Biochemistry;Aug2005, Vol. 138 Issue 2, p193 

    Spherical particles (SPs) of approximately 30 nm in diameter were found in the hyperthermophilic archaeon Pyrococcus furiosus. The SPs contained no nucleic acid and were composed of a single 39-kDa protein. The amino acid sequences of the amino-terminal and internal fragments were identical to...

  • Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information. Suresh, M. Xavier; Gromiha, M. Michael; Suwa, Makiko // Advances in Bioinformatics;1/31/2015, Vol. 2015, p1 

    Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane...

  • Rampant Exchange of the Structure and Function of Extramembrane Domains between Membrane and Water Soluble Proteins. Nam, Hyun-Jun; Han, Seong Kyu; Bowie, James U.; Kim, Sanguk // PLoS Computational Biology;Mar2013, Vol. 9 Issue 3, p1 

    Of the membrane proteins of known structure, we found that a remarkable 67% of the water soluble domains are structurally similar to water soluble proteins of known structure. Moreover, 41% of known water soluble protein structures share a domain with an already known membrane protein structure....

  • Discrimination of outer membrane proteins with improved performance. Changhui Yan; Jing Hu; Yingfeng Wang // BMC Bioinformatics;2008, Vol. 9, Special section p1 

    Background: Outer membrane proteins (OMPs) perform diverse functional roles in Gram-negative bacteria. Identification of outer membrane proteins is an important task. Results: This paper presents a method for distinguishing outer membrane proteins (OMPs) from non-OMPs (that is, globular proteins...

  • Simultaneous Horizontal Gene Transfer of a Gene Coding for Ribosomal Protein L27 and Operational Genes in Arthrobacter Sp. Garcia-Vallvé, Santiago; Simó, Francesc X.; Montero, Miguel A.; Arola, Lluís; Romeu, Antoni // Journal of Molecular Evolution;Dec2002, Vol. 55 Issue 6, p632 

    Phylogenetic analysis of bacterial L27 ribosomal proteins showed that, against taxonomy, the L27 protein from the Actinobacteria Arthrobacter sp. clusters with protein sequences from the Bacillus group. The L27 gene clusters in the Arthrobacter sp. genome with six genes responsible for...

  • A protein sequence that can encode native structure by disfavoring alternate conformations. Wigley, W. Christian; Corboy, Michael J.; Cutler, Todd D.; Thibodeau, Patrick H.; Oldan, Jorge; Lee, Min Goo; Rizo, Josep; Hunt, John F.; Thomas, Philip J. // Nature Structural Biology;May2002, Vol. 9 Issue 5, p381 

    The linear sequence of amino acids contains all the necessary information for a protein to fold into its unique three-dimensional structure. Native protein sequences are known to accomplish this by promoting the formation of stable, kinetically accessible structures. Here we describe a Pro...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics