Annotation of Protein Domains Reveals Remarkable Conservation in the Functional Make up of Proteomes Across Superkingdoms

Nasir, Arshan; Naeem, Aisha; Khan, Muhammad Jawad; Lopez-Nicora, Horacio D.; Caetano-Anollés, Gustavo
December 2011
Genes;Dec2011, Vol. 2 Issue 4, p869
Academic Journal
The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins encoded in the genome of an organism. The molecular functions of proteins are the direct consequence of their structure and structure can be inferred from sequence using hidden Markov models of structural recognition. Here we analyze the functional annotation of protein domain structures in almost a thousand sequenced genomes, exploring the functional and structural diversity of proteomes. We find there is a remarkable conservation in the distribution of domains with respect to the molecular functions they perform in the three superkingdoms of life. In general, most of the protein repertoire is spent in functions related to metabolic processes but there are significant differences in the usage of domains for regulatory and extra-cellular processes both within and between superkingdoms. Our results support the hypotheses that the proteomes of superkingdom Eukarya evolved via genome expansion mechanisms that were directed towards innovating new domain architectures for regulatory and extra/intracellular process functions needed for example to maintain the integrity of multicellular structure or to interact with environmental biotic and abiotic factors (e.g., cell signaling and adhesion, immune responses, and toxin production). Proteomes of microbial superkingdoms Archaea and Bacteria retained fewer numbers of domains and maintained simple and smaller protein repertoires. Viruses appear to play an important role in the evolution of superkingdoms. We finally identify few genomic outliers that deviate significantly from the conserved functional design. These include Nanoarchaeum equitans, proteobacterial symbionts of insects with extremely reduced genomes, Tenericutes and Guillardia theta. These organisms spend most of their domains on information functions, including translation and transcription, rather than on metabolism and harbor a domain repertoire characteristic of parasitic organisms. In contrast, the functional repertoire of the proteomes of the Planctomycetes-Verrucomicrobia-Chlamydiae superphylum was no different than the rest of bacteria, failing to support claims of them representing a separate superkingdom. In turn, Protista and Bacteria shared similar functional distribution patterns suggesting an ancestral evolutionary link between these groups.


Related Articles

  • Evolution and Quantitative Comparison of Genome-Wide Protein Domain Distributions. Parikesit, Arli A.; Stadler, Peter F.; Prohaska, Sonja J. // Genes;Dec2011, Vol. 2 Issue 4, p912 

    The metabolic and regulatory capabilities of an organism are implicit in its protein content. This is often hard to estimate, however, due to ascertainment biases inherent in the available genome annotations. Its complement of recognizable functional protein domains and their combinations convey...

  • Crystal and Molecular Structure of the Yellow Form of Chloro(2,2' :6',2"-terpyridine)platinum (II)chloride dihydrate, [Pt(terpy)Cl]Cl·2H2O. Sengül, Abdurrahman // Turkish Journal of Chemistry;2004, Vol. 28 Issue 5, p667 

    The synthesis, characterization, and X-ray crystal structure of the yellow dimorph, [Pt(terpy)Cl]Cl · 2H2O, are reported. The yellow acicular crystals are monoclinic: space group P21/n, a = 6.908(3) Å b = 17.06700(11) Å, c = 13. 8390(10) Å, β = 98.607(4)°, andDcalc =2.204 Mg...

  • Geomorphology and tectonics of Himalaya-Tibet region. Nadgir, B. // Journal of the Geological Society of India;Jan2014, Vol. 83 Issue 1, p115 

    The article reflects on Tibet Region's tectonics and geology providing information on the formation of Archaean-Dharwar fold belts, on physiographic changes caused by Mahabharat Fault (MF) and Deep Crustal Fault (CF), and the Main Boundary Fault (MBF).

  • New paleomagnetic data confirm a dual-collision process in the Himalayas. Wenjiao Xiao // National Science Review;2015, Vol. 2 Issue 4, p395 

    The article discusses the Himalayan-Tibetan Orogen, a continent-continent collision belt created by the India-Asia collision and continual plate convergence.

  • Chloroplast Proteomics and the Compartmentation of Plastidial Isoprenoid Biosynthetic Pathways. Joyard, Jacques; Ferro, Myriam; Masselon, Christophe; Seigneurin-Berny, Daphné; Salvi, Daniel; Garin, Jérôme; Rolland, Norbert // Molecular Plant (Oxford University Press / USA);Nov2009, Vol. 2 Issue 6, p1154 

    Recent advances in the proteomic field have allowed high-throughput experiments to be conducted on chloroplast samples. Many proteomic investigations have focused on either whole chloroplast or sub-plastidial fractions. To date, the Plant Protein Database (PPDB, Sun et al., 2009) presents the...

  • Structural Biology, Protein Conformations and Drug Designing. Kishan, K. V. Radha // Current Protein & Peptide Science;Aug2007, Vol. 8 Issue 4, p376 

    Structure based drug designing is now a popular technique used for increasing the speed of drug designing process. This was made possible by the availability of many protein structures which helped in developing tools to understand the structure function relationships, automated docking and...

  • Representing, storing and accessing molecular interaction data: a review of models and tools. Strömbäck, Lena; Jakoniene, Vaida; He Tan; Lambrix, Patrick // Briefings in Bioinformatics;Dec2006, Vol. 7 Issue 4, p331 

    One important aim within systems biology is to integrate disparate pieces of information, leading to discovery of higher-level knowledge about important functionality within living organisms. This makes standards for representation of data and technology for exchange and integration of data...

  • Antifragility and Tinkering in Biology (and in Business) Flexibility Provides an Efficient Epigenetic Way to Manage Risk. Danchin, Antoine; Binder, Philippe M.; Noria, Stanislas // Genes;Dec2011, Vol. 2 Issue 4, p998 

    The notion of antifragility, an attribute of systems that makes them thrive under variable conditions, has recently been proposed by Nassim Taleb in a business context. This idea requires the ability of such systems to 'tinker', i.e., to creatively respond to changes in their environment. A...

  • A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Ballester, Pedro J.; Mitchell, John B. O.; Rost, Burkhard // Bioinformatics;May2010, Vol. 26 Issue 9, p1169 

    Motivation: Accurately predicting the binding affinities of large sets of diverse protein--ligand complexes is an extremely challenging task. The scoring functions that attempt such computational prediction are essential for analysing the outputs of molecular docking, which in turn is an...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics