Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae

Cornman, R. Scott; Togawa, Toru; Dunn, W. Augustine; Ningjia He; Emmons, Aaron C.; Willis, Judith H.
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting speculation as to the functional importance of this large number and diversity. Results: We have identified 156 genes in Anopheles gambiae that code for putative cuticular proteins in this CPR family, over 1% of the total number of predicted genes in this species. Annotation was verified using several criteria including identification of TATA boxes, INRs, and DPEs plus support from proteomic and gene expression analyses. Two previously recognized CPR classes, RR-1 and RR-2, form separate, well-supported clades with the exception of a small set of genes with long branches whose relationships are poorly resolved. Several of these outliers have clear orthologs in other species. Although both clades are under purifying selection, the RR-1 variant of the R&R Consensus is evolving at twice the rate of the RR-2 variant and is structurally more labile. In contrast, the regions flanking the R&R Consensus have diversified in amino-acid composition to a much greater extent in RR-2 genes compared with RR-1 genes. Many genes are found in compact tandem arrays that may include similar or dissimilar genes but always include just one of the two classes. Tandem arrays of RR-2 genes frequently contain subsets of genes coding for highly similar proteins (sequence clusters). Properties of the proteins indicated that each cluster may serve a distinct function in the cuticle. Conclusion: The complete annotation of this large gene family provides insight on the mechanisms of gene family evolution and clues about the need for so many CPR genes. These data also should assist annotation of other Anopheles genes.


Related Articles

  • Identification of the substrate recognition region in the Δ-fatty acid and Δ-sphingolipid desaturase by fusion mutagenesis. Song, Li-Ying; Zhang, Yan; Li, Shu-Fen; Hu, Jun; Yin, Wei-Bo; Chen, Yu-Hong; Hao, Shan-Ting; Wang, Bai-Lin; Wang, Richard; Hu, Zan-Min // Planta;Apr2014, Vol. 239 Issue 4, p753 

    Δ-sphingolipid desaturase and Δ-fatty acid desaturase share high protein sequence identity. Thus, it has been hypothesized that Δ-fatty acid desaturase is derived from Δ-sphingolipid desaturase; however, there is no direct proof. The substrate recognition regions of Δ-fatty...

  • Buffered codons in human transcriptional units. Mahdi, Rami; Rouchka, Eric C. // BMC Bioinformatics;2008 Supplement 7, Vol. 9, Special section p1 

    Background Codon usage is well established for a number of different species. Multiple models have been proposed to show codon bias as a balance between mutation and selection. Most of these models emphasize controlling the speed of protein translation from the mRNA and increasing the accuracy...

  • Cloning of Genes Encoding Auxin-Binding Proteins (ABP19/20) from Peach: Significant Peptide Sequence Similarity with Germin-Like Proteins. Ohmiya, Akemi; Tanaka, Yoshiyuki; Kadowaki, Koh-ichi; Hayashi, Tateki // Plant & Cell Physiology;May1998, Vol. 39 Issue 5, p492 

    An auxin-binding protein (ABP) was previously isolated from shoot apices of peach trees to homogenity on standard SDS-PAGE. Analysis of low-bis SDS-PAGE and direct peptide sequencing of purified peach ABP demonstrated that the ABP was composed of two types of polypeptides (designated ABP19 and...

  • Molecular characterization and expression analysis of a gene encoding an isoamylase-type starch debranching enzyme 3 (ISA3) in grain amaranths. Park, Young-Jun; Nemoto, Kazuhiro; Tomooka, Norihiko; Nishikawa, Tomotaro // Molecular Breeding;Apr2014, Vol. 33 Issue 4, p793 

    A cDNA clone from amaranth perisperm that encodes an isoamylase (ISA)-type starch debranching enzyme 3 was isolated and analyzed for the first time. The cDNA consisted of 2,715 bp with a single open reading frame of 2,346 bp, encoding a protein of 781 amino acid residues. The deduced amino acid...

  • Selection on GGU and CGU Codons in the High Expression Genes in Bacteria. Satapathy, Siddhartha; Powdel, Bhesh; Dutta, Malay; Buragohain, Alak; Ray, Suvendra // Journal of Molecular Evolution;Jan2014, Vol. 78 Issue 1, p13 

    The fourfold degenerate site (FDS) in coding sequences is important for studying the effect of any selection pressure on codon usage bias (CUB) because nucleotide substitution per se is not under any such pressure at the site due to the unaltered amino acid sequence in a protein. We estimated...

  • Molecular Cloning and Characterization of a cDNA Encoding Proline Transporter in Rice. Igarashi, Yumiko; Yoshiba, Yoshu; Takeshita, Tomoko; Nomura, Sayuri; Otomo, Jun; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo // Plant & Cell Physiology;Jun2000, Vol. 41 Issue 6, p750 

    A cDNA encoding a proline (Pro) transporter (ProT) was isolated and characterized from a cDNA library prepared from 14-d-old seedlings of Oryza sativa cv. Akibare. The deduced amino acid sequence of the rice ProT protein (OsProT) had 68.8% homology to the ProT protein 1 from Arabidopsis thaliana...

  • Chaperones Divide Yeast Proteins into Classes of Expression Level and Evolutionary Rate. Bogumil, David; Landan, Giddy; Ilhan, Judith; Dagan, Tal // Genome Biology & Evolution;Sep2012, Vol. 4 Issue 9, p618 

    It has long been known that many proteins require folding via molecular chaperones for their function. Although it has become apparent that folding imposes constraints on protein sequence evolution, the effects exerted by different chaperone classes are so far unknown. We have analyzed data of...

  • Investigation of Genes Encoding Calcineurin B-Like Protein Family in Legumes and Their Expression Analyses in Chickpea (Cicer arietinum L.). Meena, Mukesh Kumar; Ghawana, Sanjay; Sardar, Atish; Dwivedi, Vikas; Khandal, Hitaishi; Roy, Riti; Chattopadhyay, Debasis // PLoS ONE;Apr2015, Vol. 10 Issue 4, p1 

    Calcium ion (Ca2+) is a ubiquitous second messenger that transmits various internal and external signals including stresses and, therefore, is important for plants’ response process. Calcineurin B-like proteins (CBLs) are one of the plant calcium sensors, which sense and convey the...

  • Cloning and Characterization of the Gene Encoding O-Acetylserine Lyase from Streptococcus suis. Osaki, Makoto; Takamatsu, Daisuke; Tsuji, Naotoshi; Sekizaki, Tsutomu // Current Microbiology;Jan2000, Vol. 40 Issue 1, p67 

    We have cloned and sequenced a gene encoding O-acetylserine lyase from Streptococcus suis. The gene encodes a protein of 309 amino acids with a calculated molecular mass of 32,038 Da. The deduced amino acid sequence showed more extensive similarities to the CysK proteins than to the CysM...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics