The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq

Renaut, Sébastien; Grassa, Christopher J.; Moyers, Brook T.; Kane, Nolan C.; Rieseberg, Loren H.
December 2012
Biology (2079-7737);Dec2012, Vol. 1 Issue 3, p575
Academic Journal
Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS) permits investigation of genome-wide rates of protein evolution and the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq) to refine our understanding of the population genomics of wild species of sunflowers (Helianthus spp.) and the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs) in a reference transcriptome comprising 16,312 genes. Using SNP markers, we identified strong population clustering largely corresponding to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis), with one distinct early generation hybrid. Then, we calculated the proportion of adaptive substitutions fixed by selection (alpha) and identified gene ontology categories with elevated values of alpha. The "response to biotic stimulus" category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between protein evolution (dN/dS ratio) and several genomic factors predicted to co-vary with protein evolution (gene expression level, divergence and specificity, genetic divergence [FST], and nucleotide diversity [pi]). We find that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This would in turn imply that these factors govern protein evolution at both microevolutionary and macroevolutionary timescales. Our results contribute to a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.
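
The abstract does not say how alpha was estimated. As an illustration only, a widely used estimator from the McDonald-Kreitman framework is alpha = 1 - (Ds * Pn) / (Dn * Ps), where Dn and Ds are nonsynonymous and synonymous fixed differences between species, and Pn and Ps are nonsynonymous and synonymous polymorphisms within species. The sketch below assumes that estimator; the function name and the counts are hypothetical and are not taken from the article.

```python
def mk_alpha(dn, ds, pn, ps):
    """Estimate the proportion of adaptive substitutions (alpha).

    Assumes the McDonald-Kreitman style estimator
        alpha = 1 - (Ds * Pn) / (Dn * Ps)
    which may differ from the method used in the article.

    dn, ds: nonsynonymous / synonymous fixed differences (divergence)
    pn, ps: nonsynonymous / synonymous polymorphisms
    """
    if dn == 0 or ps == 0:
        # The ratio is undefined without nonsynonymous divergence
        # or synonymous polymorphism.
        raise ValueError("alpha is undefined when dn or ps is zero")
    return 1.0 - (ds * pn) / (dn * ps)


# Hypothetical counts, for illustration only.
alpha_example = mk_alpha(dn=40, ds=100, pn=20, ps=100)
print(alpha_example)
```

Under this estimator, a value near zero is what purely neutral evolution would produce, positive values suggest that some substitutions were fixed by positive selection, and negative values can arise when slightly deleterious polymorphisms inflate Pn.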


Related Articles

  • Natural Variation of the RICE FLOWERING LOCUS T 1 Contributes to Flowering Time Divergence in Rice. Ogiso-Tanaka, Eri; Matsubara, Kazuki; Yamamoto, Shin-ichi; Nonoue, Yasunori; Wu, Jianzhong; Fujisawa, Hiroko; Ishikubo, Harumi; Tanaka, Tsuyoshi; Ando, Tsuyu; Matsumoto, Takashi; Yano, Masahiro // PLoS ONE;Oct2013, Vol. 8 Issue 10, p1 

    In rice (Oryza sativa L.), there is a diversity in flowering time that is strictly genetically regulated. Some indica cultivars show extremely late flowering under long-day conditions, but little is known about the gene(s) involved. Here, we demonstrate that functional defects in the florigen...

  • Radiation hybrid maps of the D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes. Kumar, Ajay; Seetan, Raed; Mergoum, Mohamed; Tiwari, Vijay K.; Iqbal, Muhammad J.; Yi Wang; Al-Azzam, Omar; Šimková, Hana; Ming-Cheng Luo; Dvorak, Jan; Yong Q. Gu; Denton, Anne; Kilian, Andrzej; Lazo, Gerard R.; Kianian, Shahryar F. // BMC Genomics;10/16/2015, Vol. 16, p1 

    Background: The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high resolution genome maps with saturated marker scaffolds to anchor and orient BAC contigs/ sequence scaffolds for whole genome assembly. Radiation hybrid (RH) mapping has proven to be an excellent...

  • Genome Wide Allele Frequency Fingerprints (GWAFFs) of Populations via Genotyping by Sequencing. Byrne, Stephen; Czaban, Adrian; Studer, Bruno; Panitz, Frank; Bendixen, Christian; Asp, Torben // PLoS ONE;Mar2013, Vol. 8 Issue 3, p1 

    Genotyping-by-Sequencing (GBS) is an excellent tool for characterising genetic variation between plant genomes. To date, its use has been reported only for genotyping of single individuals. However, there are many applications where resolving allele frequencies within populations on a...

  • Identification of Pummelo Cultivars by Using a Panel of 25 Selected SNPs and 12 DNA Segments. Wu, Bo; Zhong, Guang-yan; Yue, Jian-qiang; Yang, Run-ting; Li, Chong; Li, Yue-jia; Zhong, Yun; Wang, Xuan; Jiang, Bo; Zeng, Ji-wu; Zhang, Li; Yan, Shu-tang; Bei, Xue-jun; Zhou, Dong-guo // PLoS ONE;Apr2014, Vol. 9 Issue 4, p1 

    Pummelo cultivars are usually difficult to identify morphologically, especially when fruits are unavailable. The problem was addressed in this study with the use of two methods: high resolution melting analysis of SNPs and sequencing of DNA segments. In the first method, a set of 25 SNPs with...

  • Auxin response factor gene family in Brassica rapa: genomic organization, divergence, expression, and evolution. Mun, Jeong-Hwan; Yu, Hee-Ju; Shin, Ja; Oh, Mijin; Hwang, Hyun-Ju; Chung, Hee // Molecular Genetics & Genomics;Oct2012, Vol. 287 Issue 10, p765 

    Completion of the sequencing of the Brassica rapa genome enabled us to undertake a genome-wide identification and functional study of the gene families related to the morphological diversity and agronomic traits of Brassica crops. In this study, we identified the auxin response factor ( ARF)...

  • Molecular cloning and expression analysis of a gene encoding soluble starch synthase I from grain amaranth ( Amaranthus cruentus L.). Park, Young-Jun; Nishikawa, Tomotaro; Tomooka, Norihiko; Nemoto, Kazuhiro // Molecular Breeding;Aug2012, Vol. 30 Issue 2, p1065 

    A full-length cDNA clone encoding a soluble starch synthase I (SSSI) from Amaranthus cruentus L. was isolated and characterized. The cDNA clone is 2,076 bp in length and contains an open reading frame of 1,821 bp that encodes 606 amino acid residues. Comparison of the cDNA and genomic sequences...

  • Consequences of Normalizing Transcriptomic and Genomic Libraries of Plant Genomes Using a Duplex-Specific Nuclease and Tetramethylammonium Chloride. Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard // PLoS ONE;Feb2013, Vol. 8 Issue 2, p1 

    Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a...

  • Transcriptome Assembly and Analysis of Tibetan Hulless Barley (Hordeum vulgare L. var. nudum) Developing Grains, with Emphasis on Quality Properties. Chen, Xin; Long, Hai; Gao, Ping; Deng, Guangbing; Pan, Zhifen; Liang, Junjun; Tang, Yawei; Tashi, Nyima; Yu, Maoqun // PLoS ONE;May2014, Vol. 9 Issue 5, p1 

    Background: Hulless barley is attracting increasing attention due to its unique nutritional value and potential health benefits. However, the molecular biology of the barley grain development and nutrient storage are not well understood. Furthermore, the genetic potential of hulless barley has...

  • Unique and Conserved MicroRNAs in Wheat Chromosome 5D Revealed by Next-Generation Sequencing. Kurtoglu, Kuaybe Yucebilgili; Kantar, Melda; Lucas, Stuart J.; Budak, Hikmet // PLoS ONE;Jul2013, Vol. 8 Issue 7, p1 

    MicroRNAs are a class of short, non-coding, single-stranded RNAs that act as post-transcriptional regulators in gene expression. miRNA analysis of Triticum aestivum chromosome 5D was performed on 454 GS FLX Titanium sequences of flow-sorted chromosome 5D with a total of 3,208,630 good quality...

