De Novo Assembly, Gene Annotation and Marker Development Using Illumina Paired-End Transcriptome Sequences in Celery (Apium graveolens L.)

Fu, Nan; Wang, Qian; Shen, Huo-Lin
February 2013
PLoS ONE;Feb2013, Vol. 8 Issue 2, p1
Academic Journal
Background: Celery is an increasing popular vegetable species, but limited transcriptome and genomic data hinder the research to it. In addition, a lack of celery molecular markers limits the process of molecular genetic breeding. High-throughput transcriptome sequencing is an efficient method to generate a large transcriptome sequence dataset for gene discovery, molecular marker development and marker-assisted selection breeding. Principal Findings: Celery transcriptomes from four tissues were sequenced using Illumina paired-end sequencing technology. De novo assembling was performed to generate a collection of 42,280 unigenes (average length of 502.6 bp) that represent the first transcriptome of the species. 78.43% and 48.93% of the unigenes had significant similarity with proteins in the National Center for Biotechnology Information (NCBI) non-redundant protein database (Nr) and Swiss-Prot database respectively, and 10,473 (24.77%) unigenes were assigned to Clusters of Orthologous Groups (COG). 21,126 (49.97%) unigenes harboring Interpro domains were annotated, in which 15,409 (36.45%) were assigned to Gene Ontology(GO) categories. Additionally, 7,478 unigenes were mapped onto 228 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Large numbers of simple sequence repeats (SSRs) were indentified, and then the rate of successful amplication and polymorphism were investigated among 31 celery accessions. Conclusions: This study demonstrates the feasibility of generating a large scale of sequence information by Illumina paired-end sequencing and efficient assembling. Our results provide a valuable resource for celery research. The developed molecular markers are the foundation of further genetic linkage analysis and gene localization, and they will be essential to accelerate the process of breeding.


Related Articles

  • Identification and characterization of a novel heat shock transcription factor gene, GmHsfA1, in soybeans ( Glycine max). Baoge Zhu; Chunjiang Ye; Huiying Lü; Xiaojun Chen; Guohua Chai; Jiannan Chen; Chao Wang // Journal of Plant Research;May2006, Vol. 119 Issue 3, p247 

    Plants have a large family of HSFs with different roles in the heat shock response that mediate the expression of HSP regulated genes. The HSF encoding genes are easily identified by their highly conserved modular structure and motifs. In the present study, a putative GmHsfA1 was identified and...

  • Sequence Motifs in MADS Transcription Factors Responsible for Specificity and Diversification of Protein-Protein Interaction. van Dijk, Aalt D. J.; Morabito, Giuseppa; Fiers, Martijn; van Ham, Roeland C. H. J.; Angenent, Gerco C.; Immink, Richard G. H. // PLoS Computational Biology;Nov2010, Vol. 6 Issue 11, p1 

    Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein...

  • Plant Genome DataBase Japan (PGDBj): A Portal Website for the Integration of Plant Genome-Related Databases. Asamizu, Erika; Ichihara, Hisako; Nakaya, Akihiro; Nakamura, Yasukazu; Hirakawa, Hideki; Ishii, Takahiro; Tamura, Takuro; Fukami-Kobayashi, Kaoru; Nakajima, Yukari; Tabata, Satoshi // Plant & Cell Physiology;Jan2014, Vol. 55 Issue 1, pe8 

    The Plant Genome DataBase Japan (PGDBj, http://pgdbj.jp/?ln=en) is a portal website that aims to integrate plant genome-related information from databases (DBs) and the literature. The PGDBj is comprised of three component DBs and a cross-search engine, which provides a seamless search over the...

  • Analysis of Brassica rapa ESTs: gene discovery and expression patterns of AP2/ERF family genes. Jing Zhuang; Ai-Sheng Xiong; Ri-He Peng; Feng Gao; Bo Zhu; Jian Zhang; Xiao-Yan Fu; Xiao-Feng Jin; Jian-Min Chen; Zhen Zhang; Yu-Shan Qiao; Quan-Hong Yao // Molecular Biology Reports;Jun2010, Vol. 37 Issue 5, p2485 

    Chinese cabbage ( Brassica rapa subsp. pekinensis) is among the most important vegetables and is widely cultivated in world. Genes in the AP2/ERF family encode transcriptional regulators that serve a variety of functions in the plants. Expressed sequence tags (ESTs) are created by partially...

  • Members of theS-receptor kinase multigene family inSenecio squalidusL. (Asteraceae), a species with sporophytic self-incompatibility. Tabah, David A.; Mclnnis, Stephanie M.; Hiscock, Simon J. // Sexual Plant Reproduction;Sep2004, Vol. 17 Issue 3, p131 

    While the molecular basis of sporophytic self-incompatibility (SSI) has been investigated extensively in the Brassicaceae, almost nothing is known about the molecular regulation of SSI in other families, such as the Asteraceae. In species ofBrassicaand inArabidopsis lyrata, a stigma-specific...

  • Genome-wide identification of BURP domain-containing genes in rice reveals a gene family with diverse structures and responses to abiotic stresses. Xipeng Ding; Xin Hou; Kabin Xie; Lizhong Xiong // Planta;Jun2009, Vol. 230 Issue 1, p149 

    Increasing evidence suggests that a gene family encoding proteins containing BURP domains have diverse functions in plants, but systematic characterization of this gene family have not been reported. In this study, 17 BURP family genes ( OsBURP01– 17) were identified and analyzed in rice...

  • Transcriptome Analysis in Sheepgrass (Leymus chinensis): A Dominant Perennial Grass of the Eurasian Steppe. Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe // PLoS ONE;Jul2013, Vol. 8 Issue 7, p1 

    Background: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of...

  • How Effective Are DNA Barcodes in the Identification of African Rainforest Trees? Parmentier, Ingrid; Duminil, Jérôme; Kuzmina, Maria; Philippe, Morgane; Thomas, Duncan W.; Kenfack, David; Chuyong, George B.; Cruaud, Corinne; Hardy, Olivier J. // PLoS ONE;Apr2013, Vol. 8 Issue 4, p1 

    Background: DNA barcoding of rain forest trees could potentially help biologists identify species and discover new ones. However, DNA barcodes cannot always distinguish between closely related species, and the size and completeness of barcode databases are key parameters for their successful...

  • Transcriptional Regulations on the Low-Temperature-Induced Floral Transition in an Orchidaceae Species, Dendrobium nobile: An Expressed Sequence Tags Analysis. Shan Liang; Ye, Qing-Sheng; Li, Rui-Hong; Leng, Jia-Yi; Li, Mei-Ru; Wang, Xiao-Jing; Li, Hong-Qing // Comparative & Functional Genomics;2012, p1 

    Vernalization-induced flowering is a cold-relevant adaptation inmany species, but little is known about the genetic basis behind in Orchidaceae species. Here, we reported a collection of 15017 expressed sequence tags (ESTs) from the vernalized axillary buds of an Orchidaceae species, Dendrobium...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics