Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology

Cheung, Foo; Haas, Brian J; Goldberg, Susanne MD; May, Gregory D; Xiao, Yongli; Town, Christopher D
January 2006
BMC Genomics;2006, Vol. 7, p1
Academic Journal
Background: In this study, we addressed whether a single 454 Life Sciences GS20 sequencing run provides new gene discovery from a normalized cDNA library, and whether the short reads produced by this technology are of value in gene structure annotation.
Results: A single 454 GS20 sequencing run on adapter-ligated cDNA from a normalized cDNA library generated 292,465 reads, which cleaning reduced to 252,384 reads with an average read length of 92 nucleotides. After clustering and assembly, a total of 184,599 unique sequences were generated, containing over 400 SSRs. The 454 sequences generated hits to more genes than a comparable amount of sequence from MtGI. Although short, the 454 reads are of sufficient length to map to a unique genome location as effectively as longer ESTs produced by conventional sequencing. Functional interpretation of the sequences, carried out by Gene Ontology (GO) assignments from matches to Arabidopsis, showed that they cover a broad range of GO categories. 53,796 assemblies and singletons (29%) had no match in the existing MtGI. Within the previously unobserved Medicago transcripts, thousands had matches in a comprehensive protein database and one or more of the TIGR Plant Gene Indices. Approximately 20% of these novel sequences could be found in the Medicago genome sequence. A total of 70,026 reads generated by the 454 technology were mapped to 785 Medicago finished BACs using PASA, and over 1,000 gene models required modification. In parallel to 454 sequencing, 4,445 5' reads were generated by conventional sequencing from the same library; the assembled sequences showed that the library contains about 52% full-length cDNAs encoding proteins from 50 to over 500 amino acids in length.
Conclusion: Because of the large number of reads it affords, 454 DNA sequencing technology is effective in revealing the expression of transcripts from a broad range of GO categories and in capturing many rare transcripts from normalized cDNA libraries, although only a limited portion of each transcript's sequence is uncovered. As with longer ESTs, 454 reads can be mapped uniquely onto genomic sequence to provide support for, and modifications of, gene predictions.


Related Articles

  • Cloning and bioinformatic analysis of an acidophilic β-mannanase gene, Anman5A, from Aspergillus niger LW-1. Zhao, S.; Wu, M.; Tang, C.; Gao, S.; Zhang, H.; Li, J. // Applied Biochemistry & Microbiology;Sep2012, Vol. 48 Issue 5, p473 

    Using 3′ and 5′ rapid amplification of cDNA ends (RACE) techniques, the full-length cDNA sequence of Anman5A, a gene that encodes an acidophilic β-mannanase of Aspergillus niger LW-1 (abbreviated to AnMan5A), was identified from the total RNA. The cDNA sequence was 1417 bp in...

  • Expression, purification and characterization of an analgesic peptide from Buthus martensii Karsch in Pichia pastoris. Jin-Ling Yang; Ping Zhu; Gui-Fang Cheng; Ke-Di Cheng; Hui-Xia He; Hui-Xin Zhu // World Journal of Microbiology & Biotechnology;Nov2009, Vol. 25 Issue 11, p2053 

    BmK AngM1 is an analgesic peptide from the venom of Buthus martensii Karsch (BmK). The synthetic gene encoding BmK AngM1 was optimized on the basis of its cDNA sequence and the codon usage preference of Pichia pastoris. The codon-optimized gene was cloned into pPIC9K and then...

  • A platform independent RNA-Seq protocol for the detection of transcriptome complexity. Calabrese, Claudia; Mangiulli, Marina; Manzari, Caterina; Paluscio, Anna Maria; Caratozzolo, Mariano Francesco; Marzano, Flaviana; Kurelac, Ivana; D'Erchia, Anna Maria; D'Elia, Domenica; Licciulli, Flavio; Liuni, Sabino; Picardi, Ernesto; Attimonelli, Marcella; Gasparre, Giuseppe; Porcelli, Anna Maria; Pesole, Graziano; Sbisà, Elisabetta; Tullo, Apollonia // BMC Genomics;2013, Vol. 14 Issue 1, p1 

    Background: Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed, and only a small fraction of these transcripts is annotated as protein-coding genes and their splice variants. Indeed, most transcripts are the result...

  • Molecular characterization and expression analysis of Hsp90 in Schizothorax prenanti. Pu, Yundan; Zhu, Jieyao; Wang, Hong; Zhang, Xin; Hao, Jin; Wu, Yuanbin; Geng, Yi; Wang, Kaiyu; Li, Zhiqiong; Zhou, Jian; Chen, Defang // Cell Stress & Chaperones;Nov2016, Vol. 21 Issue 6, p983 

    Aquatic animals suffer from various environmental stresses because the aquatic environment is a very complex system. To monitor the health status of fish, Hsp90, a potential early warning marker, was determined in Schizothorax prenanti after infection with a bacterium. In this study, we cloned...

  • Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project. Fernández, Paula; Paniego, Norma; Lew, Sergio; Hopp, H. Esteban; Heinz, Ruth A. // BMC Genomics;2003, Vol. 4, p40 

    Background: Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue, avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low-abundance transcripts and thus, affecting the efficiency and...

  • The Linguistics of DNA. Searls, David B. // American Scientist;Nov/Dec92, Vol. 80 Issue 6, p579 

    Discusses the construction of a grammar of genes in the quest to understand the language of life. Information on a generative grammar; Details of biological palindromes and genetic codes; Importance of understanding the linguistics of DNA sequences.

  • GENE ORGANIZATION AND EXPRESSION OF THE DIVALENT CATION TRANSPORTER NRAMP IN THE PROTISTAN PARASITE PERKINSUS MARINUS. Robledo, José-Antonio F.; Courville, Pascal; Cellier, Mathieu F. M.; Vasta, Gerardo R. // Journal of Parasitology;Oct2004, Vol. 90 Issue 5, p1004

    Focuses on the molecular characterization and expression of an Nramp homologue in a protistan parasite Perkinsus marinus and in the Alveolata overall. Determination of the intracellular parasite-host interaction outcome; Effect of cation depletion and repletion on Perkinsus marinus Nramp...

  • Crystal structure of ATF-2/c-Jun and IRF-3 bound to the interferon-β enhancer. Panne, Daniel; Maniatis, Tom; Harrison, Stephen C. // EMBO Journal;11/10/2004, Vol. 23 Issue 22, p4384

    Transcriptional activation of the interferon-β (IFN-β) gene requires assembly of an enhanceosome containing the transcription factors ATF-2/c-Jun, IRF-3/IRF-7, NF-κB and HMGI(Y). These factors cooperatively bind a composite DNA site and activate expression of the IFN-β gene. The...

  • Large-Scale Sequencing Analysis of the Full-Length cDNA Library of Human Hepatocellular Carcinoma. Chia-Chu Tsai; Yi-Da Chung; Hong-Jen Lee; Wen-Hsin Chang; Suzuki, Yutaka; Sugano, Sumio; Jung-Yaw Lin // Journal of Biomedical Science;Nov/Dec2003, Vol. 10 Issue 6, p636

    Hepatocellular carcinoma (HCC) is one of the human cancers clearly linked to viral infections. Although the major risk factors for HCC development have been elucidated, the hepatocellular carcinogenesis pathway resulting in malignant transformation of liver cells remains to be clarified....

