SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions

Kuo, Byron Yu-Lin; Chen, Ying; Bohacec, Slavita; Johansson, Öjvind; Wasserman, Wyeth W.; Simpson, Elizabeth M.
April 2006
PLoS Computational Biology;Apr2006, Vol. 2 Issue 4, pe34
Academic Journal
Serial analysis of gene expression (SAGE) is not only a method for profiling the global expression of genes but also offers an opportunity to discover novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript nor to the genome were hypothesized to span a splice junction for which the exon combination or the exon(s) are unknown. To test this hypothesis, we developed an algorithm, SAGE2Splice, to efficiently map SAGE tags to potential splice junctions in a genome. The algorithm consists of three search levels. A scoring scheme based on position weight matrices was designed to assess the quality of candidates. Using optimized parameters for SAGE2Splice analysis and two sets of SAGE data, candidate junctions were discovered for 5%-6% of unmapped tags. Candidates were classified into three categories, reflecting the previous annotations of the putative splice junctions. Analysis of predicted tags extracted from EST sequences demonstrated that candidates whose splice junction lies closer to the center of the tag are more reliable. Nine of 12 candidates tested were validated by RT-PCR and sequencing, and among these, four revealed previously uncharacterized exons. Thus, SAGE2Splice provides a new functionality for the identification of novel transcripts and exons. SAGE2Splice is available online at http://www.cisreg.ca.
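
The idea of mapping an unmapped tag to a potential splice junction can be illustrated with a minimal Python sketch. This is not the SAGE2Splice implementation: the abstract does not describe the three search levels or the position weight matrix scoring in detail, so the sketch swaps in a naive exhaustive search that requires exact GT...AG intron boundaries. The function names, the `min_edge` and `max_intron` parameters, and the toy sequences are all assumptions made for illustration.

```python
def find_all(haystack, needle):
    """Yield every start index of needle in haystack (overlaps allowed)."""
    start = haystack.find(needle)
    while start != -1:
        yield start
        start = haystack.find(needle, start + 1)


def candidate_junctions(tag, genome, min_edge=3, max_intron=1000):
    """Naively list (split, donor_end, acceptor_start) candidates for one tag.

    split          number of tag bases assigned to the upstream exon
    donor_end      genome index just past the upstream exon part
    acceptor_start genome index where the downstream exon part begins

    Hypothetical simplification: only canonical GT...AG introns on the
    forward strand are considered, and matches must be exact.
    """
    hits = []
    for split in range(min_edge, len(tag) - min_edge + 1):
        left, right = tag[:split], tag[split:]
        # Upstream exon end must be followed by a GT donor dinucleotide.
        for i in find_all(genome, left + "GT"):
            donor_end = i + split
            # Downstream exon start must be preceded by an AG acceptor.
            for j in find_all(genome, "AG" + right):
                acceptor_start = j + 2
                intron_len = acceptor_start - donor_end
                if 4 <= intron_len <= max_intron:
                    hits.append((split, donor_end, acceptor_start))
    return hits


# Toy example (invented sequences): two exons separated by a short intron.
exon1, intron, exon2 = "ACGTACCATG", "GTAAATTTCAG", "CTTGGATCCA"
genome = "TTT" + exon1 + intron + exon2 + "AAA"
tag = exon1[-5:] + exon2[:5]  # spans the exon-exon junction

print(tag in genome)                     # False: the tag does not map directly
print(candidate_junctions(tag, genome))  # [(5, 13, 24)]
```

In this toy case the only candidate places the junction after the fifth base of the 10-base tag, that is, at its center, which is the situation the abstract reports as most reliable. A realistic search would also need to handle the reverse strand, rank candidates with a scoring scheme, and avoid the quadratic scan used here.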


Related Articles

  • Evolutionary changes in cis and trans gene regulation. Wittkopp, Patricia J.; Haerum, Belinda K.; Clark, Andrew G. // Nature;7/1/2004, Vol. 430 Issue 6995, p85 

    Differences in gene expression are central to evolution. Such differences can arise from cis-regulatory changes that affect transcription initiation, transcription rate and/or transcript stability in an allele-specific manner, or from trans-regulatory changes that modify the activity or...

  • Identification of QTLs controlling gene expression networks defined a priori. Kliebenstein, Daniel J; West, Marilyn AL; van Leeuwen, Hans; Loudet, Olivier; Doerge, RW; St Clair, Dina A // BMC Bioinformatics;2006, Vol. 7, p1 

    Background: Gene expression microarrays allow the quantification of transcript accumulation for many or all genes in a genome. This technology has been utilized for a range of investigations, from assessments of gene regulation in response to genetic or environmental fluctuation to global...

  • Regulatory Snapshots: Integrative Mining of Regulatory Modules from Expression Time Series and Regulatory Networks. Gonçalves, Joana P.; Aires, Ricardo S.; Francisco, Alexandre P.; Madeira, Sara C. // PLoS ONE;May2012, Vol. 7 Issue 5, p1 

    Explaining regulatory mechanisms is crucial to understand complex cellular responses leading to system perturbations. Some strategies reverse engineer regulatory interactions from experimental data, while others identify functional regulatory units (modules) under the assumption that biological...

  • Multiple Promoters and Alternative Splicing: Hoxa5 Transcriptional Complexity in the Mouse Embryo. Coulombe, Yan; Lemieux, Margot; Moreau, Julie; Aubin, Josée; Joksimovic, Milan; Bérubé-Simard, Félix-Antoine; Tabariès, Sébastien; Boucherat, Olivier; Guillou, François; Larochelle, Christian; Tuggle, Christopher K.; Jeannotte, Lucie // PLoS ONE;2010, Vol. 5 Issue 5, p1 

    Background: The genomic organization of Hox clusters is fundamental for the precise spatio-temporal regulation and the function of each Hox gene, and hence for correct embryo patterning. Multiple overlapping transcriptional units exist at the Hoxa5 locus reflecting the complexity of Hox...

  • Directed Mammalian Gene Regulatory Networks Using Expression and Comparative Genomic Hybridization Microarray Data from Radiation Hybrids. Ahn, Sangtae; Wang, Richard T.; Park, Christopher C.; Lin, Andy; Leahy, Richard M.; Lange, Kenneth; Smith, Desmond J. // PLoS Computational Biology;Jun2009, Vol. 5 Issue 6, p1 

    Meiotic mapping of quantitative trait loci regulating expression (eQTLs) has allowed the construction of gene networks. However, the limited mapping resolution of these studies has meant that genotype data are largely ignored, leading to undirected networks that fail to capture regulatory...

  • Regional copy number—independent deregulation of transcription in cancer. Stransky, Nicolas; Vallot, Céline; Reyal, Fabien; Bernard-Pierrot, Isabelle; de Medina, Sixtina Gil Diez; Segraves, Rick; de Rycke, Yann; Elvin, Paul; Cassidy, Andrew; Spraggon, Carolyn; Graham, Alexander; Southgate, Jennifer; Asselain, Bernard; Allory, Yves; Abbou, Claude C.; Albertson, Donna G.; Thiery, Jean Paul; Chopin, Dominique K.; Pinkel, Daniel; Radvanyi, François // Nature Genetics;Dec2006, Vol. 38 Issue 12, p1386 

    Genetic and epigenetic alterations have been identified that lead to transcriptional deregulation in cancers. Genetic mechanisms may affect single genes or regions containing several neighboring genes, as has been shown for DNA copy number changes. It was recently reported that epigenetic...

  • Cerd4, third member of the d4 gene family: expression and organization of genomic locus. Ninkina, Natalia N.; Mertsalov, Ilja B.; Kulikova, Dina A.; Alimova-Kost, Maria V.; Simonova, Olga B.; Korochkin, Leonid I.; Kiselev, Sergey L.; Buchman, Vladimir L. // Mammalian Genome;Nov2001, Vol. 12 Issue 11, p862 

    Two members of the d4 family of presumptive transcription modulators, neuro-d4 (Neud4) and ubi-d4/Requiem (Req), have been characterized previously. We cloned and characterized the third member of this gene family, cer-d4 (Cerd4), from chicken and mouse cDNA libraries. The expression patterns of...

  • Complex imprinting. Beaudet, Arthur L. // Nature Genetics;Aug2004, Vol. 36 Issue 8, p793 

    Two new papers report advances in the understanding of the GNAS complex locus of overlapping and oppositely directed transcripts that are subject to genomic imprinting.

  • Transcriptional Regulation of an Evolutionary Conserved Intergenic Region of CDT2-INTS7. Nakagawa, Hiroki; Tategu, Moe; Yamauchi, Rieko; Sasaki, Kaori; Sekimachi, Sota; Yoshida, Kenichi // PLoS ONE;2008, Vol. 3 Issue 1, p1 

    Background. In the mammalian genome, a substantial number of gene pairs (approximately 10%) are arranged head-to-head on opposite strands within 1,000 base pairs, and separated by a bidirectional promoter(s) that generally drives the co-expression of both genes and results in functional...

