CASCAD: a database of annotated candidate single nucleotide polymorphisms associated with expressed sequences

Guryev, Victor; Berezikov, Eugene; Cuppen, Edwin
January 2005
BMC Genomics;2005, Vol. 6, p10
Academic Journal
Background: With the recent progress made in large-scale genome sequencing projects, a vast amount of novel data is becoming available. Comparative sequence analysis, exploiting sequence information from various resources, can be used to uncover hidden information such as genetic variation. Although enormous numbers of SNPs for a wide variety of organisms have been submitted to NCBI dbSNP and are annotated in most genome assembly viewers, such as Ensembl and the UCSC Genome Browser, these platforms do not easily allow for extensive annotation or for incorporation of experimental data supporting the polymorphism. However, such information is very important for selecting the most promising and useful candidate polymorphisms for use in experimental setups.

Description: The CASCAD database is designed for presentation and query of candidate SNPs that are retrieved by in silico mining of high-throughput sequencing data. Currently, the database provides collections of laboratory rat (Rattus norvegicus) and zebrafish (Danio rerio) candidate SNPs. The database stores detailed information about the raw data supporting each candidate, extensive annotation and links to external databases (e.g. GenBank, Ensembl, UniGene, and LocusLink), verification information, and predictions of a potential effect for non-synonymous polymorphisms in coding regions. The CASCAD website allows searches based on an arbitrary combination of 27 different parameters related to characteristics such as candidate SNP quality, genomic localization, and sequence data source or strain. In addition, the database can be queried with any custom nucleotide sequence of interest. The interface is cross-linked to other public databases and tightly coupled with primer design and local genome assembly interfaces in order to facilitate experimental verification of candidates.

Conclusions: The CASCAD database discloses detailed information on rat and zebrafish candidate SNPs, including the raw data underlying their discovery. An advanced web-based search interface (http://cascad.niob.knaw.nl) provides universal access to the database content and supports a variety of queries for many types of research utilizing single nucleotide polymorphisms.
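
The idea of searching on an arbitrary combination of parameters can be illustrated with a minimal Python sketch. It is not the CASCAD implementation or its API: the `CandidateSNP` record, its field names, the `search` helper, and the three example records are all hypothetical and made up for illustration, and only a handful of fields stand in for the 27 parameters mentioned in the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class CandidateSNP:
    """Hypothetical record; field names are assumptions, not the CASCAD schema."""
    snp_id: str
    species: str
    chromosome: str
    position: int
    quality: float   # assumed score in [0, 1]; higher means better supported
    strain: str
    verified: bool


# A search criterion is any function that accepts or rejects one candidate.
Predicate = Callable[[CandidateSNP], bool]


def search(candidates: Iterable[CandidateSNP], *criteria: Predicate) -> List[CandidateSNP]:
    """Return the candidates that satisfy every supplied criterion.

    With no criteria, every candidate is returned.
    """
    return [c for c in candidates if all(test(c) for test in criteria)]


# Made-up example records, not real database entries.
CANDIDATES = [
    CandidateSNP("example_1", "Rattus norvegicus", "1", 1000, 0.95, "BN", True),
    CandidateSNP("example_2", "Rattus norvegicus", "2", 2000, 0.60, "SD", False),
    CandidateSNP("example_3", "Danio rerio", "5", 3000, 0.92, "TU", False),
]

# Combine two criteria: species and a minimum quality score.
hits = search(
    CANDIDATES,
    lambda c: c.species == "Rattus norvegicus",
    lambda c: c.quality >= 0.9,
)
print([c.snp_id for c in hits])
```

Because each criterion is an independent predicate, further conditions (for example on chromosome, strain, or verification status) can be added or dropped without changing the `search` function itself.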


Related Articles

  • Whole genome comparative analysis of channel catfish (Ictalurus punctatus) with four model fish species. Yanliang Jiang; Xiaoyu Gao; Shikai Liu; Yu Zhang; Hong Liu; Fanyue Sun; Lisui Bao; Waldbieser, Geoff; Zhanjiang Liu // BMC Genomics;2013, Vol. 14 Issue 1, p1 

    Background Comparative mapping is a powerful tool to study evolution of genomes. It allows transfer of genome information from the well-studied model species to non-model species. Catfish is an economically important aquaculture species in United States. A large amount of genome resources have...

  • A study of biases of DNA copy number estimation based on PICR model. Wang, Quan; Qu, Jianghan; Cheng, Xiaoxing; Kang, Yongjian; Wan, Lin; Qian, Minping; Deng, Minghua // Frontiers of Mathematics in China;Dec2011, Vol. 6 Issue 6, p1203 

    Affymetrix single-nucleotide polymorphism (SNP) arrays have been widely used for SNP genotype calling and copy number variation (CNV) studies, both of which are dependent on accurate DNA copy number estimation significantly. However, the methods for copy number estimation may suffer from kinds...

  • Detecting disease rare alleles using single SNPs in families and haplotyping in unrelated subjects from the Genetic Analysis Workshop 17 data. Kraja, Aldi T.; Czajkowski, Jacek; Feitosa, Mary F.; Borecki, Ingrid B.; Province, Michael A. // BMC Proceedings;2011 Supplement 9, Vol. 5 Issue Suppl 9, p1 

    We present an evaluation of discovery power for two association tests that work well with common alleles but are applied to the Genetic Analysis Workshop 17 simulations with rare causative single-nucleotide polymorphisms (SNPs) (minor allele frequency [MAF] < 1%). The methods used were...

  • Quantifying single nucleotide variant detection sensitivity in exome sequencing. Meynert, Alison M.; Bicknell, Louise S.; Hurles, Matthew E.; Jackson, Andrew P.; Taylor, Martin S. // BMC Bioinformatics;2013, Vol. 14 Issue 1, p1 

    Background: The targeted capture and sequencing of genomic regions has rapidly demonstrated its utility in genetic studies. Inherent in this technology is considerable heterogeneity of target coverage and this is expected to systematically impact our sensitivity to detect genuine polymorphisms....

  • DArT markers for the rye genome--genetic diversity and mapping. Bolibok-Brągoszewska, Hanna; Heller-Uszyńska, Katarzyna; Wenzl, Peter; Uszyński, Grzegorz; Kilian, Andrzej; Rakoczy-Trojanowska, Monika // BMC Genomics;2009, Vol. 10, p578 

    Background: Implementation of molecular breeding in rye (Secale cereale L.) improvement programs depends on the availability of high-density molecular linkage maps. However, the number of sequence-specific PCR-based markers available for the species is limited. Diversity Arrays Technology (DArT)...

  • Population Genomics of Parallel Adaptation in Threespine Stickleback using Sequenced RAD Tags. Hohenlohe, Paul A.; Bassham, Susan; Etter, Paul D.; Stiffler, Nicholas; Johnson, Eric A.; Cresko, William A. // PLoS Genetics;Feb2010, Vol. 6 Issue 2, p1 

    Next-generation sequencing technology provides novel opportunities for gathering genome-scale sequence data in natural populations, laying the empirical foundation for the evolving field of population genomics. Here we conducted a genome scan of nucleotide diversity and differentiation in...

  • A hitchhiker's guide to expressed sequence tag (EST) analysis. Nagaraj, Shivashankar H.; Gasser, Robin B.; Ranganathan, Shoba // Briefings in Bioinformatics;Jan2007, Vol. 8 Issue 1, p6 

    Expressed sequence tag (EST) sequencing projects are underway for numerous organisms, generating millions of short, single-pass nucleotide sequence reads, accumulating in EST databases. Extensive computational strategies have been developed to organize and analyse both small- and large-scale EST...

  • PGAT: a multistrain analysis resource for microbial genomes. Brittnacher, M. J.; Fong, C.; Hayden, H. S.; Jacobs, M. A.; Radey, Matthew; Rohmer, L. // Bioinformatics;Sep2011, Vol. 27 Issue 17, p2429 

    Motivation: The Prokaryotic-genome Analysis Tool (PGAT) is a web-based database application for comparing gene content and sequence across multiple microbial genomes facilitating the discovery of genetic differences that may explain observed phenotypes. PGAT supports database queries to identify...

  • Evaluation of cis-regulatory function in zebrafish. Pashos, Evanthia E.; Kague, Erika; Fisher, Shannon // Briefings in Functional Genomics & Proteomics;Nov2008, Vol. 7 Issue 6, p465 

    As increasing numbers of vertebrate genomes are sequenced, comparative genomics offers tremendous promise to unveil mechanisms of transcriptional gene regulation on a large scale. However, the challenge of analysing immense amounts of sequence data and relating primary sequence to function is...

