WildSilkbase: An EST database of wild silkmoths

Arunkumar, K. P.; Tomar, Archana; Daimon, Takaaki; Shimada, Toru; Nagaraju, J.
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: Functional genomics has particular promise in silkworm biology for identifying genes involved in a variety of biological functions that include: synthesis and secretion of silk, sex determination pathways, insect-pathogen interactions, chorionogenesis, molecular clocks. Wild silkmoths have hardly been the subject of detailed scientific investigations, owing largely to non-availability of molecular and genetic data on these species. As a first step, in the present study we generated large scale expressed sequence tags (EST) in three economically important species of wild silkmoths. In order to make these resources available for the use of global scientific community, an EST database called 'WildSilkbase' was developed. Description: WildSilkbase is a catalogue of ESTs generated from several tissues at different developmental stages of 3 economically important saturniid silkmoths, an Indian golden silkmoth, Antheraea assama, an Indian tropical tasar silkmoth, A. mylitta and eri silkmoth, Samia cynthia ricini. Currently the database is provided with 57,113 ESTs which are clustered and assembled into 4,019 contigs and 10,019 singletons. Data can be browsed and downloaded using a standard web browser. Users can search the database either by BLAST query, keywords or Gene Ontology query. There are options to carry out searches for species, tissue and developmental stage specific ESTs in BLAST page. Other features of the WildSilkbase include cSNP discovery, GO viewer, homologue finder, SSR finder and links to all other related databases. The WildSilkbase is freely available from http://www.cdfd.org.in/wildsilkbase/. Conclusion: A total of 14,038 putative unigenes was identified in 3 species of wild silkmoths. These genes provide important resources to gain insight into the functional and evolutionary study of wild silkmoths. We believe that WildSilkbase will be extremely useful for all those researchers working in the areas of comparative genomics, functional genomics and molecular evolution in general, and gene discovery, gene organization, transposable elements and genome variability of insect species in particular.


Related Articles

  • CGAT: computational genomics analysis toolkit. Sims, David; Ilott, Nicholas E.; Sansom, Stephen N.; Sudbery, Ian M.; Johnson, Jethro S.; Fawcett, Katherine A.; Berlanga-Taylor, Antonio J.; Luna-Valero, Sebastian; Ponting, Chris P.; Heger, Andreas // Bioinformatics;May2014, Vol. 30 Issue 9, p1290 

    Summary: Computational genomics seeks to draw biological inferences from genomic datasets, often by integrating and contextualizing next-generation sequencing data. CGAT provides an extensive suite of tools designed to assist in the analysis of genome scale data from a range of standard file...

  • Genome-wide synteny through highly sensitive sequence alignment: Satsuma. Grabherr, Manfred G.; Russell, Pamela; Meyer, Miriah; Mauceli, Evan; Alföldi, Jessica; Di Palma, Federica; Lindblad-Toh, Kerstin; Frishman, Dmitrij // Bioinformatics;May2010, Vol. 26 Issue 9, p1145 

    Motivation: Comparative genomics heavily relies on alignments of large and often complex DNA sequences. From an engineering perspective, the problem here is to provide maximum sensitivity (to find all there is to find), specificity (to only find real homology) and speed (to accommodate the...

  • Measuring guide-tree dependency of inferred gaps in progressive aligners. Capella-Gutiérrez, Salvador; Gabaldón, Toni // Bioinformatics;Apr2013, Vol. 29 Issue 8, p1011 

    Motivation: Multiple sequence alignments are generally reconstructed using a progressive approach that follows a guide-tree. During this process, gaps are introduced at a cost to maximize residue pairing, but it is unclear whether inferred gaps reflect actual past events of sequence insertions...

  • Promoter Sequences Prediction Using Relational Association Rule Mining. Czibula, Gabriela; Bocicor, Maria-Iuliana; Gergely Czibula, Istvan // Evolutionary Bioinformatics;2012, Issue 8, p181 

    In this paper we are approaching, from a computational perspective, the problem of promoter sequences prediction, an important problem within the field of bioinformatics. As the conditions for a DNA sequence to function as a promoter are not known, machine learning based classification models...

  • Laboratory Information Management Software for genotyping workflows: applications in high throughput crop genotyping. Jayashree, B; Reddy, Praveen T; Leeladevi, Y; Crouch, Jonathan H; Mahalakshmi, V; Buhariwalla, Hutokshi K; Eshwar, KE; Mace, Emma; Folksterma, Rolf; Senthilvel, S; Varshney, Rajeev K; Seetha, K; Rajalakshmi, R; Prasanth, VP; Chandra, Subhash; Swarupa, L; SriKalyani, P; Hoisington, David A // BMC Bioinformatics;2006, Vol. 7, p383 

    Background: With the advances in DNA sequencer-based technologies, it has become possible to automate several steps of the genotyping process leading to increased throughput. To efficiently handle the large amounts of genotypic data generated and help with quality control, there is a strong need...

  • A Design of a Hybrid System for DNA Sequence Alignment. Khaled, Heba; Faheem, Hossam M.; Hasan, Tayseer; Ghoneimy, Saeed // International MultiConference of Engineers & Computer Scientists;2008, p162 

    This paper describes a parallel algorithm and its needed architecture and a complementary sequential algorithm for solving sequence alignment problem on DNA (Deoxyribonucleic acid) molecules. The parallel algorithm is considered much faster than sequential algorithms used to perform sequence...

  • QTrim: a novel tool for the quality trimming of sequence reads generated using the Roche/454 sequencing platform. Shrestha, Ram Krishna; Lubinsky, Baruch; Bansode, Vijay B.; Moinz, Mónica B. J.; McCormack, Grace P.; Travers, Simon A. // BMC Bioinformatics;2014, Vol. 15 Issue 1, p2 

    Background Many high throughput sequencing (HTS) approaches, such as the Roche/454 platform, produce sequences in which the quality of the sequence (as measured by a Phred-like quality scores) decreases linearly across a sequence read. Undertaking quality trimming of this data is essential to...

  • SSR Locator: Tool for Simple Sequence Repeat Discovery Integrated with Primer Design and PCR Simulation. Carlos daMaia, Luciano; Palmieri, Dario Abel; de Souza, Velci Queiroz; Kopp, Mauricio Marini; de Carvalho, Fernando Irajá Félix; de Oliveira, Antonio Costa // International Journal of Plant Genomics;2008, Vol. 2008, p1 

    Microsatellites or SSRs (simple sequence repeats) are ubiquitous short tandem duplications occurring in eukaryotic organisms. These sequences are among the best marker technologies applied in plant genetics and breeding. The abundant genomic, BAC, and EST sequences available in databases allow...

  • Coverage tradeoffs and power estimation in the design of whole-genome sequencing experiments for detecting association. Shen, Yufeng; Song, Ruijie; Pe'er, Itsik // Bioinformatics;Jul2011, Vol. 27 Issue 14, p1995 

    Motivation: Whole-genome sequencing (WGS) allows direct interrogation of previously undetected uncommon or rare variants, which potentially contribute to the missing heritability of human disease. However, cost of sequencing large numbers of samples limits its application in case–control...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics