Matching curated genome databases: a non trivial task

January 2008
BMC Genomics;2008 Supplement 2, Vol. 9, p501
Academic Journal
No abstract available.


Related Articles

  • motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites. Coetzee, Simon G.; Coetzee, Gerhard A.; Hazelett, Dennis J. // Bioinformatics;12/1/2015, Vol. 31 Issue 23, p3847 

    Functional annotation represents a key step toward the understanding and interpretation of germline and somatic variation as revealed by genome-wide association studies (GWAS) and The Cancer Genome Atlas (TCGA), respectively. GWAS have revealed numerous genetic risk variants residing in...

  • Multi-sorting algorithm for finding pairs of similar short substrings from large-scale string data. Uno, Takeaki // Knowledge & Information Systems;Nov2010, Vol. 25 Issue 2, p229 

    Finding similar substrings/substructures is a central task in analyzing huge string data such as genome sequences, Web documents, log data, feature vectors of pictures, photos, videos, etc. Although the existence of polynomial time algorithms for such problems is trivial since the number of...

  • A computational platform to maintain and migrate manual functional annotations for BioCyc databases. Walsh, Jesse R.; Sen, Taner Z.; Dickerson, Julie A. // BMC Systems Biology;2014, Vol. 8 Issue 1, p1 

    Background BioCyc databases are an important resource for information on biological pathways and genomic data. Such databases represent the accumulation of biological data, some of which has been manually curated from literature. An essential feature of these databases is the continuing data...

  • Advances in the Exon--Intron Database (EID). Shepelev, Valery; Fedorov, Alexei // Briefings in Bioinformatics;Jun2006, Vol. 7 Issue 2, p178 

    Investigation of exon-intron gene structures is a non-trivial task due to enormous expansions of the eukaryotic genomes, great variety of gene forms, and the imperfectness in sequence data. A number of available informational systems on various gene characteristics complement each other and are...

  • Automated hierarchical classification of protein domain subfamilies based on functionally-divergent residue signatures. Neuwald, Andrew F.; Lanczycki, Christopher J.; Marchler-Bauer, Aron // BMC Bioinformatics;2012, Vol. 13 Issue 1, p1 

    Background: The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain...

  • Weakly supervised learning of biomedical information extraction from curated data. Jain, Suvir; R., Kashyap; Tsung-Ting Kuo; Bhargava, Shitij; Lin, Gordon; Chun-Nan Hsu // BMC Bioinformatics;1/11/2016, Vol. 17, p1 

    Background: Numerous publicly available biomedical databases derive data by curating from literatures. The curated data can be useful as training examples for information extraction, but curated data usually lack the exact mentions and their locations in the text required for supervised machine...

  • A Survey on Preprocessing in Text Mining. Anadakumar, K.; Padmavathy, V. // International Journal of Advanced Research in Computer Science;Jul/Aug2013, Vol. 4 Issue 9, p79 

    Now-a-days information's are stored electronically in databases. Extracting reliable, unknown and useful information from the abundant source is an eminent task. Data mining and Text mining are the process for extracting unknown and useful information. Text Mining is the process of extracting...

  • Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction. Palmer, Lance E; Dejori, Mathaeus; Bolanos, Randall; Fasulo, Daniel // BMC Bioinformatics;2010, Vol. 11, Special section p1 

    Background: With the rapid expansion of DNA sequencing databases, it is now feasible to identify relevant information from prior sequencing projects and completed genomes and apply it to de novo sequencing of new organisms. As an example, this paper demonstrates how such extra information can be...

  • VEGA, the genome browser with a difference. Loveland, Jane // Briefings in Bioinformatics;Jun2005, Vol. 6 Issue 2, p189 

    The Vertebrate Genome Annotation (Vega) database is a community resource for browsing manual annotation from a variety of vertebrate genomes of finished sequence (http://vega.sanger.ac.uk). Vega is different from other genome browsers as it has a standardised classification of genes which...

  • The Functional RNA Database 3.0: databases to support mining and annotation of functional RNAs. Mituyama, Toutai; Yamada, Kouichirou; Hattori, Emi; Okida, Hiroaki; Ono, Yukiteru; Terai, Goro; Yoshizawa, Aya; Komori, Takashi; Asai, Kiyoshi // Nucleic Acids Research;2008, Vol. 37 Issue suppl_1, pD89 

    We developed a pair of databases that support two important tasks: annotation of anonymous RNA transcripts and discovery of novel non-coding RNAs. The database combo is called the Functional RNA Database and consists of two databases: a rewrite of the original version of the Functional RNA...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics