Batch Blast Extractor: an automated blastx parser application

Pirooznia, Mehdi; Perkins, Edward J.; Deng, Youping
January 2008
BMC Genomics;2008 Supplement 2, Vol. 9, Special section p1
Academic Journal
Motivation: BLAST programs are very efficient at finding sequence similarities. However, for large datasets such as ESTs, the information in batch BLAST output still has to be extracted manually, which can be time-consuming, incomplete, and inaccurate. A parser application would therefore be extremely useful for extracting information from BLAST output. Results: We have developed a Java application, Batch Blast Extractor, with a user-friendly graphical interface to extract information from BLAST output. The application generates a tab-delimited text file that can be easily imported into a spreadsheet or statistical package such as Excel or SPSS for further analysis. For each BLAST hit, the program obtains and saves the essential features from the BLAST output file that would allow further analysis. The program is written in Java and is therefore OS independent. It runs on both Windows and Linux with Java 1.4 or higher. It is freely available from: http://mcbc.usm.edu/BatchBlastExtractor/
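
The extraction step described in the abstract can be illustrated with a minimal sketch. This is not the authors' Java code: it is written in Python for brevity, it assumes the classic plain-text pairwise BLAST report layout (`Query=`, `>` subject lines, `Score = ... Expect = ...`, `Identities = ...`, `Frame = ...`), and the function names and the set of columns are hypothetical, since the abstract does not list which features the tool saves.

```python
import re

# Assumed column set; the abstract does not say which fields the real tool saves.
FIELDS = ["query", "subject", "bits", "evalue", "identities", "frame"]

# Patterns for the classic plain-text pairwise BLAST report (an assumption).
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits.*?Expect(?:\(\d+\))?\s*=\s*(\S+)")
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+/\d+)")
FRAME_RE = re.compile(r"Frame\s*=\s*([+-]\d)")


def extract_hits(blast_text):
    """Return one dict per alignment (HSP) found in plain-text BLAST output."""
    hits = []
    query = ""
    subject = ""
    current = None  # the record that Identities/Frame lines belong to
    for raw in blast_text.splitlines():
        line = raw.strip()
        if line.startswith("Query="):
            query = line[len("Query="):].strip()
            subject = ""
            current = None
            continue
        if line.startswith(">"):
            # Only the first line of a wrapped subject description is kept.
            subject = line[1:].strip()
            current = None
            continue
        match = SCORE_RE.search(line)
        if match:
            current = {
                "query": query,
                "subject": subject,
                "bits": match.group(1),
                "evalue": match.group(2).rstrip(","),
                "identities": "",
                "frame": "",
            }
            hits.append(current)
            continue
        if current is None:
            continue
        match = IDENT_RE.search(line)
        if match:
            current["identities"] = match.group(1)
            continue
        match = FRAME_RE.search(line)
        if match:
            current["frame"] = match.group(1)
    return hits


def to_tsv(hits):
    """Render the records as tab-delimited text with a header row."""
    rows = ["\t".join(FIELDS)]
    for hit in hits:
        rows.append("\t".join(hit[name] for name in FIELDS))
    return "\n".join(rows) + "\n"
```

Writing the result of `to_tsv(extract_hits(text))` to a `.txt` file would give a table that spreadsheet or statistics software can import directly; values are kept as strings so that E-values are written out exactly as they appear in the report.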


Related Articles

  • JDet: interactive calculation and visualization of function-related conservation patterns in multiple sequence alignments and structures. Muth, Thilo; García-Martín, Juan A.; Rausell, Antonio; Juan, David; Valencia, Alfonso; Pazos, Florencio // Bioinformatics;Feb2012, Vol. 28 Issue 4, p584 

    Summary: We have implemented in a single package all the features required for extracting, visualizing and manipulating fully conserved positions as well as those with a family-dependent conservation pattern in multiple sequence alignments. The program allows, among other things, to run...

  • Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Andrew M. Waterhouse; James B. Procter; David M. A. Martin; Michèle Clamp; Geoffrey J. Barton // Bioinformatics;May2009, Vol. 25 Issue 9, p1189 

    Summary: Jalview Version 2 is a system for interactive WYSIWYG editing, analysis and annotation of multiple sequence alignments. Core features include keyboard and mouse-based editing, multiple views and alignment overviews, and linked structure display with Jmol. Jalview 2 is available in two...

  • DASher: a stand-alone protein sequence client for DAS, the Distributed Annotation System. David N. Messina; Erik L. L. Sonnhammer // Bioinformatics;May2009, Vol. 25 Issue 10, p1333 

    Summary: The rise in biological sequence data has led to a proliferation of separate, specialized databases. While there is great value in having many independent annotations, it is critical that there be a way to integrate them in one combined view. The Distributed Annotation System (DAS) was...

  • BamView: viewing mapped read alignment data in the context of the reference sequence. Carver, Tim; Böhme, Ulrike; Otto, Thomas D.; Parkhill, Julian; Berriman, Matthew // Bioinformatics;Mar2010, Vol. 26 Issue 5, p676 

    Summary: BamView is an interactive Java application for visualizing the large amounts of data stored for sequence reads which are aligned against a reference genome sequence. It supports the BAM (Binary Alignment/Map) format. It can be used in a number of contexts including SNP calling and...

  • ACCUSA—accurate SNP calling on draft genomes. Fröhler, Sebastian; Dieterich, Christoph // Bioinformatics;May2010, Vol. 26 Issue 10, p1364 

    Summary: Next generation sequencing technologies facilitate genome-wide analysis of several biological processes. We are interested in whole-genome genotyping. To our knowledge, none of the existing single nucleotide polymorphism (SNP) callers consider the quality of the reference genome, which...

  • BioJava: an open-source framework for bioinformatics in 2012. Prlić, Andreas; Yates, Andrew; Bliven, Spencer E.; Rose, Peter W.; Jacobsen, Julius; Troshin, Peter V.; Chapman, Mark; Gao, Jianjiong; Koh, Chuan Hock; Foisy, Sylvain; Holland, Richard; Rimša, Gediminas; Heuer, Michael L.; Brandstätter–Müller, H.; Bourne, Philip E.; Willis, Scooter // Bioinformatics;Oct2012, Vol. 28 Issue 20, p2693 

    Motivation: BioJava is an open-source project for processing of biological data in the Java programming language. We have recently released a new version (3.0.5), which is a major update to the code base that greatly extends its functionality. Results: BioJava now consists of several independent...

  • HotSwap for bioinformatics: A STRAP tutorial. Gille, Christoph; Robinson, Peter N // BMC Bioinformatics;2006, Vol. 7, p64

    Background: Bioinformatics applications are now routinely used to analyze large amounts of data. Application development often requires many cycles of optimization, compiling, and testing. Repeatedly loading large datasets can significantly slow down the development process. We have incorporated...

  • Construction of a Pig Physical Interactome Using Sequence Homology and a Comprehensive Reference Human Interactome. Dreher, Felix; Kamburov, Atanas; Herwig, Ralf // Evolutionary Bioinformatics;2012, Issue 8, p119 

    The analysis of interaction networks is crucial for understanding molecular function and has an essential impact for genome-wide studies. However, the interactomes of most species are largely incomplete and computational strategies that take into account sequence homology can help compensating...

  • Java GUI for InterProScan (JIPS): A tool to help process multiple InterProScans and perform ortholog analysis. Syed, Aijazuddin; Upton, Chris // BMC Bioinformatics;2006, Vol. 7, p462 

    Background: Recent, rapid growth in the quantity of available genomic data has generated many protein sequences that are not yet biochemically classified. Thus, the prediction of biochemical function based on structural motifs is an important task in post-genomic analysis. The InterPro databases...

