Unknown

Dataset Information

0

BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data.


ABSTRACT: BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. RESULTS: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. CONCLUSION: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations.

SUBMITTER: Neumann RS 

PROVIDER: S-EPMC4062517 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data.

Neumann Ralf Stefan RS   Kumar Surendra S   Haverkamp Thomas Hendricus Augustus TH   Shalchian-Tabrizi Kamran K  

BMC bioinformatics 20140505


<h4>Background</h4>Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes  ...[more]

Similar Datasets

| S-EPMC6321710 | biostudies-literature
| S-EPMC168908 | biostudies-literature
| S-EPMC5838108 | biostudies-literature
| S-EPMC416473 | biostudies-literature
| S-EPMC7423749 | biostudies-literature
| S-EPMC10187674 | biostudies-literature
| S-EPMC3001099 | biostudies-literature
| S-EPMC1538791 | biostudies-literature
| S-EPMC5860071 | biostudies-literature
| S-EPMC6364044 | biostudies-literature