Unknown

Dataset Information

0

BlaSTorage: a fast package to parse, manage and store BLAST results.


ABSTRACT:

Unlabelled

Background

Large-scale sequence studies requiring BLAST-based analysis produce huge amounts of data to be parsed. BLAST parsers are available, but they are often missing some important features, such as keeping all information from the raw BLAST output, allowing direct access to single results, and performing logical operations over them.

Findings

We implemented BlaSTorage, a Python package that parses multi BLAST results and returns them in a purpose-built object-database format. Unlike other BLAST parsers, BlaSTorage retains and stores all parts of BLAST results, including alignments, without loss of information; a complete API allows access to all the data components.

Conclusions

BlaSTorage shows comparable speed of more basic parser written in compiled languages as C++ and can be easily integrated into web applications or software pipelines.

SUBMITTER: Orsini M 

PROVIDER: S-EPMC3571973 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6544205 | biostudies-literature
| S-EPMC10914741 | biostudies-literature
| S-EPMC8236273 | biostudies-literature
| S-EPMC7884812 | biostudies-literature
| S-EPMC4851773 | biostudies-literature
| S-EPMC2962639 | biostudies-literature
| S-EPMC10428106 | biostudies-literature
| S-EPMC7214040 | biostudies-literature
| S-EPMC1522020 | biostudies-literature
| S-EPMC1933160 | biostudies-literature