Unknown

Dataset Information

0

PowerBLAST: a new network BLAST application for interactive or automated sequence analysis and annotation.


ABSTRACT: As the rate of DNA sequencing increases, analysis by sequence similarity search will need to become much more efficient in terms of sensitivity, specificity, automation potential, and consistency in annotation. PowerBLAST was developed, in part, to address these problems. PowerBLAST includes a number of options for masking repetitive elements and low complexity subsequences. It also has the capacity to restrict the search to any level of NCBI's taxonomy index, thus supporting "comparative genomics" applications. Postprocessing of the BLAST output using the SIM series of algorithms produces optimal, gapped alignments, and multiple alignments when a region of the query sequence matches multiple database sequences. PowerBLAST is capable of processing sequences of any length because it divides long query sequences into overlapping fragments and then merges the results after searching. The results may be viewed graphically, as a textual representation, or as an HTML page with links to GenBank and Entrez. For matching database sequences, annotated features are superimposed on the aligned query sequence in the output, thus greatly increasing the ease of interpretation. Such features may be used for automated annotation of new sequence because PowerBLAST output in ASN.1 form may be "dragged and dropped" into NCBI's Sequin program for sequence annotation and submission. PowerBLAST is capable of analyzing and annotating a 100-kb query in 60 min on NCBI's BLAST server.

SUBMITTER: Zhang J 

PROVIDER: S-EPMC310664 | biostudies-literature | 1997 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

PowerBLAST: a new network BLAST application for interactive or automated sequence analysis and annotation.

Zhang J J   Madden T L TL  

Genome research 19970601 6


As the rate of DNA sequencing increases, analysis by sequence similarity search will need to become much more efficient in terms of sensitivity, specificity, automation potential, and consistency in annotation. PowerBLAST was developed, in part, to address these problems. PowerBLAST includes a number of options for masking repetitive elements and low complexity subsequences. It also has the capacity to restrict the search to any level of NCBI's taxonomy index, thus supporting "comparative genomi  ...[more]

Similar Datasets

| S-EPMC441540 | biostudies-literature
| S-EPMC8494211 | biostudies-literature
| S-EPMC168988 | biostudies-literature
| S-EPMC3476339 | biostudies-literature
| S-EPMC8974006 | biostudies-literature
| S-EPMC8062096 | biostudies-literature
| S-EPMC1538791 | biostudies-literature
| S-EPMC8860439 | biostudies-literature
| S-EPMC3840059 | biostudies-literature
| S-EPMC6916222 | biostudies-literature