Dataset Information

Grabseqs: simple downloading of reads and metadata from multiple next-generation sequencing data repositories.


ABSTRACT:

Summary

High-throughput sequencing is a powerful technique for addressing biological questions. Grabseqs streamlines access to publicly available metagenomic data by providing a single, easy-to-use interface for downloading data and metadata from multiple repositories, including the Sequence Read Archive (SRA), the Metagenomics Rapid Annotation through Subsystems Technology (MG-RAST) server and iMicrobe. With a single grabseqs command, users can download data and metadata in a standardized format for any number of samples or projects in a given repository.
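
As a rough illustration of the "single command per repository" idea, the sketch below assembles a grabseqs command line in Python without running it. The subcommand names (sra, mgrast, imicrobe) are assumptions taken from the project's README rather than from this abstract, the helper build_grabseqs_command is hypothetical, and the accession shown is a placeholder, not a real project.

```python
import shlex

# Assumed mapping from repository name to grabseqs subcommand.
# These subcommand names are not stated in the abstract; check
# `grabseqs --help` for the installed version before relying on them.
REPOSITORY_SUBCOMMANDS = {
    "sra": "sra",
    "mg-rast": "mgrast",
    "imicrobe": "imicrobe",
}


def build_grabseqs_command(repository, accessions):
    """Return an argv list for one grabseqs call (hypothetical helper).

    All accessions must belong to the same repository, mirroring the
    "given repository" wording of the abstract.
    """
    key = repository.strip().lower()
    if key not in REPOSITORY_SUBCOMMANDS:
        raise ValueError(f"unsupported repository: {repository!r}")
    if not accessions:
        raise ValueError("at least one sample or project accession is required")
    return ["grabseqs", REPOSITORY_SUBCOMMANDS[key], *accessions]


if __name__ == "__main__":
    # Placeholder accession for illustration only.
    command = build_grabseqs_command("sra", ["SRP0000000"])
    print(shlex.join(command))
    # To perform the download, pass the list to subprocess.run(command, check=True).
```

Building the argument list separately from executing it keeps the example safe to run and makes it easy to inspect the command before any data are downloaded.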

Availability and implementation

Grabseqs is an open-source tool implemented in Python and licensed under the MIT license. The source code is freely available at https://github.com/louiejtaylor/grabseqs, and the package is also distributed through the Python Package Index and the Anaconda Cloud repository.
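
A minimal sketch for checking whether the tool is already installed in the current environment is given below. It assumes that both the distribution name and the executable are called grabseqs, which is not stated explicitly in this record; locate_tool is a hypothetical helper that uses only the Python standard library.

```python
import shutil
from importlib import metadata


def locate_tool(name="grabseqs"):
    """Return (executable_path, package_version); either may be None.

    The default name is an assumption: the executable and the installed
    distribution are both presumed to be called "grabseqs".
    """
    # Look for an executable with this name on PATH.
    path = shutil.which(name)
    try:
        # Look up the installed distribution's version, if any.
        version = metadata.version(name)
    except metadata.PackageNotFoundError:
        version = None
    return path, version


if __name__ == "__main__":
    path, version = locate_tool()
    if path is None and version is None:
        print("grabseqs not found; install it from the Python Package Index or Anaconda Cloud")
    else:
        print(f"executable: {path}, version: {version}")
```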

Contact

bushman@pennmedicine.upenn.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Taylor LJ 

PROVIDER: S-EPMC7267817 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6748601 | biostudies-literature
| S-EPMC3493122 | biostudies-literature
| S-EPMC6580563 | biostudies-literature
| S-EPMC3532080 | biostudies-literature
| S-EPMC8277151 | biostudies-literature
| S-EPMC3031631 | biostudies-other
| S-EPMC6505635 | biostudies-literature
| S-EPMC4265526 | biostudies-literature
| S-EPMC3581251 | biostudies-literature
| S-EPMC3096631 | biostudies-literature