Dataset Information

Collecting and managing taxonomic data with NCBI-taxonomist.


ABSTRACT:

Summary

We present NCBI-taxonomist, a command-line tool written in Python that collects and manages taxonomic data from the National Center for Biotechnology Information (NCBI). NCBI-taxonomist does not depend on a pre-downloaded taxonomic database, but it can store collected data locally. It provides six commands (map, collect, extract, resolve, import and group) that can be linked together to create analytical pipelines. Because many life science databases use the same taxonomic information, the data managed by NCBI-taxonomist is not limited to NCBI and can be used to find data linked to taxonomic information in other scientific databases.
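To make the idea of mapping and resolving taxonomic data concrete, the following is a minimal Python sketch of the underlying concept only. It is not NCBI-taxonomist's code or its interface: the `Taxon` record, the `LOCAL_TAXA` table, the taxids and the function names are all hypothetical and chosen for illustration. It assumes taxa are stored locally as records that each point to a parent, so that a name can be mapped to a taxid and a lineage can be resolved by following parent links.

```python
from typing import Dict, List, NamedTuple, Optional


class Taxon(NamedTuple):
    """Hypothetical local record for one taxon (not the tool's data model)."""
    taxid: int
    name: str
    rank: str
    parent: Optional[int]  # None marks the root of this toy tree


# Made-up records with made-up taxids; real lineages contain many more ranks.
LOCAL_TAXA: Dict[int, Taxon] = {
    100: Taxon(100, "example family", "family", None),
    110: Taxon(110, "example genus", "genus", 100),
    111: Taxon(111, "example species", "species", 110),
}


def map_name(name: str, taxa: Dict[int, Taxon]) -> Optional[int]:
    """Map a taxon name to its taxid, or return None if the name is unknown."""
    for taxon in taxa.values():
        if taxon.name == name:
            return taxon.taxid
    return None


def resolve_lineage(taxid: int, taxa: Dict[int, Taxon]) -> List[Taxon]:
    """Follow parent links from taxid up to the root, starting with the query taxon."""
    lineage: List[Taxon] = []
    seen = set()
    current: Optional[int] = taxid
    while current is not None:
        if current in seen:
            # Guard against corrupted data that contains a parent cycle.
            raise ValueError(f"cycle detected at taxid {current}")
        seen.add(current)
        taxon = taxa[current]  # raises KeyError if the taxid is not stored locally
        lineage.append(taxon)
        current = taxon.parent
    return lineage


if __name__ == "__main__":
    query = map_name("example species", LOCAL_TAXA)
    if query is not None:
        for taxon in resolve_lineage(query, LOCAL_TAXA):
            print(taxon.taxid, taxon.rank, taxon.name, sep="\t")
```

Chaining `map_name` into `resolve_lineage` mirrors, in a very reduced form, how the output of one step can feed the next in a pipeline.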

Availability and implementation

NCBI-taxonomist is implemented in Python 3 (≥3.8). It is available at https://gitlab.com/janpb/ncbi-taxonomist, via PyPI (https://pypi.org/project/ncbi-taxonomist/), as a Docker container (https://gitlab.com/janpb/ncbi-taxonomist/container_registry/) and as a Singularity (v3.5.3) image (https://cloud.sylabs.io/library/jpb/ncbi-taxonomist). NCBI-taxonomist is licensed under the GPLv3.

Supplementary information

https://ncbi-taxonomist.readthedocs.io/en/latest/.

SUBMITTER: Buchmann JP 

PROVIDER: S-EPMC8016462 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7849379 | biostudies-literature
| S-EPMC4702849 | biostudies-literature
| S-EPMC6978984 | biostudies-literature
| S-EPMC7523648 | biostudies-literature
| S-EPMC5943443 | biostudies-literature
| S-EPMC5467576 | biostudies-literature
| S-EPMC7017537 | biostudies-literature
| S-EPMC3323351 | biostudies-literature
| S-EPMC3531084 | biostudies-other
| S-EPMC11365159 | biostudies-literature