
Dataset Information


The JCVI standard operating procedure for annotating prokaryotic metagenomic shotgun sequencing data.


ABSTRACT: The JCVI metagenomics analysis pipeline provides for the efficient and consistent annotation of shotgun metagenomics sequencing data for sampling communities of prokaryotic organisms. The process can be equally applied to individual sequence reads from traditional Sanger capillary electrophoresis sequences, newer technologies such as 454 pyrosequencing, or sequence assemblies derived from one or more of these data types. It includes the analysis of both coding and non-coding genes, whether full-length or, as is often the case for shotgun metagenomics, fragmentary. The system is designed to provide the best-supported conservative functional annotation based on a combination of trusted homology-based scientific evidence and computational assertions and an annotation value hierarchy established through extensive manual curation. The functional annotation attributes assigned by this system include gene name, gene symbol, GO terms, EC numbers, and JCVI functional role categories.

SUBMITTER: Tanenbaum DM 

PROVIDER: S-EPMC3035284 | biostudies-literature | 2010 Mar

REPOSITORIES: biostudies-literature


Publications

The JCVI standard operating procedure for annotating prokaryotic metagenomic shotgun sequencing data.

Tanenbaum David M, Goll Johannes, Murphy Sean, Kumar Prateek, Zafar Nikhat, Thiagarajan Mathangi, Madupu Ramana, Davidsen Tanja, Kagan Leonid, Kravitz Saul, Rusch Douglas B, Yooseph Shibu

Standards in Genomic Sciences, 2010-03-30, 2



Similar Datasets

| S-EPMC3111993 | biostudies-literature
| S-EPMC6450397 | biostudies-literature
| S-EPMC8256542 | biostudies-literature
| S-EPMC6558284 | biostudies-literature
| S-EPMC7305406 | biostudies-literature
2021-07-26 | E-MTAB-9189 | biostudies-arrayexpress
2021-07-26 | E-MTAB-9191 | biostudies-arrayexpress
| S-EPMC2655764 | biostudies-literature
| S-EPMC7470745 | biostudies-literature
| S-EPMC6883603 | biostudies-literature