Ontology highlight
ABSTRACT:
SUBMITTER: Li W
PROVIDER: S-EPMC7779008 | biostudies-literature | 2021 Jan
REPOSITORIES: biostudies-literature
Li Wenjun W O'Neill Kathleen R KR Haft Daniel H DH DiCuccio Michael M Chetvernin Vyacheslav V Badretdin Azat A Coulouris George G Chitsaz Farideh F Derbyshire Myra K MK Durkin A Scott AS Gonzales Noreen R NR Gwadz Marc M Lanczycki Christopher J CJ Song James S JS Thanki Narmada N Wang Jiyao J Yamashita Roxanne A RA Yang Mingzhang M Zheng Chanjuan C Marchler-Bauer Aron A Thibaud-Nissen Françoise F
Nucleic acids research 20210101 D1
The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) contains nearly 200 000 bacterial and archaeal genomes and 150 million proteins with up-to-date annotation. Changes in the Prokaryotic Genome Annotation Pipeline (PGAP) since 2018 have resulted in a substantial reduction in spurious annotation. The hierarchical collection of protein family models (PFMs) used by PGAP as evidence for structural and functional annotation was expanded to over 35 000 p ...[more]