Unknown

Dataset Information

0

GENCODE 2021.


ABSTRACT: The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.

SUBMITTER: Frankish A 

PROVIDER: S-EPMC7778937 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

GENCODE 2021.

Frankish Adam A   Diekhans Mark M   Jungreis Irwin I   Lagarde Julien J   Loveland Jane E JE   Mudge Jonathan M JM   Sisu Cristina C   Wright James C JC   Armstrong Joel J   Barnes If I   Berry Andrew A   Bignell Alexandra A   Boix Carles C   Carbonell Sala Silvia S   Cunningham Fiona F   Di Domenico Tomás T   Donaldson Sarah S   Fiddes Ian T IT   García Girón Carlos C   Gonzalez Jose Manuel JM   Grego Tiago T   Hardy Matthew M   Hourlier Thibaut T   Howe Kevin L KL   Hunt Toby T   Izuogu Osagie G OG   Johnson Rory R   Martin Fergal J FJ   Martínez Laura L   Mohanan Shamika S   Muir Paul P   Navarro Fabio C P FCP   Parker Anne A   Pei Baikang B   Pozo Fernando F   Riera Ferriol Calvet FC   Ruffier Magali M   Schmitt Bianca M BM   Stapleton Eloise E   Suner Marie-Marthe MM   Sycheva Irina I   Uszczynska-Ratajczak Barbara B   Wolf Maxim Y MY   Xu Jinuri J   Yang Yucheng T YT   Yates Andrew A   Zerbino Daniel D   Zhang Yan Y   Choudhary Jyoti S JS   Gerstein Mark M   Guigó Roderic R   Hubbard Tim J P TJP   Kellis Manolis M   Paten Benedict B   Tress Michael L ML   Flicek Paul P  

Nucleic acids research 20210101 D1


The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure  ...[more]

Similar Datasets

| S-EPMC3491395 | biostudies-literature
| S-EPMC1810553 | biostudies-literature
| S-EPMC3137498 | biostudies-literature
| S-EPMC3431492 | biostudies-literature
2021-01-07 | GSE164352 | GEO
| S-EPMC4895710 | biostudies-literature
| S-EPMC9825462 | biostudies-literature
| S-EPMC5084464 | biostudies-literature
2021-11-06 | GSE188164 | GEO
| S-EPMC3431493 | biostudies-literature