Unknown

Dataset Information

0

GENCODE: reference annotation for the human and mouse genomes in 2023.


ABSTRACT: GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.

SUBMITTER: Frankish A 

PROVIDER: S-EPMC9825462 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

GENCODE: reference annotation for the human and mouse genomes in 2023.

Frankish Adam A   Carbonell-Sala Sílvia S   Diekhans Mark M   Jungreis Irwin I   Loveland Jane E JE   Mudge Jonathan M JM   Sisu Cristina C   Wright James C JC   Arnan Carme C   Barnes If I   Banerjee Abhimanyu A   Bennett Ruth R   Berry Andrew A   Bignell Alexandra A   Boix Carles C   Calvet Ferriol F   Cerdán-Vélez Daniel D   Cunningham Fiona F   Davidson Claire C   Donaldson Sarah S   Dursun Cagatay C   Fatima Reham R   Giorgetti Stefano S   Giron Carlos Garcıa CG   Gonzalez Jose Manuel JM   Hardy Matthew M   Harrison Peter W PW   Hourlier Thibaut T   Hollis Zoe Z   Hunt Toby T   James Benjamin B   Jiang Yunzhe Y   Johnson Rory R   Kay Mike M   Lagarde Julien J   Martin Fergal J FJ   Gómez Laura Martínez LM   Nair Surag S   Ni Pengyu P   Pozo Fernando F   Ramalingam Vivek V   Ruffier Magali M   Schmitt Bianca M BM   Schreiber Jacob M JM   Steed Emily E   Suner Marie-Marthe MM   Sumathipala Dulika D   Sycheva Irina I   Uszczynska-Ratajczak Barbara B   Wass Elizabeth E   Yang Yucheng T YT   Yates Andrew A   Zafrulla Zahoor Z   Choudhary Jyoti S JS   Gerstein Mark M   Guigo Roderic R   Hubbard Tim J P TJP   Kellis Manolis M   Kundaje Anshul A   Paten Benedict B   Tress Michael L ML   Flicek Paul P  

Nucleic acids research 20230101 D1


GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Her  ...[more]

Similar Datasets

| S-EPMC3431492 | biostudies-literature
| S-EPMC1810553 | biostudies-literature
| S-EPMC4895710 | biostudies-literature
| S-EPMC4502323 | biostudies-literature
| S-EPMC7140576 | biostudies-literature
| S-EPMC1534038 | biostudies-literature
| S-EPMC6364042 | biostudies-literature
| S-EPMC4602055 | biostudies-literature
| S-EPMC7265644 | biostudies-literature
| S-EPMC3137498 | biostudies-literature