Unknown

Dataset Information

0

Generation and analysis of a mouse multi-tissue genome annotation atlas.


ABSTRACT: Generating an accurate and complete genome annotation for an organism is complex because the cells within each tissue can express a unique set of transcript isoforms from a unique set of genes. A comprehensive genome annotation should contain information on what tissues express what transcript isoforms at what level. This tissue-level isoform information can then inform a wide range of research questions as well as experiment designs. Long-read sequencing technology combined with advanced full-length cDNA library preparation methods has now achieved throughput and accuracy where generating these types of annotations is achievable. Here, we show this by generating a genome annotation of the mouse (Mus musculus). We used the nanopore-based R2C2 long-read sequencing method to generate 64 million highly accurate full length cDNA consensus reads - averaging 5.4 million reads per tissue for a dozen tissues. Using the Mandalorion tool we processed these reads to generate the Tissue-level Atlas of Mouse Isoforms (TAMI - available at https://genome.ucsc.edu/s/vollmers/TAMI) which we believe will be a valuable complement to conventional, manually curated reference genome annotations.

SUBMITTER: Adams M 

PROVIDER: S-EPMC10862843 | biostudies-literature | 2024 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Generation and analysis of a mouse multi-tissue genome annotation atlas.

Adams Matthew M   Vollmers Christopher C  

bioRxiv : the preprint server for biology 20240201


Generating an accurate and complete genome annotation for an organism is complex because the cells within each tissue can express a unique set of transcript isoforms from a unique set of genes. A comprehensive genome annotation should contain information on what tissues express what transcript isoforms at what level. This tissue-level isoform information can then inform a wide range of research questions as well as experiment designs. Long-read sequencing technology combined with advanced full-l  ...[more]

Similar Datasets

| S-EPMC11610592 | biostudies-literature
| S-EPMC3218658 | biostudies-literature
| S-EPMC5413951 | biostudies-literature
| S-EPMC5827353 | biostudies-literature
| S-EPMC7568261 | biostudies-literature
| S-EPMC4368115 | biostudies-literature
| S-EPMC7613894 | biostudies-literature
| PRJEB43879 | ENA
| S-EPMC2743715 | biostudies-literature
| S-EPMC3198151 | biostudies-literature