Proteomics

Dataset Information

0

Novel splicing and open reading frames revealed by long-read direct RNA sequencing of adenovirus transcripts


ABSTRACT: Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.

INSTRUMENT(S): Orbitrap Fusion

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Lung, Epithelial Cell

DISEASE(S): Disease Free

SUBMITTER: Richard Lauman  

LAB HEAD: Matthew Weitzman

PROVIDER: PXD034464 | Pride | 2022-08-30

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
200422_Ad5ProteinIsoformsStudy_DatabaseBuild.fasta Fasta
200514_Transition_List.csv Csv
200817_TransitionListforPRMMethod.csv Csv
20200902_RL_Ad530hr_PRM_1.raw Raw
20200902_RL_Ad530hr_PRM_2.raw Raw
Items per page:
1 - 5 of 25

Similar Datasets

2019-09-24 | GSE137883 | GEO
2010-12-08 | E-GEOD-24138 | biostudies-arrayexpress
2010-12-08 | GSE24138 | GEO
2013-02-19 | E-MTAB-1505 | biostudies-arrayexpress
2014-11-26 | E-MTAB-3122 | biostudies-arrayexpress
2014-12-16 | E-MTAB-3168 | biostudies-arrayexpress
2014-12-16 | E-MTAB-3169 | biostudies-arrayexpress
2016-04-28 | E-MTAB-4644 | biostudies-arrayexpress
2020-08-24 | GSE138421 | GEO
2020-08-24 | GSE138424 | GEO