Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Hominoidea

ABSTRACT: Long- and short-read RNA sequencing for ape genome annotation

PROVIDER: PRJNA1016395 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
	SRR27178646_subreads.fastq.gz	Fastqsanger.gz
	SRR27178647_1.fastq.gz	Fastqsanger.gz
	SRR27178647_2.fastq.gz	Fastqsanger.gz
	SRR27178648_subreads.fastq.gz	Fastqsanger.gz
	SRR27178649_subreads.fastq.gz	Fastqsanger.gz

Items per page:

1 - 5 of 38

Similar Datasets

Proteome of the snow alga Chloromonas typhlos

Project description:a chromosome-level nuclear genome and organelle genomes of the alpine snow alga Chloromonas typhlos were sequenced and assembled by integrating short- and long-read sequencing and proteogenomic strategy

2024-10-12 | PXD056928 |

Novel splicing and open reading frames revealed by long-read direct RNA sequencing of adenovirus transcripts

Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.

2022-08-30 | PXD034464 | Pride

Multi-omic profiling of pathogen-stimulated primary immune cells

Project description:Objectives: To perform long-read transcriptome and proteome profiling of pathogen-stimulated peripheral blood mononuclear cells (PBMCs) from healthy donors. We aim to discover new transcripts and protein isoforms expressed during immune responses to diverse pathogens. Methods: PBMCs were exposed to four microbial stimuli for 24 hours: the TLR4 ligand lipopolysaccharide (LPS), the TLR3 ligand Poly(I:C), heat-inactivated Staphylococcus aureus, Candida albicans, and RPMI medium as negative controls. Long-read sequencing (PacBio) of one donor and secretome proteomics and short-read sequencing of five donors were performed. IsoQuant was used for transcriptome construction, Metamorpheus/FlashLFQ for proteome analysis, and Illumina short-read 3’-end mRNA sequencing for transcript quantification. Results: Long-read transcriptome profiling reveals the expression of novel sequences and isoform switching induced upon pathogen stimulation, including transcripts that are difficult to detect using traditional short-read sequencing. We observe widespread loss of intron retention as a common result of all pathogen stimulations. We highlight novel transcripts of NFKB1 and CASP1 that may indicate novel immunological mechanisms. In general, RNA expression differences did not result in differences in the amounts of secreted proteins. Interindividual differences in the proteome were larger than the differences between stimulated and unstimulated PBMCs. Clustering analysis of secreted proteins revealed a correlation between chemokine (receptor) expression on the RNA and protein levels in C. albicans- and Poly(I:C)-stimulated PBMCs. Conclusion: Isoform aware long-read sequencing of pathogen-stimulated immune cells highlights the potential of these methods to identify novel transcripts, revealing a more complex transcriptome landscape than previously appreciated.

2023-09-16 | PXD045237 | Pride

Identification of the molecular basis of anti-cancer effect of Huaier.

Project description:Intervention type:DRUG. Intervention1:Huaier, Dose form:GRANULES, Route of administration:ORAL, intended dose regimen:20 to 60/day by either bulk or split for 3 months to extended term if necessary. Control intervention1:None. Primary outcome(s): For mRNA libraries, focus on mRNA studies. Data analysis includes sequencing data processing and basic sequencing data quality control, prediction of new transcripts, differential expression analysis of genes. Gene Ontology (GO) and the KEGG pathway database are used for annotation and enrichment analysis of up-regulated genes and down-regulated genes. For small RNA libraries, data analysis includes sequencing data process and sequencing data process QC, small RNA distribution across the genome, rRNA, tRNA, alignment with snRNA and snoRNA, construction of known miRNA expression pattern, prediction New miRNA and Study of their secondary structure Based on the expression pattern of miRNA, we perform not only GO / KEGG annotation and enrichment, but also different expression analysis.. Timepoint:RNA sequencing of 240 blood samples of 80 cases and its analysis, scheduled from June 30, 2022..

| 2612481 | ecrin-mdr-crc

HERV Expression Profile in NCCIT

Project description:Provide a comprehensive picture of HERV RNA expression through both short and long read sequencing in NCCIT cells to be used in an integrated proteogenomic analysis pipeline

2025-01-08 | PXD054413 | Pride

High resolution annotation of Zebrafish transcriptome using long-read sequencing

Project description:With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited by gaps in zebrafish annotation, which is especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the early ZGA-stage zebrafish transcriptome. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2748 high confidence novel transcripts that originated from previously unannotated loci and 1835 new isoforms in previously annotated genes.

2018-05-24 | GSE101843 | GEO

Detection of known and novel small proteins in Pseudomonas stutzeri using a combination of bottom-up and top-down proteomics and proteogenomics

Project description:Here we performed a comprehensive genomic and proteomics analysis of P. stutzeri in aerobic and oxygen-limiting conditions. We combined de novo genome assembly relying on 3rd generation long read sequencing technologies to report the first complete P. stutzeri ATCC14405 genome, which added over 110 kb of sequence and contains 126 full length CDS that were only partially covered in the fragmented short read-based genome assembly available for this strain. With this optimal basis for downstream functional genomics, we next carried out state of the art bottom-up and top-down proteomics analyses to report the most detailed study of proteome remodeling in response to oxygen limitation in P. stutzeri. We identified more than 2900 proteins, i.e. greater than 70% of the theoretical proteome, including 160 annotated small proteins. The proteins included well-established enzymes involved in denitrification and metabolic adaptation to oxygen-limiting conditions, as well as uncharacterized proteins. Notably, we identified 16 novel small proteins that had so far been missed in the genome annotation.

2023-08-04 | PXD037914 | Pride

A nanopore long-read transcriptome in pancreatic cancer cells

Project description:This project aims to leverage Oxford Nanopore Technologies (ONT) long-read RNA sequencing to achieve a comprehensive analysis of the human pancreatic cancer transcriptome. Traditional short-read sequencing methods often struggle with accurately reconstructing full-length transcripts and discerning complex splicing events due to their limited read lengths. In contrast, ONT's long-read sequencing can generate reads that span entire RNA molecules, facilitating precise identification of transcript isoforms, alternative splicing patterns, and poly(A) tail length. By applying this technology, we seek to enhance the annotation of the pancreatic cancer transcriptome, uncover novel transcripts, and gain deeper insights into gene expression dynamics. The findings from this study have the potential to advance our understanding of gene regulation and contribute to the development of novel therapeutic strategies.

2025-07-23 | GSE293661 | GEO

Chimpanzee, orangutan, and human genome assemblies

Project description:Sequence and assembly of great-ape genomes including annotation and comparative analyses using long- and short-read sequencing modalities.

| PRJNA369439 | ENA

Long-read transcriptomics of a diverse human cohort reveals widespread ancestry bias in gene annotation

Project description:Understanding gene expression diversity across human populations is essential for accurate genome annotation and disease interpretation. However, existing annotations are primarily based on European-derived transcriptomic data, potentially limiting their applicability to other populations. This study aims to assess population-specific transcript diversity and its impact on gene annotation. To achieve this, we performed long-read RNA sequencing on lymphoblastoid cell lines from 43 individuals across eight globally diverse populations. Our workflow included RNA extraction, cDNA synthesis, and sequencing using Oxford Nanopore long-read technology, followed by transcript assembly and comparison with existing gene annotations. We also integrated novel transcripts into reference annotations to evaluate their effect on allele-specific transcript usage detection. This work provides a critical step toward improving transcriptome annotation across diverse populations, ensuring a more comprehensive representation of human genetic variation.

2025-03-13 | E-MTAB-14935 | biostudies-arrayexpress

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data