Dataset Information

ABSTRACT: Haplotype-aware error correction of Simplex nanopore reads

PROVIDER: PRJNA1112538 | ENA

REPOSITORIES: ENA

Dataset's files

SRR29061597_1.fastq.gz	Fastqsanger.gz

Similar Datasets

ZymoBIOMICS Microbial Community DNA Standard (D6305, lot no. ZRC190633)

Project description:Nanopore consensus error correction of ribosomal RNA operons

PRJNA540296 | ENA

High throughput error correction using dual nucleotide dimer blocks allows direct single-cell nanopore transcriptome sequencing

Project description:Droplet-based single-cell sequencing techniques have provided unprecedented insight into cellular heterogeneities within tissues. However, these approaches only allow for the measurement of the distal parts of a transcript following short-read sequencing. Therefore, splicing and sequence diversity information is lost for the majority of the transcript. The application of long-read Nanopore sequencing to droplet-based methods is challenging because of the low base-calling accuracy currently associated with Nanopore sequencing. Although several approaches that use additional short-read sequencing to error-correct the barcode and UMI sequences have been developed, these techniques are limited by the requirement to sequence a library using both short- and long-read sequencing. Here we introduce a novel approach termed single-cell Barcode UMI Correction sequencing (scBUC-seq) to efficiently error-correct barcode and UMI oligonucleotide sequences synthesized using blocks of dimeric nucleotides. The method can be applied to correct both short-read and long-read sequencing data, allowing users to recover more reads per cell and permitting direct single-cell Nanopore sequencing for the first time. We illustrate our method by using species-mixing experiments to evaluate barcode assignment accuracy, multiple myeloma cell lines to evaluate differential isoform usage, and Ewing’s sarcoma cells to demonstrate Ig fusion transcript analysis.

2021-01-31 | GSE162053 | GEO

Project description:Error correction tool benchmark

PRJNA504496 | ENA

Project description:Haplotype-aware detection of SERPINA1 variants by nanopore sequencing

PRJNA1129428 | ENA

Multi-cell type deconvolution using a probabilistic model of single-molecule DNA methylation haplotypes

Project description:Deconvolution methods infer quantitative cell type estimates from bulk measurement of mixed samples including blood and tissue. DNA methylation sequencing measures multiple CpGs per read, but few existing deconvolution methods leverage this within-read information. We develop CelFiE-ISH, which extends an existing method (CelFiE) to use within-read haplotype information. CelFiE-ISH outperforms CelFiE and other existing methods, achieving 30% better accuracy and more sensitive detection of rare cell types. We also demonstrate the importance of marker selection and tailoring markers for haplotype-aware methods. While here we use gold-standard short-read sequencing data, haplotype-aware methods will be well-suited for long-read sequencing.

2023-08-02 | GSE239605 | GEO

ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data

Project description:Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.

2022-10-14 | GSE192955 | GEO

Proteomic profiling and characterization of differential allergens in the nematodes Anisakis simplex sensu stricto and A. pegreffii

Project description:The parasite species complex Anisakis simplex sensu lato (Anisakis simplex sensu stricto (A. simplex s.s.), A. pegreffii, A. simplex C) is the main cause of severe anisakiasis (allergy) worldwide and is now an important health concern. In this study, the relationship between this Anisakis species complex and its allergenic capacities is assessed by comparing the two most frequent species (A. simplex s.s., A. pegreffii) and their hybrid haplotype, using active L3 larvae parasitizing Merluccius merluccius.

2014-07-11 | PXD000662 | Pride

synthetic metagenome

Project description:NoDe: a fast error-correction algorithm for pyrosequencing reads

PRJNA257992 | ENA

Error-correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis

Project description:Error-correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis

PRJEB34849 | ENA

LoRTIS: A long read method for analysis of large transposon mutant libraries

Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype with phenotype. However, all TIS methods described to date use short nucleotide sequence reads, which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). The experiment data contain sequence files generated on the Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are FASTQ files generated with Nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are FASTQ files generated on the Illumina platform. In this study, we have compared the efficiency of the two methods in identifying transposon insertion sites.

2022-05-30 | E-MTAB-11351 | biostudies-arrayexpress
