Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Metagenomics pipelines visualisation

ABSTRACT: Benchmarking of metagenomics pipelines

PROVIDER: PRJEB39393 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
	ERR4343571.fastq.gz	Fastqsanger.gz
	ERR4343857.fastq.gz	Fastqsanger.gz
	ERR4343858.fastq.gz	Fastqsanger.gz

Items per page:

1 - 3 of 3

Similar Datasets

Benchmarking of 4C-seq pipelines based on real and simulated data

Project description:Benchmarking of 4C-seq pipelines based on real and simulated data

| PRJNA507614 | ENA

soils surrounded buried oil pipelines

Project description:soils surrounded buried oil pipelines Raw sequence reads

| PRJNA414370 | ENA

Benchmarking of 4C-seq pipelines based on real and simulated data

Project description:With its capacity for high-resolution data output in one region of interest, chromosome conformation capture combined with high-throughput sequencing (4C-seq) is a state-of-the-art next-generation sequencing technique that provides epigenetic insights, and regularly advances current medical research. However, 4C-seq data is complex and prone to biases, and while specialized programs exist, an unbiased, extensive benchmarking is still lacking. Furthermore, neither substantial datasets with fully characterized ground truth, nor simulation programs for realistic 4C-seq data have been published. We conducted a benchmarking study on 54 4C-seq samples from 12 datasets, including original murine BMM, T-cell, and 416B data, and developed a novel 4C-seq simulation software to allow for more detailed comparisons of 4C-seq algorithms on 50 simulated datasets with 10 to 120 samples each.

2019-05-29 | GSE123131 | GEO

Systematic comparison of RNA-seq pipelines for absolute and relative gene expression quantification

Project description:At present, it is admitted that RNA-seq is a more powerful and adaptable technique than hybridization arrays. Nevertheless, as RNA-seq needs a more complex data analysis, it has generated a lot of research on algorithms and workflows. This has resulted in an exponential increase of the options at each step of the analysis. Consequently, there is no clear consensus on the appropriate algorithms and pipelines that should be used to analyse RNA-seq data. In the present study, 192 pipelines on 18 samples from 2 human cell lines were evaluated. Absolute gene expression quantification was assessed by non-parametric statistics to measure precision and accuracy. Relative gene expression performance was estimated testing 19 differential expression methods. These results were contrasted in parallel with the microarray HTA 2.0 data from Affymetrix using the same set of samples. All procedures were validated by qRT-PCR on 32 genes in all samples. In addition, this study proposes a new statistical approach for precision and accuracy evaluation on real RNA-seq data. It also weights up the advantages and disadvantages of the algorithms and pipelines tested and gives a guide to select the appropriate pipeline to analyse RNA-seq and microarray data.

2021-02-28 | GSE116291 | GEO

A Metagenomics Analysis of Pandemic 2009 H1N1 Infection in Patients in North America

Project description:The Virochip microarray (version 4.0) was used to detect viruses in patients from North America with unexplained influenza-like illness at the onset of the 2009 H1N1 pandemic. We used metagenomics-based technologies (the Virochip microarray) and deep sequencing to analyze nasal swab samples from individuals with 2009 H1N1 infection. This Series includes the Virochip microarray data only.

2010-11-01 | E-GEOD-24034 | biostudies-arrayexpress

Metagenomics

Project description:Metagenomics

| PRJEB105600 | ENA

Assessment of Common and Emerging Bioinformatics Pipelines for Targeted Metagenomics.

Project description:Targeted metagenomics, also known as metagenetics, is a high-throughput sequencing application focusing on a nucleotide target in a microbiome to describe its taxonomic content. A wide range of bioinformatics pipelines are available to analyze sequencing outputs, and the choice of an appropriate tool is crucial and not trivial. No standard evaluation method exists for estimating the accuracy of a pipeline for targeted metagenomics analyses. This article proposes an evaluation protocol containing real and simulated targeted metagenomics datasets, and adequate metrics allowing us to study the impact of different variables on the biological interpretation of results. This protocol was used to compare six different bioinformatics pipelines in the basic user context: Three common ones (mothur, QIIME and BMP) based on a clustering-first approach and three emerging ones (Kraken, CLARK and One Codex) using an assignment-first approach. This study surprisingly reveals that the effect of sequencing errors has a bigger impact on the results that choosing different amplified regions. Moreover, increasing sequencing throughput increases richness overestimation, even more so for microbiota of high complexity. Finally, the choice of the reference database has a bigger impact on richness estimation for clustering-first pipelines, and on correct taxa identification for assignment-first pipelines. Using emerging assignment-first pipelines is a valid approach for targeted metagenomics analyses, with a quality of results comparable to popular clustering-first pipelines, even with an error-prone sequencing technology like Ion Torrent. However, those pipelines are highly sensitive to the quality of databases and their annotations, which makes clustering-first pipelines still the only reliable approach for studying microbiomes that are not well described.

| S-EPMC5215245 | biostudies-literature

Benchmarking second and third-generation sequencing platforms for microbial metagenomics

Project description:Benchmarking second and third-generation sequencing platforms for microbial metagenomics

| PRJEB52977 | ENA

Metagenomics

Project description:Soil metagenomics

| PRJEB15193 | ENA

Mock community taxonomic classification performance of publicly available shotgun metagenomics pipelines.

Project description:Shotgun metagenomic sequencing comprehensively samples the DNA of a microbial sample. Choosing the best bioinformatics processing package can be daunting due to the wide variety of tools available. Here, we assessed publicly available shotgun metagenomics processing packages/pipelines including bioBakery, Just a Microbiology System (JAMS), Whole metaGenome Sequence Assembly V2 (WGSA2), and Woltka using 19 publicly available mock community samples and a set of five constructed pathogenic gut microbiome samples. Also included is a workflow for labelling bacterial scientific names with NCBI taxonomy identifiers for better resolution in assessing results. The Aitchison distance, a sensitivity metric, and total False Positive Relative Abundance were used for accuracy assessments for all pipelines and mock samples. Overall, bioBakery4 performed the best with most of the accuracy metrics, while JAMS and WGSA2, had the highest sensitivities. Furthermore, bioBakery is commonly used and only requires a basic knowledge of command line usage. This work provides an unbiased assessment of shotgun metagenomics packages and presents results assessing the performance of the packages using mock community sequence data.

| S-EPMC10794705 | biostudies-literature

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data