Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Reference genome assemblies for umbrella project PRJEB43248

ABSTRACT: Reference genome assemblies for umbrella project PRJEB43248

PROVIDER: PRJEB43239 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Similar Datasets

Maximum haplotig genome assemblies for umbrella project PRJEB43248

Project description:Maximum haplotig genome assemblies for umbrella project PRJEB43248

| PRJEB43238 | ENA

Umbrella Project name

Project description:Umbrella Project

| PRJEB44968 | ENA

Anopheles Reference Genomes Project

Project description:Anopheles Reference Genomes Project: Data and assemblies

| PRJEB51690 | ENA

Vet-LIRN-Pathogen Umbrella Project

Project description:Vet-LIRN-Pathogen Umbrella Project

| PRJNA314609 | ENA

Clostridium difficile Ribotype Reference Genome Project

Project description:Clostridium difficile Ribotype Reference Genome Project

| PRJEB2101 | ENA

Discordant genome assemblies drastically alter the interpretation of single cell RNA sequencing data which can be mitigated by a novel integration method

Project description:Advances in sequencing and assembly technology has led to the creation of genome assemblies for a wide variety of non-model organisms. The rapid production and proliferation of updated, novel assembly versions can create create vexing problems for researchers when multiple genome as-sembly versions are available at once, requiring researchers to work with more than one reference genome. Multiple genome assemblies are especially problematic for researchers studying the genetic makeup of individual cells as single cell RNA sequencing (scRNAseq) requires sequenced reads to be mapped and aligned to a single reference genome. Using the Astyanax mexicanus this study highlights how the interpretation of a single cell dataset from the same sample changes when aligned to its two different available genome assemblies. We found that the number of cells and expressed genes detected were drastically different when aligning to the different assemblies. When the genome assemblies were used in isolation with their respective annotation, cell type identification was confounded as some classic cell type markers were assembly-specific, whilst other genes showed differential patterns of expression between the two assemblies. To overcome the problems posed by multiple genome assemblies, we propose that researchers align to each available assembly and then integrate the resultant datasets to produce a final dataset in which all genome alignments can be used simultaneously. We found this approach increased the accuracy of cell type identification and maximised the amount of data that could be extracted from our single cell sample by capturing all possible cells and transcripts. As scRNAseq becomes more widely available, it is imperative that the single cell community is aware how genome assembly alignment can alter single cell data and its interpretation, especially when reviewing studies on non-model organisms.

2022-02-10 | GSE194093 | GEO

Reference genome for Global Pneumococcal Sequence (GPS ) project

Project description:Reference genome for Global Pneumococcal Sequence (GPS ) project

| PRJEB31141 | ENA

ESGLI Legionella data umbrella

Project description:ESGLI Legionella data collection umbrella project

| PRJEB14408 | ENA

Brassica napus Genome Assemblies

Project description:Brassica napus Genome Assemblies

| PRJEB79568 | ENA

The BLUEPRINT Murine Lymphocyte Epigenome Reference Resource. [Whole Genome Bisulfite-Seq_OX]

Project description:This data release is part of the EU BLUEPRINT reference epignome project (http://www.blueprint-epigenome.eu/) and consists of genome wide gene expression, histone modifications and DNA methylation profiles from murine Naive CD4+ T-cells and Resting B cells.

2017-12-15 | GSE94675 | GEO

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data