Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

ABSTRACT: FDA-ARGOS is a database with public quality-controlled reference genomes for diagnostic use and regulatory science

PROVIDER: PRJNA231221 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other
		Other
	SRR10017400_1.fastq.gz	Fastqsanger.gz
	SRR10017400_2.fastq.gz	Fastqsanger.gz
	SRR10027400_1.fastq.gz	Fastqsanger.gz

Items per page:

1 - 5 of 5499

Similar Datasets

Project description:FDA ARGOS Manuscript Use Case Reference Data Sets

| PRJNA495928 | ENA

Bee gut bacteria

Project description:Reference database genomes of bee gut bacteria

| PRJNA471810 | ENA

Project description:Mycobacterium tuberculosis reference-quality clinical genomes

| PRJNA555636 | ENA

CCPRD: A novel analytical framework for comprehensive proteomic reference database construction of non-model organisms

Project description:Protein reference databases are a critical part of producing efficient proteomic analyses. However, the method for constructing clean, efficient, and comprehensive protein reference databases is lacking. Existing methods either do not have contamination control procedures, or these methods rely on a three-frame and/or six-frame translation that sharply increases the search space and harms MS results. Herein we propose a framework for constructing a customized comprehensive proteomic reference database (CCPRD) from draft genomes and deep sequencing transcriptomes. Its effectiveness is demonstrated by incorporating the proteomes of nematocysts from endoparasitic cnidarian: myxozoans. By applying customized contamination removal procedures, contaminations in omic data were successfully identified and removed. This is an effective method that does not result in over-decontamination. This can be shown by comparing the CCPRD MS results with an artificially-contaminated database and another database with removed contaminations in genomes and transcriptomes added back. CCPRD outperformed traditional frame-based methods by identifying 35.2%-50.7% more peptides and 35.8%-43.8% more proteins, with a maximum 84.6% in size reduction. A BUSCO analysis showed that the CCPRD maintained a relatively high level of completeness compared to traditional methods. These results confirm the superiority of the CCPRD over existing methods in peptide and protein identification numbers, database size, and completeness. By providing a general framework for generating the reference database, the CCPRD, which does not need a high-quality genome, can potentially be applied to any organisms and significantly contribute to proteomic research.

2020-07-09 | PXD018851 | Pride

Project description:High quality de novo assembled reference genomes

| PRJNA1250540 | ENA

ExpressionData - A public resource of high quality datasets representing gene expression across tissues, conditions, diseases and genotypes.

Project description:Reference datasets are often used to compare, interpret or validate experimental data and analytical methods. In the field of gene expression, a dozen reference datasets have been published. Typically, they consist of individual baseline or spike-in experiments carried out in a single laboratory and representing a particular set of conditions. For most organisms, however, few or no such reference datasets are publicly available. Here, we describe a new type of datasets highly representative for the spatial, temporal and response dimensions of gene expression. They result from integrating expression data from a large number of globally normalized and quality controlled public experiments and aggregating results by anatomical parts, stages of development, perturbations, drugs, diseases, neoplasms, and genotypes. The proposed datasets were created for human and several model organisms and are publicly available at www.expressiondata.org.

2013-05-15 | GSE44938 | GEO

Cultivated Genome References for protein database construction and high-resolution taxonomic annotation in metaproteomics

Project description:We constructed a protein database (DBCGR2) for gut microbiome metaproteomics, which was based on a database of cultivated genomes (Cultivated Genome Reference 2 - CGR2).

2024-12-06 | PXD053977 | Pride

Analysis of differential gene expression and alternative splicing is significantly influenced by choice of mapping genome

Project description:Purpose: To demonstrate that gene expression and splicing analysis varies considerably depending on the mapping reference genome. Methods: We mapped and analyzed submitted RNA reads using different tools and reference genomes to evaluate the influence of genome on DEG and alternative splicing tools. Results: We observed that these differences in transcriptome analysis are, in part, due to the presence of single nucleotide polymorphisms between the sequenced individual and each respective reference genome, as well as annotation differences between the reference genomes that exist even between syntenic orthologs. Conclusion: We conclude that even between two closely related genomes of similar quality, using the reference genome that is most closely related to the species being sampled significantly improves transcriptome.

2019-02-16 | GSE126650 | GEO

Project description:MOT database genomes - II

| PRJNA643572 | ENA

Comparative analysis of human protein-coding and noncoding RNAs between brain and various cell lines by RNA-Seq

Project description:These two transcriptome sequencing datasets were generated from two reference RNA samples established by the US FDA-led MicroArray Quality Control project with Illumina next-generation sequencing technology. The reference RNA sample A (UHRR, Catalog #740000) consists of total RNA extracted from 10 human cell lines of various origins: Blymphocyte, brain, breast, cervix, liposarcoma, liver, macrophage, skin, testis and Tlymphocyte. Equal quantities of DNAase-treated total RNA from each cell line were pooled to generate the UHRR. The reference RNA sample B (HBRR, Catalog #6050) consists of total RNA extracted from several regions of the brains from 23 adult donors.

2011-12-12 | GSE30250 | GEO

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data