Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

RefSeq

ABSTRACT: NCBI Reference Sequence Database

PROVIDER: PRJEB6684 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Similar Datasets

RefSeq Targeted Loci in fungi

Project description:Collection of targeted loci created at NCBI relevant to fungi classification

| PRJNA41209 | ENA

5-hydroxytryptamine synthesized in the Aorta-Gonad-Mesonephros regulates hematopoietic stem and progenitor cell survival

Project description:Whole-genome gene expression analysis was performed using the AGMs from two littermate pairs of tph2+/+, tph2+/- or tph2-/- at E11.5. The total RNA was extracted using TRNzol (Tiangen, Beijing) and cDNA samples were sequenced using Illumina HiSeq3000 (RiboBio Co. Ltd, Guangzhou). The reference Mus musculus genome and gene information were downloaded from the National Center for Biotechnology Information Database (NCBI: http://www.ncbi.nlm.nih.gov/).

2016-12-19 | GSE90130 | GEO

Temperature-dependent Proteome Study of Thermoanaerobacter tengcongensis

Project description:Comprehensive and quantitative information of the thermophile proteome is an important source for understanding of the survival mechanism under high growth temperature. Thermoanaerobacter tengcongensis (T. tengcongensis), a typical anaerobic thermophilic eubacterium, was selected to quantitatively evaluate its protein abundance changes in response to four different temperatures. Thermoanaerobacter tengcongensis proteins were trypsine digested, separated with high-pH RP, and identified with MS/MS analysis. The raw MS/MS data were converted into MGF format by Proteome Discoverer 1.2 (Thermo Fisher Scientific, Waltham, MA, USA). And the exported MGF files were searched by Mascot 2.3.02 (Matrix Science, Boston, MA, USA) against the database with all 2588 predicted proteins in T. tengcongensis downloaded from NCBI (NCBI reference sequence: NC_003869.1). An automatic decoy database search was performed. Several parameters in Mascot were set for peptide searching, tolerance of one missed cleavage of trypsin, Carbamidomethyl (C),iTRAQ8plex (K) and iTRAQ8plex (N-term) as fixed modification, iTRAQ8plex (Y),Oxidation (M) as variable modification. The precursor mass tolerance was 10 ppm, and the product ion tolerance was 0.02 Da.

2013-08-13 | PXD000264 | Pride

Small RNA sequencing in mouse cardiomyocytes after siRNA mediated knockdown of Srf

Project description:We performed two independent siRNA mediated knockdowns of Srf (Srf si1 & Srf si2) and an unspecific siRNA (siNon) in mouse cardiomyocytes HL-1 cells. Small RNAs were sequenced by Illumina/Solexa next-generation (single-end) sequencing technology. The sequence reads were mapped to the mouse reference genome (NCBI v37, mm9) using MicroRazerS. MicroRazerS searches for the longest possible prefix-match of each read, i.e. the longest possible contiguous match starting at the first base. Hence, it is robust to possible adapter sequence at the 3' end of a read and requires no adapter trimming.

2011-01-03 | GSE19066 | GEO

Transcriptome analysis and molecular marker discovery in red pepper, Capsicum annuum L. TF68

Project description:Transcriptome from high throughput sequencing-by-synthesis is a good resource of molecular markers. In this study, we present utility of massively parallel sequencing by synthesis for profiling the transcriptome of red pepper (Capsicum annuum L. TF68) by 454 GS-FLX pyrosequencing. Through the generation of approximately 30.63 megabases (Mb) of Expressed Sequence Tags (ESTs) data with the average length of 375 base pairs (bp), 9,818 contigs and 23,712 singletons were obtained by assembly. Using BLAST alignment against NCBI non-redundant and a UniProt protein database, 30% of the tentative consensus sequences were assigned to specific function annotation, while 24% returned alignments of unknown function, leaving up to 46% with no alignment. Functional classification using FunCat revealed that sequences with putative known function were distributed cross 18 categories. Furthermore, over 200 high quality single nucleotide discrepancies were discovered using the Bukang cDNA collection as a reference database. Moreover, 758 simple sequence repeat (SSR) motif loci were mined from over 600 contigs, from which 572 primer sets were designed. The SSR motifs corresponded to di- and tri- nucleotide motifs (27.03 and 61.92%, respectively). These molecular markers may be of great value for application in linkage mapping and association mapping research.

2011-09-08 | GSE29215 | GEO

Sparus aurata transcriptomic data comparing long-term starvation and feeding with three diets differing in the macronutrient composition

Project description:Agilent custom-made 8x60k 60-mer oligonucleotide microarray slides were used, covering unique sequences (21,093 annotated sequences and 4,299 unannotated sequences) of Sparus aurata transcriptomic data obtained from assembling of reads deposited in the NCBI SRA database corresponding to BioProject ID PRJNA391557, together with controls (8 transcripts of Sparus aurata plus Agilent controls). Two probes per sequence were used (in sense orientation for annotated sequences).

2017-10-31 | GSE100744 | GEO

CpG island mediated linear and spatial gene partitioning (ChIP-seq)

Project description:In order to elucidate the general rules for gene localization and regulation mediated by CpG islands, we reanalyzed published ChIP-seq data of CXXC domain, H3K9me3, KDM2A, SUV39H1, ATF4, MYBL1, MYOD1, SPI1, and CTCF. Raw data were downloaded from Sequence Read Archive (SRA) in National Center for Biotechnology Information (NCBI) database. FASTQ files were extracted with the SRA Toolkit version 2.5.5 and aligned using Bowtie 2.2.5 onto the mouse and human genome (mm9 and hg19, respectively). For the identification of factor binding sites, model-based analysis for ChIP-seq peak caller (MACS 1.4.2) was used with a p-value cutoff of 1e-5.

2018-02-15 | GSE80791 | GEO

EcoGene-RefSeq: EcoGene tools applied to the RefSeq prokaryotic genomes.

Project description:SummaryEcoGene.org is a genome database and website dedicated to Escherichia coli K-12 substrain MG1655 that is revised daily using information derived from the biomedical literature and in-house analysis. EcoGene is a major source of annotation updates for the MG1655 Genbank record, one of only a few Genbank genome records that are updated by a community effort. The Reference Sequence (RefSeq) database, built by The National Center for Biotechnology Information, comprises a set of duplicate Genbank genome records that can be modified by the NCBI staff annotators. EcoGene-RefSeq is being developed as a stand-alone internet resource to facilitate the usage of EcoGene-based tools on any of the >2400 completed prokaryotic genome records that are currently available at the RefSeq database.AvailabilityThe web interface of EcoGene-RefSeq is available at http://www.ecogene.org/refseq.Contactkrudd@med.miami.edu or j.zhou1@miami.edu.

| S-EPMC3712216 | biostudies-literature

Transcriptome analysis and molecular marker discovery in red pepper, Capsicum annuum L. TF68

2011-09-08 | E-GEOD-29215 | biostudies-arrayexpress

Homo sapiens

Project description:TK6 toxicogenomics reference database

| PRJNA252597 | ENA