Project description:Purpose: The goal of this study is to compare endothelial small RNA transcriptome to identify the target of OASL under basal or stimulated conditions by utilizing miRNA-seq. Methods: Endothelial miRNA profilies of siCTL or siOASL transfected HUVECs were generated by illumina sequencing method, in duplicate. After sequencing, the raw sequence reads are filtered based on quality. The adapter sequences are also trimmed off the raw sequence reads. rRNA removed reads are sequentially aligned to reference genome (GRCh38) and miRNA prediction is performed by miRDeep2. Results: We identified known miRNA in species (miRDeep2) in the HUVECs transfected with siCTL or siOASL. The expression profile of mature miRNA is used to analyze differentially expressed miRNA(DE miRNA). Conclusions: Our study represents the first analysis of endothelial miRNA profiles affected by OASL knockdown with biologic replicates.
Project description:We report the application of ATACseq to differentiating ATDC5 cells with the aim of identifying how chromatin architecture is changing as these cells move from pre-chondrocytes to differentiated chondrocytes. Raw data was generated by sequencing using the Novaseq6000 with 50bp single end reads. Trimming to remove adapters or poor-quality reads was performed with no sequence below 30 nucleotides in length used in the analysis. We found that there were significant changes in chromatin accessibility at both days 3 and 10 of differentiation. Using diffBind, differential peaks were identified at day 3 and 10. These peaks were further analyzed for enriched transcription factor binding sites and identified motifs for the RUNX family transcription factors as well as SOX9.
Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long-reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other species of squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein coding genes supported by evidence and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:A cDNA library was constructed by Novogene (CA, USA) using a Small RNA Sample Pre Kit, and Illumina sequencing was conducted according to company workflow, using 20 million reads. Raw data were filtered for quality as determined by reads with a quality score > 5, reads containing N < 10%, no 5' primer contaminants, and reads with a 3' primer and insert tag. The 3' primer sequence was trimmed and reads with a poly A/T/G/C were removed
Project description:Whole exome sequencing of 5 HCLc tumor-germline pairs. Genomic DNA from HCLc tumor cells and T-cells for germline was used. Whole exome enrichment was performed with either Agilent SureSelect (50Mb, samples S3G/T, S5G/T, S9G/T) or Roche Nimblegen (44.1Mb, samples S4G/T and S6G/T). The resulting exome libraries were sequenced on the Illumina HiSeq platform with paired-end 100bp reads to an average depth of 120-134x. Bam files were generated using NovoalignMPI (v3.0) to align the raw fastq files to the reference genome sequence (hg19) and picard tools (v1.34) to flag duplicate reads (optical or pcr), unmapped reads, reads mapping to more than one location, and reads failing vendor QC.
Project description:Species identification of fragmentary bones remains a challenging task in archeology and forensics. A species identification method for such fragmentary bones that has recently attracted interest is the use of bone collagen proteins. We developed a method similar to DNA barcoding that reads collagen protein sequences in bone and automatically determines the species by performing sequence database searches. We tested our method using bone samples from 30 vertebrate species ranging from mammals to fish.
Project description:We use nucleosome maps obtained by high-throughput sequencing to study sequence specificity of intrinsic histone-DNA interactions. In contrast with previous approaches, we employ an analogy between a classical one-dimensional fluid of finite-size particles in an arbitrary external potential and arrays of DNA-bound histone octamers. We derive an analytical solution to infer free energies of nucleosome formation directly from nucleosome occupancies measured in high-throughput experiments. The sequence-specific part of free energies is then captured by fitting them to a sum of energies assigned to individual nucleotide motifs. We have developed hierarchical models of increasing complexity and spatial resolution, establishing that nucleosome occupancies can be explained by systematic differences in mono- and dinucleotide content between nucleosomal and linker DNA sequences, with periodic dinucleotide distributions and longer sequence motifs playing a secondary role. Furthermore, similar sequence signatures are exhibited by control experiments in which genomic DNA is either sonicated or digested with micrococcal nuclease in the absence of nucleosomes, making it possible that current predictions based on highthroughput nucleosome positioning maps are biased by experimental artifacts. Included are raw (eland) and mapped (wig) reads. The mapped reads are provided in eland and wiggle formats, and the raw reads are included in the eland file. This series includes only Mnase control data. The sonicated control is part of this already published accession, as is a in vitro nucleosome map: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15188 We also studied data (in vitro and in vivo maps as well as a model) from http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13622 and from: http://www.ncbi.nlm.nih.gov/sra/?term=SRA001023