Project description:We performed proteomics on naturally infested green ash (F. pennsylvanica) trees at low and high levels of emerald ash borer (A. planipennis) infestation.
Our integrative analysis of the RNA-Seq and proteomics data identified 14 proteins and 4 transcripts that contribute most to the difference between highly infested and low infested trees.
Project description:Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. DNA copy number profiles generated with a new tool, ENCODER, were compared to DNA copy number profiles from SNP6, NimbleGen and low-coverage Whole Genome Sequencing.
Project description:Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. DNA copy number profiles generated with a new tool, ENCODER, were compared to DNA copy number profiles from SNP6, NimbleGen and low-coverage Whole Genome Sequencing.
Project description:MIPP-Seq provides an ultra-sensitive, low-cost approach for detecting and validating known and novel mutations in a highly scalable system with broad utility spanning both research and clinical diagnostic testing applications. The scalability of MIPP-Seq allows for multiplexing mutations and samples, which dramatically reduce costs of variant validation when compared to methods like ddPCR. By leveraging the power of individual analyses of multiple unique and independent reactions, MIPP-Seq can validate and precisely quantitate extremely low AAFs across multiple tissues and mutational categories including both indels and SNVs. Furthermore, using Illumina sequencing technology, MIPP-seq provides a robust method for accurate detection of novel mutations at an extremely low AAF.
Project description:<p>We used massively parallel sequencing technology to profile the genomic DNA and RNA of tumor cells (leukemic bone marrow) and normal cells (skin biopsy) obtained from a single patient with Acute Lymphoblastic Leukemia (ALL), referred to throughout this study as 'ALL1'. Included in this study are samples obtained from a primary tumor, first relapse, second relapse and several intermediate timepoints. We identified somatic mutations present in each tumor by analysis of whole genome (WGS) and exome sequence data. Single nucleotide variants (SNVs) and small insertions and deletions were identified in both the exome and WGS data. Large copy number variations (CNVs) and structural variants (SVs) were identified in the WGS data. A custom capture reagent was designed to target most variants and used to generate deep validation sequence data. The expression status of all somatic variants was assessed by RNA-seq. The RNA-seq data was also used for gene expression analysis and gene fusion detection.</p>
| phs001066 | dbGaP
Project description:Whole-genome variant detection in long-read sequencing data from ultra-low input tumor samples
Project description:Although most disease associations detected by GWAS are nongenic, very few have been mapped to causal regulatory variants. Here, we present a method for detecting regulatory QTLs that does not require genotyping or whole-genome sequencing. The method combines deep, long-read ChIP-seq with a new statistical test that simultaneously scores peak height correlation and allelic imbalance: the Genotype-independent Signal Correlation and Imbalance (G-SCI) test. We performed histone acetylation ChIP-seq on 57 human lymphoblastoid cell lines and used the resulting reads to call 500,066 SNPs de novo within regulatory elements. The G-SCI test annotated 8,764 of these as histone acetylation QTLs (haQTLs) - an order of magnitude larger than the set of candidates detected by expression QTL analysis. Lymphoblastoid haQTLs were highly predictive of autoimmune disease mechanisms. Thus, our method facilitates large-scale regulatory variant detection in any moderately-sized cohort for which functional profiling data can be generated, thus simplifying identification of causal variants within GWAS loci. We applied our method, named Regulatory Variant Ascertainment and chromatin Regression by sequencing (RegVAR-seq), to 57 cell lines from a single population group. We used the resulting sequence data for variant calling, and validated calls using an independent platform. We then identified histone acetylation QTLs (haQTLs) using a novel statistical test that requires no prior genotype information and combines peak height and allelic imbalance data across the 57 individuals. Transcription factor binding site analysis was used to independently support the functionality of haQTLs. Finally, we examined the association between haQTLs and SNPs associated with human phenotypes.
Project description:The first GSSM of V. vinifera was reconstructed (MODEL2408120001). Tissue-specific models for stem, leaf, and berry of the Cabernet Sauvignon cultivar were generated from the original model, through the integration of RNA-Seq data. These models have been merged into diel multi-tissue models to study the interactions between tissues at light and dark phases.
Project description:Here, we report an enrichment-based ultra-low input cfDNA methylation profiling method using methyl-CpG binding proteins capture, termed cfMBD-seq. We optimized the conditions of cfMBD capture by adjusting the amount of MethylCap protein along with using methylated filler DNA. Our data showed that cfMBD-seq performs equally to the standard MBD-seq (>1000 ng input) even when using 1 ng DNA as the input. cfMBD-seq demonstrated equivalent sequencing data quality as well as similar methylation profile when compared to cfMeDIP-seq. We showed that cfMBD-seq outperforms cfMeDIP-seq in the enrichment of CpG islands. This new bisulfite-free ultra-low input methylation profiling technology has a great potential in non-invasive and cost-effective cancer detection and classification.