Project description:DamID is a powerful technique for identifying regions of the genome bound by a DNA-binding (or DNA-associated) protein. Currently no method exists for automatically processing next-generation sequencing DamID (DamID-seq) data, and the use of DamID-seq datasets with normalisation based on read-counts alone can lead to high background and the loss of bound signal. DamID-seq thus presents novel challenges in terms of normalisation and background minimisation. We describe here damidseq_pipeline, a software pipeline that performs automatic normalisation and background reduction on multiple DamID-seq FASTQ or BAM datasets. Single replicate profiling of pol II occupancy in 3rd instar larval neuroblasts of Drosophila
Project description:Analysis of gene expression regulation typically requires identification of genomic sites where regulatory proteins bind. For this purpose, ChIP and DamID methods applied to cell lines or model organisms are now routinely used, even in selected cell types. In this work, we present modifications to experimental DamID protocol, as well as a custom data processing algorithm that allows to confidently identify genomic sites enriched with the proteins of interest. This algorithm is implemented in Perl and is also available as executable files thereby making DamID analysis relatively straightforward. Finally, we demonstrate how this pipeline performs when fed with real experimental data.
Project description:DamID is a powerful technique for identifying regions of the genome bound by a DNA-binding (or DNA-associated) protein. Currently no method exists for automatically processing next-generation sequencing DamID (DamID-seq) data, and the use of DamID-seq datasets with normalisation based on read-counts alone can lead to high background and the loss of bound signal. DamID-seq thus presents novel challenges in terms of normalisation and background minimisation. We describe here damidseq_pipeline, a software pipeline that performs automatic normalisation and background reduction on multiple DamID-seq FASTQ or BAM datasets.
Project description:Paired-Tag is an ultra-high throughput single-cell method for simultaneous profiling of gene expression and histone modifications, enabling identification of cell-type-specific cis-regulatory elements and correlation of their chromatin states with the expression levels of putative target genes. However, the lack of an automated end-to-end pipeline has limited its application. Here, we present easyPairedTag, a Snakemake pipeline for Paired-Tag data processing and quality control. Key features include flexible configuration for diverse experimental setups, automated sub-library merging and sample demultiplexing, and comprehensive quality control metrics. When applied to published mouse brain datasets, easyPairedTag improves overlapping gene quantification via strand-specific analysis for precise cell clustering. In mouse hypothalamus datasets, easyPairedTag is compatible with the processing of paired-end sequencing data to detect histone modification peaks with higher sensitivity and specificity, facilitating the discovery of putitive cell-type-specific cis-regulatory elements.
Project description:This project’s aim was to compare the transcriptional profiles of olfactory sensory neurons in Drosophila melanogaster in order to identify novel genes that specify neuron-specific functions/phenotypes or may otherwise be involved in the development of the olfactory system. The isolation of sufficient numbers of intact olfactory sensory neurons (OSN) from the antenna of Drosophila melanogaster has so far limited single-cell transcriptomic approaches being applied to the adult fly antenna. Targeted DamID (TaDa) provides an alternative approach for profiling transcriptional activity in a cell-specific manor that bypasses the need for isolating OSN. Using the Gal4/UAS system, we applied TaDa to seven OSN populations and compared differences in Pol II occupancy for genes across these datasets.