Project description:This submission is a dataset of single-nucleus multi-omics of uninjured and injured spinal cords of mice harvested and profiled using 10x Multiome ATAC + Gene Expression kit.
Project description:R-loops represent an abundant class of large non-B DNA structures in genomes. Even though they form transiently and at modest frequencies, interfering with R-loop formation or dissolution significantly impacts genome stability. Addressing the mechanism(s) of R-loop-mediated genome destabilization requires a precise characterization of their distribution in genomes. A number of independent methods have been developed to visualize and map R-loops, but their results are at times discordant, leading to confusion. Here we review the main existing methodologies underlying R-loop mapping and assess their limitations and the robustness of existing datasets. We offer a set of best practices to improve the reproducibility of maps, hoping that such guidelines could be useful for authors and referees alike. Finally, we offer a possible resolution to the apparent contradictions in R-loop mapping outcomes between antibody-based and RNase H1-based mapping approaches.
Project description:Histone proteomics from 5 human cell lines, discussed in publication: "A practical guide for analysis of histone post-translational modifications by mass spectrometry: best practices and pitfalls". Further explanation of data types, methods, and analysis are available in Data Explanation document in the Methods and Protocols folder.
Project description:Enhanced cross-linking immunoprecipitation (eCLIP) featuring a size-matched input control has been recently applied to profile the binding sites of more than one hundred RNA binding proteins (RBPs). However computational pipelines and quality control metrics needed to process CLIP data at scale have yet to be well defined. Here, we describe our ENCODE eCLIP processing pipeline (https://github.com/YeoLab/eclip), enabling users to go from raw reads to processed peaks that are enriched above paired input, reproducible across biological replicates, and can be directly compared against the public ENCODE eCLIP resource. In particular, we discuss processing steps designed to address common artifacts, including properly quantifying unique RNA fragments bound by both unique genomic- and repetitive element-mapped reads. Using manual quality annotation of 350 ENCODE eCLIP experiments, we develop metrics for quality assessment of eCLIP experiments prior to and after sequencing, including library yield, number of unique fragments in the library, total binding relative information, and biological reproducibility. In particular, we quantify the commonly believed linkage between depth of sequencing and peak discovery, and derive methods for estimating required sequencing depth based on pre-sequencing metrics. Finally we provide recommendations for the common question of integrating RBP binding information with RNA-seq to generate splicing maps representing the positional effect of binding on alternative splicing. These pipelines and QC metrics enable large-scale processing and analysis of eCLIP data, and will help to standardize rigorous analysis of RBP binding data.
Project description:Enhanced cross-linking immunoprecipitation (eCLIP) featuring a size-matched input control has been recently applied to profile the binding sites of more than one hundred RNA binding proteins (RBPs). However computational pipelines and quality control metrics needed to process CLIP data at scale have yet to be well defined. Here, we describe our ENCODE eCLIP processing pipeline (https://github.com/YeoLab/eclip), enabling users to go from raw reads to processed peaks that are enriched above paired input, reproducible across biological replicates, and can be directly compared against the public ENCODE eCLIP resource. In particular, we discuss processing steps designed to address common artifacts, including properly quantifying unique RNA fragments bound by both unique genomic- and repetitive element-mapped reads. Using manual quality annotation of 350 ENCODE eCLIP experiments, we develop metrics for quality assessment of eCLIP experiments prior to and after sequencing, including library yield, number of unique fragments in the library, total binding relative information, and biological reproducibility. In particular, we quantify the commonly believed linkage between depth of sequencing and peak discovery, and derive methods for estimating required sequencing depth based on pre-sequencing metrics. Finally we provide recommendations for the common question of integrating RBP binding information with RNA-seq to generate splicing maps representing the positional effect of binding on alternative splicing. These pipelines and QC metrics enable large-scale processing and analysis of eCLIP data, and will help to standardize rigorous analysis of RBP binding data.