Project description:DNA mate pair and RNA sequencing data of conventional osteosarcomas. Mate pair libraries, with average insert sizes of 2-4 kb, were prepared for sequencing using the Nextera Mate Pair Library Preparation Kit. Paired-end 76 base pair reads were generated using an Illumina NextSeq 500 sequencing instrument. Total RNA was enriched for polyadenylated RNA using magnetic oligo(dT) beads. Enriched RNA was prepared for sequencing using the TruSeq RNA Sample Preparation Kit v2 and paired-end 151 base pair reads were generated from the cDNA libraries using an Illumina NextSeq 500 instrument.
Project description:Mate pair sequencing for the detection of chromosomal aberrations in patients with intellectual disability and congenital malformations
Project description:Copy number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have suggested replication stress, such as that caused by the polymerase inhibitor aphidicolin, as a causative factor in CNV formation, but existing data are technically limited in the quality of the comparisons which can be made to experimentally induced variants. Here we used 1M feature single-nucleotide polymorphism (SNP) arrays and mate-pair sequencing as high resolution methods for characterizing CNVs in a common set of samples, to compare both the properties of constitutional and induced CNVs as well as the utility of the two methods in an experimental setting. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions that allowed some CNVs to be associated with inversions, ectopic duplications or LINE insertions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs. One untreated and one aphidicolin-treated subclone of human fibroblast cell line HGMDFN090 were analyzed by Illumina HumanOmni1-Quad SNP array and low-density mate-pair sequencing.
Project description:Copy number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have suggested replication stress, such as that caused by the polymerase inhibitor aphidicolin, as a causative factor in CNV formation, but existing data are technically limited in the quality of the comparisons which can be made to experimentally induced variants. Here we used 1M feature single-nucleotide polymorphism (SNP) arrays and mate-pair sequencing as high resolution methods for characterizing CNVs in a common set of samples, to compare both the properties of constitutional and induced CNVs as well as the utility of the two methods in an experimental setting. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions that allowed some CNVs to be associated with inversions, ectopic duplications or LINE insertions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs.
Project description:Summary: Illumina's recently released Nextera Long Mate Pair (LMP) kit enables production of jumping libraries of up to 12 kb. The LMP libraries are an invaluable resource for carrying out complex assemblies and other downstream bioinformatics analyses such as the characterization of structural variants. However, LMP libraries are intrinsically noisy and to maximize their value, post-sequencing data analysis is required. Standardizing laboratory protocols and the selection of sequenced reads for downstream analysis are non-trivial tasks. NextClip is a tool for analyzing reads from LMP libraries, generating a comprehensive quality report and extracting good quality trimmed and deduplicated reads. Availability and implementation: Source code, user guide and example data are available from https://github.com/richardmleggett/nextclip/.
Project description:We used a Drosophila melanogaster line (a "double balancer") carrying balancer chromosomes for both the second (CyO) and third (TM3) chromosomes. We crossed the double balancer to an isogenic wild-type "virginizer" line to obtain trans-heterozygous adults from the F1 generation. Whole-genome sequencing and mate pair sequencing were used to identify single nucleotide variants (SNVs) and structural variants (SVs) on both chromosomes.