Project description:The Yeonsan Ogye (Ogye) is the rare black chicken breed domesticated in Korean peninsula, which has been noted for entire black color upon its appearances including feather, skin, comb, eyes, shank, claws and internal organs. In this study, whole genome, transcriptome and epigenome sequencings of Ogye were performed using high-throughput NGS sequencing platforms. We have produced Illumina short-reads (Paired-End, Mate-Pair and FOSMID) and PacBio long-reads for whole genome sequencing (WGS), 1.4 billion reads for RNA-seq, and 123 million reads for RRBS (reduced representation bisulfite sequencing) data. Using WGS data, Ogye genome has been assembled, and coding/non-coding transcriptome maps were constructed on Ogye genome given largescale sequencing data. We have predicted 17,472 (3,550 newly annotated and 13,922 known) protein-coding transcripts, and 9,443 (6,689 novel and 2,754 known) long non-coding RNAs (lncRNAs).
Project description:The Yeonsan Ogye (Ogye) is the rare black chicken breed domesticated in Korean peninsula, which has been noted for entire black color upon its appearances including feather, skin, comb, eyes, shank, claws and internal organs. In this study, whole genome, transcriptome and epigenome sequencings of Ogye were performed using high-throughput NGS sequencing platforms. We have produced Illumina short-reads (Paired-End, Mate-Pair and FOSMID) and PacBio long-reads for whole genome sequencing (WGS), 1.4 billion reads for RNA-seq, and 123 million reads for RRBS (reduced representation bisulfite sequencing) data. Using WGS data, Ogye genome has been assembled, and coding/non-coding transcriptome maps were constructed on Ogye genome given largescale sequencing data. We have predicted 17,472 (3,550 newly annotated and 13,922 known) protein-coding transcripts, and 9,443 (6,689 novel and 2,754 known) long non-coding RNAs (lncRNAs).
Project description:Investigate long non-coding RNA (lncRNA) expression characteristics in the peripheral blood lymphocytes of Xinjiang Kazakh people with essential hypertension.
Project description:Whole genome sequencing (WGS) of tongue cancer samples and cell line was performed to identify the fusion gene translocation breakpoint. WGS raw data was aligned to human reference genome (GRCh38.p12) using BWA-MEM (v0.7.17). The BAM files generated were further analysed using SvABA (v1.1.3) tool to identify translocation breakpoints. The translocation breakpoints were annotated using custom scripts, using the reference GENCODE GTF (v30). The fusion breakpoints identified in the SvABA analysis were additionally confirmed using MANTA tool (v1.6.0).
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We performed shallow whole genome sequencing (WGS) on circulating free (cf)DNA extracted from plasma or cerebrospinal fluid (CSF), and shallow WGS on the tissue DNA extracted from the biopsy in order to evaluate the correlation between the two biomaterials. After library construction and sequencing (Hiseq3000 or Ion Proton), copy number variations were called with WisecondorX.