Dataset Information

Integrated analysis of whole-genome paired-end and mate-pair sequencing data for identifying genomic structural variations in multiple myeloma.


ABSTRACT: We present a pipeline to perform integrative analysis of mate-pair (MP) and paired-end (PE) genomic DNA sequencing data. Our pipeline detects structural variations (SVs) by taking aligned sequencing read pairs as input and classifying them as properly paired or discordantly paired based on their orientation and inferred insert sizes. Recurrent SVs are then identified from the discordant read pairs. Our pipeline takes genomic annotation and repetitive element information into account to increase detection specificity. Application of our pipeline to whole-genome MP and PE sequencing data from three multiple myeloma cell lines (KMS11, MM.1S, and RPMI8226) recovered known SVs, such as the heterozygous TRAF3 deletion, as well as a novel, experimentally validated SPI1-ZNF287 inter-chromosomal rearrangement in the RPMI8226 cell line.
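
The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `ReadPair` record, the `classify_pair` function, the category labels, and the insert-size thresholds are all hypothetical, and the sketch assumes that PE libraries yield inward-facing (FR) pairs while MP libraries yield outward-facing (RF) pairs. The distance between mate start positions stands in for the inferred insert size.

```python
from dataclasses import dataclass


@dataclass
class ReadPair:
    """Hypothetical record for one aligned read pair (not a real library type)."""
    chrom1: str
    pos1: int
    strand1: str  # "+" or "-"
    chrom2: str
    pos2: int
    strand2: str  # "+" or "-"


def classify_pair(pair, min_insert, max_insert, expected="FR"):
    """Label a pair as "proper" or as one of several discordant categories.

    `expected` is the orientation assumed for the library type:
    "FR" for paired-end, "RF" for mate-pair (an assumption of this sketch).
    """
    # Mates on different chromosomes suggest an inter-chromosomal event.
    if pair.chrom1 != pair.chrom2:
        return "discordant:inter_chromosomal"

    # Order the mates by position so orientation is read left to right.
    if pair.pos1 <= pair.pos2:
        left, right = pair.strand1, pair.strand2
    else:
        left, right = pair.strand2, pair.strand1
    orientation = ("F" if left == "+" else "R") + ("F" if right == "+" else "R")
    if orientation != expected:
        return "discordant:orientation"

    # Approximate the insert size by the distance between mate positions.
    span = abs(pair.pos2 - pair.pos1)
    if span > max_insert:
        return "discordant:insert_too_large"
    if span < min_insert:
        return "discordant:insert_too_small"
    return "proper"


# Illustrative coordinates only; they are not taken from the dataset.
pe_pair = ReadPair("chr14", 1000, "+", "chr14", 1300, "-")
print(classify_pair(pe_pair, min_insert=200, max_insert=500))
```

In a sketch like this, pairs labelled `discordant:insert_too_large` would be candidate deletion evidence and `discordant:inter_chromosomal` pairs would be candidate translocation evidence; a real pipeline would additionally cluster such pairs and filter them against annotation and repeat tracks, as the abstract describes.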

SUBMITTER: Yang R 

PROVIDER: S-EPMC4179644 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

Publications

Integrated analysis of whole-genome paired-end and mate-pair sequencing data for identifying genomic structural variations in multiple myeloma.

Yang R, Chen L, Newman S, Gandhi K, Doho G, Moreno CS, Vertino PM, Bernal-Mizarchi L, Lonial S, Boise LH, Rossi M, Kowalski J, Qin ZS

Cancer Informatics, 2014-09-21, Suppl 2


Similar Datasets

| S-EPMC2905550 | biostudies-other
| S-EPMC6914798 | biostudies-literature
| PRJEB61637 | ENA
| PRJEB4453 | ENA
| S-EPMC3608957 | biostudies-literature
| S-EPMC5732791 | biostudies-literature
| S-EPMC6325071 | biostudies-literature
| PRJEB13570 | ENA
| S-EPMC5846771 | biostudies-literature
| S-EPMC3473372 | biostudies-literature