Dataset Information

Discovery of transgene insertion sites by high throughput sequencing of mate pair libraries.

ABSTRACT: Transgenesis by random integration of a transgene into the genome of a zygote has become a reliable and powerful method for the creation of new mouse strains that express exogenous genes, including human disease genes, tissue specific reporter genes or genes that allow for tissue specific recombination. Nearly 6,500 transgenic alleles have been created by random integration in embryos over the last 30 years, but for the vast majority of these strains, the transgene insertion sites remain uncharacterized.To obtain a complete understanding of how insertion sites might contribute to phenotypic outcomes, to more cost effectively manage transgenic strains, and to fully understand mechanisms of instability in transgene expression, we've developed methodology and a scoring scheme for transgene insertion site discovery using high throughput sequencing data.Similar to other molecular approaches to transgene insertion site discovery, high-throughput sequencing of standard paired-end libraries is hindered by low signal to noise ratios. This problem is exacerbated when the transgene consists of sequences that are also present in the host genome. We've found that high throughput sequencing data from mate-pair libraries are more informative when compared to data from standard paired end libraries. We also show examples of the genomic regions that harbor transgenes, which have in common a preponderance of repetitive sequences.

SUBMITTER: Srivastava A

PROVIDER: S-EPMC4035081 | biostudies-other | 2014 May

REPOSITORIES: biostudies-other

ACCESS DATA

Publications

Discovery of transgene insertion sites by high throughput sequencing of mate pair libraries.

Srivastava Anuj A Philip Vivek M VM Greenstein Ian I Rowe Lucy B LB Barter Mary M Lutz Cathleen C Reinholdt Laura G LG

BMC genomics 20140514

<h4>Background</h4>Transgenesis by random integration of a transgene into the genome of a zygote has become a reliable and powerful method for the creation of new mouse strains that express exogenous genes, including human disease genes, tissue specific reporter genes or genes that allow for tissue specific recombination. Nearly 6,500 transgenic alleles have been created by random integration in embryos over the last 30 years, but for the vast majority of these strains, the transgene insertion s ...[more]

PMID: 24884803

Dataset Information

Discovery of transgene insertion sites by high throughput sequencing of mate pair libraries.

Publications

Discovery of transgene insertion sites by high throughput sequencing of mate pair libraries.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Construction of mate pair full-length cDNAs libraries and characterization of transcriptional start sites and termination sites.
| S-EPMC4176323 | biostudies-literature

Bispecific antibody target pair discovery by high-throughput phenotypic screening using in vitro combinatorial Fab libraries.
| S-EPMC7849716 | biostudies-literature

High-throughput discovery of phage receptors using transposon insertion sequencing of bacteria.
| S-EPMC7414163 | biostudies-literature

Cross-mapping and the identification of editing sites in mature microRNAs in high-throughput sequencing libraries.
| S-EPMC2813481 | biostudies-literature

A high-throughput splinkerette-PCR method for the isolation and sequencing of retroviral insertion sites.
| S-EPMC3627465 | biostudies-literature

A bioinformatics approach for identifying transgene insertion sites using whole genome sequencing data.
| S-EPMC5558722 | biostudies-literature

High-throughput sequencing of Campylobacter jejuni insertion mutant libraries reveals mapA as a fitness factor for chicken colonization.
| S-EPMC4010991 | biostudies-literature

Long-read sequencing for identification of insertion sites in large transposon mutant libraries.
| S-EPMC8894413 | biostudies-literature

Improving draft genome contiguity with reference-derived in silico mate-pair libraries.
| S-EPMC5967465 | biostudies-literature