Unknown

Dataset Information

0

Mixed sequence reader: a program for analyzing DNA sequences with heterozygous base calling.


ABSTRACT: The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as ?-defensin 4 (DEFB4) and its paralog HSPDP3.

SUBMITTER: Chang CT 

PROVIDER: S-EPMC3385616 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mixed sequence reader: a program for analyzing DNA sequences with heterozygous base calling.

Chang Chun-Tien CT   Tsai Chi-Neu CN   Tang Chuan Yi CY   Chen Chun-Houh CH   Lian Jang-Hau JH   Hu Chi-Yu CY   Tsai Chia-Lung CL   Chao Angel A   Lai Chyong-Huey CH   Wang Tzu-Hao TH   Lee Yun-Shien YS  

TheScientificWorldJournal 20120618


The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropria  ...[more]

Similar Datasets

| S-EPMC3557274 | biostudies-literature
| S-EPMC130864 | biostudies-literature
| S-EPMC7229812 | biostudies-literature
| S-EPMC3353060 | biostudies-literature
| S-EPMC5616130 | biostudies-literature
| S-EPMC3985508 | biostudies-literature
| S-EPMC6360951 | biostudies-literature
| S-EPMC2575221 | biostudies-literature
| S-EPMC4573246 | biostudies-literature
| S-EPMC3922877 | biostudies-literature