Unknown

Dataset Information

0

Identification of conserved and polymorphic STRs for personal genomes.


ABSTRACT: BACKGROUND: Short tandem repeats (STRs) are abundant in human genomes. Numerous STRs have been shown to be associated with genetic diseases and gene regulatory functions, and have been selected as genetic markers for evolutionary and forensic analyses. High-throughput next generation sequencers have fostered new cutting-edge computing techniques for genome-scale analyses, and cross-genome comparisons have facilitated the efficient identification of polymorphic STR markers for various applications. RESULTS: An automated and efficient system for detecting human polymorphic STRs at the genome scale is proposed in this study. Assembled contigs from next generation sequencing data were aligned and calibrated according to selected reference sequences. To verify identified polymorphic STRs, human genomes from the 1000 Genomes Project were employed for comprehensive analyses, and STR markers from the Combined DNA Index System (CODIS) and disease-related STR motifs were also applied as cases for evaluation. In addition, we analyzed STR variations for highly conserved homologous genes and human-unique genes. In total 477 polymorphic STRs were identified from 492 human-unique genes, among which 26 STRs were retrieved and clustered into three different groups for efficient comparison. CONCLUSIONS: We have developed an online system that efficiently identifies polymorphic STRs and provides novel distinguishable STR biomarkers for different levels of specificity. Candidate polymorphic STRs within a personal genome could be easily retrieved and compared to the constructed STR profile through query keywords, gene names, or assembled contigs.

SUBMITTER: Chen CM 

PROVIDER: S-EPMC4304208 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of conserved and polymorphic STRs for personal genomes.

Chen Chien-Ming CM   Sio Chi-Pong CP   Lu Yu-Lun YL   Chang Hao-Teng HT   Hu Chin-Hwa CH   Pai Tun-Wen TW  

BMC genomics 20141212


<h4>Background</h4>Short tandem repeats (STRs) are abundant in human genomes. Numerous STRs have been shown to be associated with genetic diseases and gene regulatory functions, and have been selected as genetic markers for evolutionary and forensic analyses. High-throughput next generation sequencers have fostered new cutting-edge computing techniques for genome-scale analyses, and cross-genome comparisons have facilitated the efficient identification of polymorphic STR markers for various appl  ...[more]

Similar Datasets

| S-EPMC4742230 | biostudies-literature
| S-EPMC3232370 | biostudies-literature
| S-EPMC7080748 | biostudies-literature
| S-EPMC4678817 | biostudies-literature
| S-EPMC4449708 | biostudies-literature
| S-EPMC5133613 | biostudies-literature
| S-EPMC3381673 | biostudies-literature
| PRJEB35208 | ENA
| S-EPMC5088599 | biostudies-literature
| S-EPMC1182225 | biostudies-literature