Dataset Information

GSAlign: an efficient sequence alignment tool for intra-species genomes.


ABSTRACT: BACKGROUND: Personal genomics and comparative genomics are becoming more important in clinical practice and genome research. Both fields require sequence alignment to discover sequence conservation and variation. Though many methods have been developed, some are designed for small genome comparison, while others are not efficient for large genome comparison. Moreover, most existing genome comparison tools have not been systematically evaluated for the correctness of their sequence alignments. A wrong sequence alignment would produce false sequence variants. RESULTS: In this study, we present GSAlign, an efficient sequence alignment tool for intra-species genomes that handles large genome sequence alignment and identifies sequence variants from the alignment result. We estimate performance by measuring the correctness of the predicted sequence variations. The experimental results demonstrate that GSAlign is not only faster than most existing state-of-the-art methods, but also identifies sequence variants with high accuracy. CONCLUSIONS: As more genome sequences become available, the demand for genome comparison is increasing. Therefore, an efficient and robust algorithm is highly desirable. We believe GSAlign can be a useful tool. It offers ultra-fast alignment together with high accuracy and sensitivity in detecting sequence variations.

SUBMITTER: Lin HN 

PROVIDER: S-EPMC7041101 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

Publications

GSAlign: an efficient sequence alignment tool for intra-species genomes.

Lin Hsin-Nan, Hsu Wen-Lian

BMC Genomics, 2020-02-24, 1


Similar Datasets

| S-EPMC2951093 | biostudies-literature
| S-EPMC1579236 | biostudies-literature
| S-EPMC6821304 | biostudies-literature
| S-EPMC3532014 | biostudies-literature
2014-06-10 | E-GEOD-45684 | biostudies-arrayexpress
2014-06-10 | GSE45684 | GEO
| S-EPMC3333189 | biostudies-literature
| S-EPMC2668612 | biostudies-literature
| S-EPMC2893159 | biostudies-literature
2024-10-10 | PXD050548 | Pride