Unknown

Dataset Information

0

New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.


ABSTRACT: With the development of next-generation sequencing (NGS) technologies, a large amount of short read data has been generated. Assembly of these short reads can be challenging for genomes and metagenomes without template sequences, making alignment-based genome sequence comparison difficult. In addition, sequence reads from NGS can come from different regions of various genomes and they may not be alignable. Sequence signature-based methods for genome comparison based on the frequencies of word patterns in genomes and metagenomes can potentially be useful for the analysis of short reads data from NGS. Here we review the recent development of alignment-free genome and metagenome comparison based on the frequencies of word patterns with emphasis on the dissimilarity measures between sequences, the statistical power of these measures when two sequences are related and the applications of these measures to NGS data.

SUBMITTER: Song K 

PROVIDER: S-EPMC4017329 | biostudies-literature | 2014 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.

Song Kai K   Ren Jie J   Reinert Gesine G   Deng Minghua M   Waterman Michael S MS   Sun Fengzhu F  

Briefings in bioinformatics 20130923 3


With the development of next-generation sequencing (NGS) technologies, a large amount of short read data has been generated. Assembly of these short reads can be challenging for genomes and metagenomes without template sequences, making alignment-based genome sequence comparison difficult. In addition, sequence reads from NGS can come from different regions of various genomes and they may not be alignable. Sequence signature-based methods for genome comparison based on the frequencies of word pa  ...[more]

Similar Datasets

| S-EPMC3581251 | biostudies-literature
| S-EPMC4265526 | biostudies-literature
| S-EPMC2818754 | biostudies-literature
| S-EPMC3123933 | biostudies-literature
| S-EPMC2943993 | biostudies-literature
| S-EPMC3146591 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC4053384 | biostudies-literature
| S-EPMC4501066 | biostudies-literature
| S-EPMC4410667 | biostudies-literature