Dataset Information

Benchmarking computational tools for polymorphic transposable element detection.


ABSTRACT: Transposable elements (TEs) are an important source of human genetic variation with demonstrable effects on phenotype. Recently, a number of computational methods for the detection of polymorphic TE (polyTE) insertion sites from next-generation sequence data have been developed. The use of such tools will become increasingly important as the pace of human genome sequencing accelerates. For this report, we performed a comparative benchmarking and validation analysis of polyTE detection tools in an effort to inform their selection and use by the TE research community. We analyzed a core set of seven tools with respect to ease of use and accessibility, polyTE detection performance and runtime parameters. An experimentally validated set of 893 human polyTE insertions was used for this purpose, along with a series of simulated data sets that allowed us to assess the impact of sequence coverage on tool performance. The recently developed tool MELT showed the best overall performance followed by Mobster and then RetroSeq. PolyTE detection tools can best detect Alu insertion events in the human genome with reduced reliability for L1 insertions and substantially lowered performance for SVA insertions. We also show evidence that different polyTE detection tools are complementary with respect to their ability to detect a complete set of insertion events. Accordingly, a combined approach, coupled with manual inspection of individual results, may yield the best overall performance. In addition to the benchmarking results, we also provide notes on tool installation and usage as well as suggestions for future polyTE detection algorithm development.

SUBMITTER: Rishishwar L 

PROVIDER: S-EPMC5808724 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

Publications

Benchmarking computational tools for polymorphic transposable element detection.

Rishishwar L, Mariño-Ramírez L, Jordan IK

Briefings in Bioinformatics, 2017-11-01, issue 6


Similar Datasets

| S-EPMC6437167 | biostudies-literature
| S-EPMC6913007 | biostudies-literature
| S-EPMC4066757 | biostudies-literature
| S-EPMC5517527 | biostudies-literature
| S-EPMC8187371 | biostudies-literature
| S-EPMC4898795 | biostudies-literature
2014-11-14 | E-GEOD-57191 | biostudies-arrayexpress
| S-EPMC4478395 | biostudies-literature
| S-EPMC5003039 | biostudies-literature