Unknown

Dataset Information

0

SINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets.


ABSTRACT:

Motivation

Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences.

Results

Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structural signals to identify a SINE element. SINE_Scan outperforms the previously published de novo SINE discovery program. It shows high sensitivity and specificity in 19 plant and animal genome assemblies, of which sizes vary from 120 Mb to 3.5 Gb. It identifies numerous new families and substantially increases the estimation of the abundance of SINEs in these genomes.

Availability and implementation

The code of SINE_Scan is freely available at http://github.com/maohlzj/SINE_Scan , implemented in PERL and supported on Linux.

Contact

wangh8@fudan.edu.cn.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Mao H 

PROVIDER: S-EPMC5408816 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

SINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets.

Mao Hongliang H   Wang Hao H  

Bioinformatics (Oxford, England) 20170301 5


<h4>Motivation</h4>Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences.<h4>Results</h4>Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structur  ...[more]

Similar Datasets

| S-EPMC5585668 | biostudies-literature
| S-EPMC15861 | biostudies-literature
| S-EPMC1356118 | biostudies-literature
| S-EPMC117560 | biostudies-literature
| S-EPMC4223381 | biostudies-literature
| S-EPMC8010984 | biostudies-literature
| S-EPMC6328105 | biostudies-literature
| S-EPMC9191728 | biostudies-literature
| S-EPMC47062 | biostudies-other
| S-EPMC1207951 | biostudies-other