Unknown

Dataset Information

0

Segtor: rapid annotation of genomic coordinates and single nucleotide variations using segment trees.


ABSTRACT: Various research projects often involve determining the relative position of genomic coordinates, intervals, single nucleotide variations (SNVs), insertions, deletions and translocations with respect to genes and their potential impact on protein translation. Due to the tremendous increase in throughput brought by the use of next-generation sequencing, investigators are routinely faced with the need to annotate very large datasets. We present Segtor, a tool to annotate large sets of genomic coordinates, intervals, SNVs, indels and translocations. Our tool uses segment trees built using the start and end coordinates of the genomic features the user wishes to use instead of storing them in a database management system. The software also produces annotation statistics to allow users to visualize how many coordinates were found within various portions of genes. Our system currently can be made to work with any species available on the UCSC Genome Browser. Segtor is a suitable tool for groups, especially those with limited access to programmers or with interest to analyze large amounts of individual genomes, who wish to determine the relative position of very large sets of mapped reads and subsequently annotate observed mutations between the reads and the reference. Segtor (http://lbbc.inca.gov.br/segtor/) is an open-source tool that can be freely downloaded for non-profit use. We also provide a web interface for testing purposes.

SUBMITTER: Renaud G 

PROVIDER: S-EPMC3206052 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

Segtor: rapid annotation of genomic coordinates and single nucleotide variations using segment trees.

Renaud Gabriel G   Neves Pedro P   Folador Edson Luiz EL   Ferreira Carlos Gil CG   Passetti Fabio F  

PloS one 20111101 11


Various research projects often involve determining the relative position of genomic coordinates, intervals, single nucleotide variations (SNVs), insertions, deletions and translocations with respect to genes and their potential impact on protein translation. Due to the tremendous increase in throughput brought by the use of next-generation sequencing, investigators are routinely faced with the need to annotate very large datasets. We present Segtor, a tool to annotate large sets of genomic coor  ...[more]

Similar Datasets

| S-EPMC9236577 | biostudies-literature
| S-EPMC2643714 | biostudies-literature
| S-EPMC6244222 | biostudies-literature
| S-EPMC10692869 | biostudies-literature
2009-06-15 | E-GEOD-16190 | biostudies-arrayexpress
| S-EPMC4253807 | biostudies-literature
| S-EPMC2760790 | biostudies-literature
| S-EPMC4987916 | biostudies-literature
| S-EPMC8674696 | biostudies-literature
2009-06-15 | GSE16190 | GEO