Unknown

Dataset Information

0

GeoBoost: accelerating research involving the geospatial metadata of virus GenBank records.


ABSTRACT: Summary:GeoBoost is a command-line software package developed to address sparse or incomplete metadata in GenBank sequence records that relate to the location of the infected host (LOIH) of viruses. Given a set of GenBank accession numbers corresponding to virus GenBank records, GeoBoost extracts, integrates and normalizes geographic information reflecting the LOIH of the viruses using integrated information from GenBank metadata and related full-text publications. In addition, to facilitate probabilistic geospatial modeling, GeoBoost assigns probability scores for each possible LOIH. Availability and implementation:Binaries and resources required for running GeoBoost are packed into a single zipped file and freely available for download at https://tinyurl.com/geoboost. A video tutorial is included to help users quickly and easily install and run the software. The software is implemented in Java 1.8, and supported on MS Windows and Linux platforms. Contact:gragon@upenn.edu. Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Tahsin T 

PROVIDER: S-EPMC5925778 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

GeoBoost: accelerating research involving the geospatial metadata of virus GenBank records.

Tahsin Tasnia T   Weissenbacher Davy D   O'Connor Karen K   Magge Arjun A   Scotch Matthew M   Gonzalez-Hernandez Graciela G  

Bioinformatics (Oxford, England) 20180501 9


<h4>Summary</h4>GeoBoost is a command-line software package developed to address sparse or incomplete metadata in GenBank sequence records that relate to the location of the infected host (LOIH) of viruses. Given a set of GenBank accession numbers corresponding to virus GenBank records, GeoBoost extracts, integrates and normalizes geographic information reflecting the LOIH of the viruses using integrated information from GenBank metadata and related full-text publications. In addition, to facili  ...[more]

Similar Datasets

| S-EPMC4997033 | biostudies-literature
| S-EPMC6225896 | biostudies-literature
| S-EPMC7755405 | biostudies-literature
2017-06-24 | GSE100427 | GEO
| S-EPMC2275786 | biostudies-literature
| S-EPMC2808878 | biostudies-literature
| S-EPMC6842603 | biostudies-literature
| S-EPMC8441584 | biostudies-literature
| S-EPMC7602672 | biostudies-literature
| S-EPMC7426930 | biostudies-literature