Unknown

Dataset Information

0

GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture.


ABSTRACT:

Motivation

Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data.

Results

For this purpose we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, find genomic spatial overlap between various types of genomic features, filter and aggregate them in various ways.

Availability and implementation

The GORpipe software is freely available for non-commercial academic usage and can be downloaded from www.nextcode.com/gorpipe CONTACT: hakon@wuxinextcode.comSupplementary information: Supplementary data are available at Bioinformatics online.

SUBMITTER: Guðbjartsson H 

PROVIDER: S-EPMC5048061 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture.

Guðbjartsson Hákon H   Georgsson Guðmundur Fr GF   Guðjónsson Sigurjón A SA   Valdimarsson Ragnar Þór RÞ   Sigurðsson Jóhann H JH   Stefánsson Sigmar K SK   Másson Gísli G   Magnússon Gísli G   Pálmason Vilmundur V   Stefánsson Kári K  

Bioinformatics (Oxford, England) 20160623 20


<h4>Motivation</h4>Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data.<h4>Results</h4>For this purpose we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, f  ...[more]

Similar Datasets

| S-EPMC3044293 | biostudies-literature
| S-EPMC540011 | biostudies-literature
| S-EPMC7665227 | biostudies-literature
| S-EPMC6301331 | biostudies-literature
| S-EPMC5210639 | biostudies-literature
| S-EPMC4423967 | biostudies-literature
| S-EPMC6248266 | biostudies-literature
| S-EPMC5789007 | biostudies-literature