Dataset Information

Machado: Open source genomics data integration framework.

ABSTRACT: Genome projects and multiomics experiments generate huge volumes of data that must be stored, mined, and transformed into useful knowledge. All this information is supposed to be accessible and, if possible, browsable afterwards. Computational biologists have been dealing with this scenario for more than a decade and have been implementing software and databases to meet this challenge. The GMOD's (Generic Model Organism Database) biological relational database schema, known as Chado, is one of the few successful open source initiatives; it is widely adopted and many software packages are able to connect to it. We have been developing an open source software package named Machado, a genomics data integration framework implemented in Python, to enable research groups to both store and visualize genomics data. The framework relies on the Chado database schema and, therefore, should be very intuitive for current developers to adopt it or have it running on top of already existing databases. It has several data-loading tools for genomics and transcriptomics data and also for annotation results from tools such as BLAST, InterproScan, OrthoMCL, and LSTrAP. There is an API to connect to JBrowse, and a web visualization tool is implemented using Django Views and Templates. The Haystack library integrated with the ElasticSearch engine was used to implement a Google-like search, i.e., single auto-complete search box that provides fast results and filters. Machado aims to be a modern object-relational framework that uses the latest Python libraries to produce an effective open source resource for genomics research.

SUBMITTER: Mudadu MA

PROVIDER: S-EPMC7490629 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Machado: Open source genomics data integration framework.

Mudadu Mauricio de Alvarenga MA Zerlotini Adhemar A

GigaScience 20200901 9

<h4>Background</h4>Genome projects and multiomics experiments generate huge volumes of data that must be stored, mined, and transformed into useful knowledge. All this information is supposed to be accessible and, if possible, browsable afterwards. Computational biologists have been dealing with this scenario for more than a decade and have been implementing software and databases to meet this challenge. The GMOD's (Generic Model Organism Database) biological relational database schema, known as ...[more]

PMID: 32930331

Dataset Information

Machado: Open source genomics data integration framework.

Publications

Machado: Open source genomics data integration framework.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

JBioWH: an open-source Java framework for bioinformatics data integration.
| S-EPMC3708619 | biostudies-literature

A STATISTICAL FRAMEWORK FOR DATA INTEGRATION THROUGH GRAPHICAL MODELS WITH APPLICATION TO CANCER GENOMICS.
| S-EPMC6447291 | biostudies-literature

Picasso-server: a community-based, open-source processing framework for super-resolution data.
| S-EPMC9458736 | biostudies-literature

geWorkbench: an open source platform for integrative genomics.
| S-EPMC2894520 | biostudies-literature

STORMSeq: an open-source, user-friendly pipeline for processing personal genomics data in the cloud.
| S-EPMC3893165 | biostudies-literature

An open-source framework for end-to-end analysis of electronic health record data.
| S-EPMC11564094 | biostudies-literature

Void sorcerer: an open source, open access framework for mouse uroflowmetry.
| S-EPMC6627548 | biostudies-literature

Multi-modal framework for battery state of health evaluation using open-source electric vehicle data.
| S-EPMC11779878 | biostudies-literature

A unified framework for the integration of multiple hierarchical clusterings or networks from multi-source data.
| S-EPMC8336092 | biostudies-literature

Jenkins-CI, an Open-Source Continuous Integration System, as a Scientific Data and Image-Processing Platform.
| S-EPMC5322829 | biostudies-literature