Unknown

Dataset Information

0

CardioTF, a database of deconstructing transcriptional circuits in the heart system.


ABSTRACT:

Background

Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database was constructed. The purpose of this database is to collate information on cardiovascular transcription factors (TFs), position weight matrices (PWMs), and enhancer sequences discovered using the ChIP-seq method.

Methods

The Naïve-Bayes algorithm was used to classify literature and identify all PubMed abstracts on cardiovascular development. The natural language learning tool GNAT was then used to identify corresponding gene names embedded within these abstracts. Local Perl scripts were used to integrate and dump data from public databases into the MariaDB management system (MySQL). In-house R scripts were written to analyze and visualize the results.

Results

Known cardiovascular TFs from humans and human homologs from fly, Ciona, zebrafish, frog, chicken, and mouse were identified and deposited in the database. PWMs from Jaspar, hPDI, and UniPROBE databases were deposited in the database and can be retrieved using their corresponding TF names. Gene enhancer regions from various sources of ChIP-seq data were deposited into the database and were able to be visualized by graphical output. Besides biocuration, mouse homologs of the 81 core cardiac TFs were selected using a Naïve-Bayes approach and then by intersecting four independent data sources: RNA profiling, expert annotation, PubMed abstracts and phenotype.

Discussion

The CardioTF database can be used as a portal to construct transcriptional network of cardiac development.

Availability and implementation

Database URL: http://www.cardiosignal.org/database/cardiotf.html.

SUBMITTER: Zhen Y 

PROVIDER: S-EPMC5012272 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

CardioTF, a database of deconstructing transcriptional circuits in the heart system.

Zhen Yisong Y  

PeerJ 20160823


<h4>Background</h4>Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database was constructed. The purpose of this database is to collate information on cardiovascular transcription factors (TFs), position weight matrices (PWMs), and enhancer sequences discovered using  ...[more]

Similar Datasets

| S-EPMC4256722 | biostudies-literature
| S-EPMC3116505 | biostudies-literature
| S-EPMC6972013 | biostudies-literature
2014-12-04 | E-GEOD-60749 | biostudies-arrayexpress
| S-EPMC2973918 | biostudies-literature
| S-EPMC6584022 | biostudies-literature
| S-EPMC7796611 | biostudies-literature
| S-EPMC4765898 | biostudies-literature
| S-EPMC5495784 | biostudies-literature
| S-EPMC5996764 | biostudies-literature