Unknown

Dataset Information

0

Mining significant high utility gene regulation sequential patterns.


ABSTRACT: BACKGROUND:Mining frequent gene regulation sequential patterns in time course microarray datasets is an important mining task in bioinformatics. Although finding such patterns are of paramount important for studying a disease, most existing work do not consider gene-disease association during gene regulation sequential pattern discovery. Moreover, they consider more absent/existence effects of genes during the mining process than taking the degrees of genes expression into account. Consequently, such techniques discover too many patterns which may not represent important information to biologists to investigate the relationships between the disease and underlying reasons hidden in gene regulation sequences. RESULTS:We propose a utility model by considering both the gene-disease association score and their degrees of expression levels under a biological investigation. We propose an efficient method called Top-HUGS, for discoverying significant high utility gene regulation sequential patterns from a time-course microarray dataset. CONCLUSIONS:In this study, the proposed methods were evaluated on a publicly available time course microarray dataset. The experimental results show higher accuracies compared to the baseline methods. Our proposed methods found that several new gene regulation sequential patterns involved in such patterns were useful for biologists and provided further insights into the mechanisms underpinning biological processes. To effectively work with the proposed method, a web interface is developed to our system using Java. To the best of our knowledge, this is the first demonstration for significant high utility gene regulation sequential pattern discovery.

SUBMITTER: Zihayat M 

PROVIDER: S-EPMC5751562 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mining significant high utility gene regulation sequential patterns.

Zihayat Morteza M   Davoudi Heidar H   An Aijun A  

BMC systems biology 20171214 Suppl 6


<h4>Background</h4>Mining frequent gene regulation sequential patterns in time course microarray datasets is an important mining task in bioinformatics. Although finding such patterns are of paramount important for studying a disease, most existing work do not consider gene-disease association during gene regulation sequential pattern discovery. Moreover, they consider more absent/existence effects of genes during the mining process than taking the degrees of genes expression into account. Conse  ...[more]

Similar Datasets

| S-EPMC5526537 | biostudies-other
| S-EPMC10058157 | biostudies-literature
| S-EPMC3848764 | biostudies-other
| S-EPMC4355605 | biostudies-other
| S-EPMC6042480 | biostudies-literature
| S-EPMC3333188 | biostudies-literature
| S-EPMC7118609 | biostudies-literature
| S-EPMC8743106 | biostudies-literature
| S-EPMC1574354 | biostudies-literature
| S-EPMC6480909 | biostudies-literature