Unknown

Dataset Information

0

A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.


ABSTRACT: Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.

SUBMITTER: Lu Q 

PROVIDER: S-EPMC4444969 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.

Lu Qiongshi Q   Hu Yiming Y   Sun Jiehuan J   Cheng Yuwei Y   Cheung Kei-Hoi KH   Zhao Hongyu H  

Scientific reports 20150527


Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-  ...[more]

Similar Datasets

| S-EPMC3190956 | biostudies-literature
| S-EPMC5963360 | biostudies-literature
| S-EPMC3632130 | biostudies-literature
| S-EPMC5570202 | biostudies-literature
| S-EPMC5905645 | biostudies-literature
| S-EPMC4481266 | biostudies-literature
| S-EPMC2570367 | biostudies-literature
| S-EPMC6447291 | biostudies-literature
| S-EPMC1630634 | biostudies-literature
| S-EPMC10229068 | biostudies-literature