Unknown

Dataset Information

0

Bayesian centroid estimation for motif discovery.


ABSTRACT: Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

SUBMITTER: Carvalho L 

PROVIDER: S-EPMC3855595 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bayesian centroid estimation for motif discovery.

Carvalho Luis L  

PloS one 20131206 12


Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss functi  ...[more]

Similar Datasets

| S-EPMC2265131 | biostudies-other
| S-EPMC4388571 | biostudies-literature
2024-03-26 | GSE234010 | GEO
| S-EPMC2687942 | biostudies-literature
| S-EPMC3745480 | biostudies-other
| S-EPMC3390389 | biostudies-literature
| S-EPMC3622139 | biostudies-literature
| S-EPMC4796016 | biostudies-literature