Unknown

Dataset Information

0

Features generated for computational splice-site prediction correspond to functional elements.


ABSTRACT:

Background

Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologically meaningful splicing signals.

Results

We present examples from the observed features that correspond to known signals, both core signals (including the branch site and pyrimidine tract) and auxiliary signals (including GGG triplets and exon splicing enhancers). We present evidence that features identified by FGA include splicing signals not found by other methods.

Conclusion

Our generated features capture known biological signals in the expected sequence interval flanking splice sites. The method can be easily applied to other species and to similar classification problems, such as tissue-specific regulatory elements, polyadenylation sites, promoters, etc.

SUBMITTER: Dogan RI 

PROVIDER: S-EPMC2241647 | biostudies-literature | 2007 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Features generated for computational splice-site prediction correspond to functional elements.

Dogan Rezarta Islamaj RI   Getoor Lise L   Wilbur W John WJ   Mount Stephen M SM  

BMC bioinformatics 20071024


<h4>Background</h4>Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologicall  ...[more]

Similar Datasets

| S-EPMC8360004 | biostudies-literature
| S-EPMC4455060 | biostudies-literature
| S-EPMC275452 | biostudies-literature
| S-EPMC8609763 | biostudies-literature
| S-EPMC1258172 | biostudies-literature
| S-EPMC2777938 | biostudies-literature
| S-EPMC6821175 | biostudies-literature
| S-EPMC2897571 | biostudies-literature
| S-EPMC6804535 | biostudies-literature
| S-EPMC2947103 | biostudies-literature