Unknown

Dataset Information

0

An organism-specific method to rank predicted coding regions in Trypanosoma brucei.


ABSTRACT: Genome annotation in differently evolved organisms presents challenges because the lack of sequence-based homology limits the ability to determine the function of putative coding regions. To provide an alternative to annotation by sequence homology, we developed a method that takes advantage of unusual trypanosomatid biology and skews in nucleotide composition between coding regions and upstream regions to rank putative open reading frames based on the likelihood of coding. The method is 93% accurate when tested on known genes. We have applied our method to the full complement of open reading frames on Chromosome I of Trypanosoma brucei, and we can predict with high confidence that 226 putative coding regions are likely to be functional. Methods such as the one described here for discriminating true coding regions are critical for genome annotation when other sources of evidence for function are limited.

SUBMITTER: Gopal S 

PROVIDER: S-EPMC219476 | biostudies-literature | 2003 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

An organism-specific method to rank predicted coding regions in Trypanosoma brucei.

Gopal Shuba S   Cross George A M GA   Gaasterland Terry T  

Nucleic acids research 20031001 20


Genome annotation in differently evolved organisms presents challenges because the lack of sequence-based homology limits the ability to determine the function of putative coding regions. To provide an alternative to annotation by sequence homology, we developed a method that takes advantage of unusual trypanosomatid biology and skews in nucleotide composition between coding regions and upstream regions to rank putative open reading frames based on the likelihood of coding. The method is 93% acc  ...[more]

Similar Datasets

| S-EPMC5650466 | biostudies-literature
| S-EPMC3292466 | biostudies-literature
2022-08-31 | GSE23885 | GEO
2009-07-11 | GSE17049 | GEO
| PRJNA369536 | ENA
| PRJNA438967 | ENA
| PRJNA101397 | ENA