Unknown

Dataset Information

0

GISMO--gene identification using a support vector machine for ORF classification.


ABSTRACT: We present the novel prokaryotic gene finder GISMO, which combines searches for protein family domains with composition-based classification based on a support vector machine. GISMO is highly accurate; exhibiting high sensitivity and specificity in gene identification. We found that it performs well for complete prokaryotic chromosomes, irrespective of their GC content, and also for plasmids as short as 10 kb, short genes and for genes with atypical sequence composition. Using GISMO, we found several thousand new predictions for the published genomes that are supported by extrinsic evidence, which strongly suggest that these are very likely biologically active genes. The source code for GISMO is freely available under the GPL license.

SUBMITTER: Krause L 

PROVIDER: S-EPMC1802617 | biostudies-literature | 2007

REPOSITORIES: biostudies-literature

altmetric image

Publications

GISMO--gene identification using a support vector machine for ORF classification.

Krause Lutz L   McHardy Alice C AC   Nattkemper Tim W TW   Pühler Alfred A   Stoye Jens J   Meyer Folker F  

Nucleic acids research 20061214 2


We present the novel prokaryotic gene finder GISMO, which combines searches for protein family domains with composition-based classification based on a support vector machine. GISMO is highly accurate; exhibiting high sensitivity and specificity in gene identification. We found that it performs well for complete prokaryotic chromosomes, irrespective of their GC content, and also for plasmids as short as 10 kb, short genes and for genes with atypical sequence composition. Using GISMO, we found se  ...[more]

Similar Datasets

| S-EPMC4395415 | biostudies-literature
| S-EPMC1184049 | biostudies-literature
| S-EPMC3962446 | biostudies-literature
| S-EPMC4057401 | biostudies-literature
| S-EPMC5662531 | biostudies-other
| S-EPMC1594580 | biostudies-literature
| S-EPMC4183366 | biostudies-literature
| S-EPMC4670226 | biostudies-literature
| S-EPMC7412107 | biostudies-literature