Dataset Information

PROTOGENE: turning amino acid alignments into bona fide CDS nucleotide alignments.


ABSTRACT: We describe Protogene, a server that can turn a protein multiple sequence alignment into the equivalent alignment of the original gene coding DNA. Protogene relies on a pipeline where every initial protein sequence is BLASTed against RefSeq or NR. The annotation associated with potential matches is used to identify the gene sequence. This gene sequence is then aligned with the query protein using Exonerate in order to extract a coding nucleotide sequence matching the original protein. Protogene can handle protein fragments and will return every CDS coding for a given protein, even if they occur in different genomes. Protogene is available from http://www.tcoffee.org/.
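The last stage described in the abstract, mapping a coding sequence back onto the gapped protein row, can be illustrated with a small sketch. This is not Protogene's code: the function name `thread_cds_onto_alignment` and the toy sequences are hypothetical, and the sketch assumes the CDS has already been retrieved (the BLAST and Exonerate steps are not shown) and codes for the protein exactly, codon by codon.

```python
def thread_cds_onto_alignment(aligned_protein: str, cds: str, gap: str = "-") -> str:
    """Insert codon-sized gaps into an ungapped CDS so it mirrors an aligned protein row.

    Assumption: `cds` codes for the ungapped protein exactly, optionally
    followed by one stop codon that has no residue in the protein.
    """
    residues = sum(1 for ch in aligned_protein if ch != gap)
    if len(cds) not in (3 * residues, 3 * residues + 3):
        raise ValueError("CDS length does not match the number of residues")

    codons = []
    pos = 0
    for ch in aligned_protein:
        if ch == gap:
            # One gap column in the protein becomes three gap columns in the DNA.
            codons.append(gap * 3)
        else:
            codons.append(cds[pos:pos + 3])
            pos += 3
    return "".join(codons)


# Toy input (hypothetical): a two-row protein alignment and the matching CDSs.
protein_alignment = {
    "seq1": "MK-L",
    "seq2": "M-AL",
}
coding_sequences = {
    "seq1": "ATGAAACTG",
    "seq2": "ATGGCTCTG",
}

nucleotide_alignment = {
    name: thread_cds_onto_alignment(row, coding_sequences[name])
    for name, row in protein_alignment.items()
}

for name, row in nucleotide_alignment.items():
    print(name, row)
```

Each protein column maps to exactly three nucleotide columns, so the resulting rows keep the same relative alignment as the protein input. Protein fragments and imperfect matches, which the server is described as handling, would need the protein-to-gene alignment step rather than this direct threading.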

SUBMITTER: Moretti S 

PROVIDER: S-EPMC1538918 | biostudies-literature | 2006 Jul

REPOSITORIES: biostudies-literature

Publications

PROTOGENE: turning amino acid alignments into bona fide CDS nucleotide alignments.

Sébastien Moretti, Frédéric Reinier, Olivier Poirot, Fabrice Armougom, Stéphane Audic, Vladimir Keduas, Cédric Notredame

Nucleic Acids Research, 2006 Jul 1 (Web Server issue)


Similar Datasets

| S-EPMC6675789 | biostudies-literature
| S-EPMC4944278 | biostudies-literature
| S-EPMC7801460 | biostudies-literature
| S-EPMC6985830 | biostudies-literature
| S-EPMC419649 | biostudies-other
| S-EPMC2573217 | biostudies-literature
| S-EPMC8380510 | biostudies-literature
| S-EPMC6348098 | biostudies-literature
| S-EPMC5760642 | biostudies-literature
| S-EPMC6259105 | biostudies-literature