Unknown

Dataset Information

0

TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences.


ABSTRACT: Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid 'seed' query to: (i) automatically identify and retrieve gene family homologs from a genomic database, (ii) characterize gene structure and (iii) perform phylogenetic analysis. Due to its high speed, TARGeT is also able to characterize very large gene families, including transposable elements (TEs). We evaluated TARGeT using well-annotated datasets, including the ascorbate peroxidase gene family of rice, maize and sorghum and several TE families in rice. In all cases, TARGeT rapidly recapitulated the known homologs and predicted new ones. We also demonstrated that TARGeT outperforms similar pipelines and has functionality that is not offered elsewhere.

SUBMITTER: Han Y 

PROVIDER: S-EPMC2699529 | biostudies-literature | 2009 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences.

Han Yujun Y   Burnette James M JM   Wessler Susan R SR  

Nucleic acids research 20090508 11


Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid 'se  ...[more]

Similar Datasets

| S-EPMC7196820 | biostudies-literature
| S-EPMC6027284 | biostudies-literature
| S-EPMC5605234 | biostudies-literature
2024-11-10 | GSE280365 | GEO
| S-EPMC6913007 | biostudies-literature
| S-EPMC403780 | biostudies-literature
| S-EPMC6829137 | biostudies-literature
| S-EPMC2850603 | biostudies-literature
| S-EPMC3517506 | biostudies-literature
| S-EPMC2203978 | biostudies-literature