
Dataset Information


An EST resource for tilapia based on 17 normalized libraries and assembly of 116,899 sequence tags.


ABSTRACT: Large collections of expressed sequence tags (ESTs) are a fundamental resource for analysis of gene expression and annotation of genome sequences. We generated 116,899 ESTs from 17 normalized and two non-normalized cDNA libraries representing 16 tissues from tilapia, a cichlid fish widely used in aquaculture and biological research. The ESTs were assembled into 20,190 contigs and 36,028 singletons, for a total of 56,218 unique sequences and a total assembled length of 35,168,415 bp. Over the whole project, a unique sequence was discovered for every 2.079 sequence reads. Of these unique sequences, 17,722 (31.5%) had significant BLAST hits (e-value < 10^-10) to the UniProt database. Normalization of the cDNA pools with double-stranded nuclease allowed us to efficiently sequence a large collection of ESTs. These sequences are an important resource for studies of gene expression, comparative mapping and annotation of the forthcoming tilapia genome sequence.
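The summary figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, the input numbers are copied from the abstract, and the mean assembled length is a derived value that the abstract itself does not report.

```python
# Figures quoted in the abstract (variable names are illustrative assumptions).
total_reads = 116_899      # ESTs generated
contigs = 20_190           # assembled contigs
singletons = 36_028        # unassembled singletons
assembled_bp = 35_168_415  # total assembled length in base pairs
blast_hits = 17_722        # unique sequences with a significant UniProt hit

# Unique sequences are the contigs plus the singletons.
unique_sequences = contigs + singletons

# Sequence reads needed, on average, to discover one unique sequence.
reads_per_unique = total_reads / unique_sequences

# Share of unique sequences with a significant BLAST hit.
hit_fraction = blast_hits / unique_sequences

# Derived here for convenience; not stated in the abstract.
mean_length_bp = assembled_bp / unique_sequences

print(f"unique sequences:      {unique_sequences:,}")
print(f"reads per unique seq.: {reads_per_unique:.3f}")
print(f"BLAST hit fraction:    {hit_fraction:.1%}")
print(f"mean assembled length: {mean_length_bp:.1f} bp")
```

The first three computed values agree with the totals, ratio and percentage reported in the abstract.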

SUBMITTER: Lee BY 

PROVIDER: S-EPMC2874815 | biostudies-literature | 2010 Apr

REPOSITORIES: biostudies-literature


Publications

An EST resource for tilapia based on 17 normalized libraries and assembly of 116,899 sequence tags.

Bo-Young Lee, Aimee E. Howe, Matthew A. Conte, Helena D'Cotta, Elodie Pepey, Jean-Francois Baroiller, Federica di Palma, Karen L. Carleton, Thomas D. Kocher

BMC Genomics, 2010-04-30



Similar Datasets

| S-EPMC1895994 | biostudies-literature
| S-EPMC522874 | biostudies-literature
| S-EPMC3536561 | biostudies-literature
| S-EPMC3405963 | biostudies-literature
| S-EPMC5172415 | biostudies-literature
| S-EPMC3118787 | biostudies-literature
| S-EPMC2447738 | biostudies-other
| S-EPMC310898 | biostudies-literature
| S-EPMC2757623 | biostudies-literature
| S-EPMC116728 | biostudies-literature