Unknown

Dataset Information

0

Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L.


ABSTRACT:

Background

Cotton (Gossypium hirsutum L.) is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence.

Methodology/principal findings

In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves.

Conclusions/significance

These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species.

SUBMITTER: Lin M 

PROVIDER: S-EPMC3795732 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L.

Lin Min M   Lai Deyong D   Pang Chaoyou C   Fan Shuli S   Song Meizhen M   Yu Shuxun S  

PloS one 20131011 10


<h4>Background</h4>Cotton (Gossypium hirsutum L.) is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence.<h4>Methodology/principal findings</h4>In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during  ...[more]

Similar Datasets

| S-EPMC2923529 | biostudies-literature
| S-EPMC2568000 | biostudies-literature
| S-EPMC2608845 | biostudies-literature
| S-EPMC1444929 | biostudies-literature
| S-EPMC3709719 | biostudies-literature
| S-EPMC3118787 | biostudies-literature
| S-EPMC1950504 | biostudies-literature
| S-EPMC3311628 | biostudies-literature
| S-EPMC5054741 | biostudies-literature
| S-EPMC311136 | biostudies-literature