Unknown

Dataset Information

0

Fine organization of Bombyx mori fibroin heavy chain gene.


ABSTRACT: The complete sequence of the Bombyx mori fibroin gene has been determined by means of combining a shotgun sequencing strategy with physical map-based sequencing procedures. It consists of two exons (67 and 15 750 bp, respectively) and one intron (971 bp). The fibroin coding sequence presents a spectacular organization, with a highly repetitive and G-rich (approximately 45%) core flanked by non-repetitive 5' and 3' ends. This repetitive core is composed of alternate arrays of 12 repetitive and 11 amorphous domains. The sequences of the amorphous domains are evolutionarily conserved and the repetitive domains differ from each other in length by a variety of tandem repeats of subdomains of approximately 208 bp which are reminiscent of the repetitive nucleosome organization. A typical composition of a subdomain is a cluster of repetitive units, Ua, followed by a cluster of units, Ub, (with a Ua:Ub ratio of 2:1) flanked by conserved boundary elements at the 3' end. Moreover some repeats are also perfectly conserved at the peptide level indicating that the evolutionary pressure is not identical along the sequence. A tentative model for the constitution and evolution of this unusual gene is discussed.

SUBMITTER: Zhou CZ 

PROVIDER: S-EPMC102737 | biostudies-literature | 2000 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fine organization of Bombyx mori fibroin heavy chain gene.

Zhou C Z CZ   Confalonieri F F   Medina N N   Zivanovic Y Y   Esnault C C   Yang T T   Jacquet M M   Janin J J   Duguet M M   Perasso R R   Li Z G ZG  

Nucleic acids research 20000601 12


The complete sequence of the Bombyx mori fibroin gene has been determined by means of combining a shotgun sequencing strategy with physical map-based sequencing procedures. It consists of two exons (67 and 15 750 bp, respectively) and one intron (971 bp). The fibroin coding sequence presents a spectacular organization, with a highly repetitive and G-rich (approximately 45%) core flanked by non-repetitive 5' and 3' ends. This repetitive core is composed of alternate arrays of 12 repetitive and 11  ...[more]

Similar Datasets

| S-EPMC8231919 | biostudies-literature
| S-EPMC8467315 | biostudies-literature
| S-EPMC8675719 | biostudies-literature
| S-EPMC1271246 | biostudies-other
| S-EPMC7570510 | biostudies-literature
| S-EPMC3989216 | biostudies-literature
| S-EPMC8659740 | biostudies-literature
| S-EPMC4294524 | biostudies-literature
| S-EPMC8717555 | biostudies-literature
| S-EPMC11241164 | biostudies-literature