Unknown

Dataset Information

0

Repetitive sequence environment distinguishes housekeeping genes.


ABSTRACT: Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes while various longer (>400-bp) repetitive sequences ("repeats"), including Long Interspersed Nuclear Element-1 (LINE-1) elements, are excluded from these regions. We further show that isochore membership does not distinguish housekeeping genes from tissue-specific genes and that repetitive sequence environment distinguishes housekeeping genes from tissue-specific genes in every isochore. The distinct repetitive sequence environment, in combination with other previously published sequence properties of housekeeping genes, was used to develop a method of predicting housekeeping genes on the basis of DNA sequence alone. Using expression across tissue types as a measure of success, we demonstrate that repetitive sequence environment is by far the most important sequence feature identified to date for distinguishing housekeeping genes.

SUBMITTER: Eller CD 

PROVIDER: S-EPMC1857324 | biostudies-literature | 2007 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Repetitive sequence environment distinguishes housekeeping genes.

Eller C Daniel CD   Regelson Moira M   Merriman Barry B   Nelson Stan S   Horvath Steve S   Marahrens York Y  

Gene 20061005 1-2


Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes while various longer (>400-bp) repetitive sequences ("repeats"), including Long Interspersed Nuclear E  ...[more]

Similar Datasets

| S-EPMC2323216 | biostudies-literature
| S-EPMC2725479 | biostudies-literature
| S-EPMC8068995 | biostudies-literature
| S-EPMC4430495 | biostudies-literature
| S-EPMC9312424 | biostudies-literature
| S-EPMC3766922 | biostudies-literature
| S-EPMC96437 | biostudies-literature
| S-EPMC3441883 | biostudies-literature
| S-EPMC1976390 | biostudies-literature
| S-EPMC535203 | biostudies-literature