Unknown

Dataset Information

0

Methods for estimating human endogenous retrovirus activities from EST databases.


ABSTRACT:

Background

Human endogenous retroviruses (HERVs) are surviving traces of ancient retrovirus infections and now reside within the human DNA. Recently HERV expression has been detected in both normal tissues and diseased patients. However, the activities (expression levels) of individual HERV sequences are mostly unknown.

Results

We introduce a generative mixture model, based on Hidden Markov Models, for estimating the activities of the individual HERV sequences from EST (expressed sequence tag) databases. We use the model to estimate the relative activities of 181 HERVs. We also empirically justify a faster heuristic method for HERV activity estimation and use it to estimate the activities of 2450 HERVs. The majority of the HERV activities were previously unknown.

Conclusion

(i) Our methods estimate activity accurately based on experiments on simulated data. (ii) Our estimate on real data shows that 7% of the HERVs are active. The active ones are spread unevenly into HERV groups and relatively uniformly in terms of estimated age. HERVs with the retroviral env gene are more often active than HERVs without env. Few of the active HERVs have open reading frames for retroviral proteins.

SUBMITTER: Oja M 

PROVIDER: S-EPMC1892069 | biostudies-literature | 2007 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Methods for estimating human endogenous retrovirus activities from EST databases.

Oja Merja M   Peltonen Jaakko J   Blomberg Jonas J   Kaski Samuel S  

BMC bioinformatics 20070503


<h4>Background</h4>Human endogenous retroviruses (HERVs) are surviving traces of ancient retrovirus infections and now reside within the human DNA. Recently HERV expression has been detected in both normal tissues and diseased patients. However, the activities (expression levels) of individual HERV sequences are mostly unknown.<h4>Results</h4>We introduce a generative mixture model, based on Hidden Markov Models, for estimating the activities of the individual HERV sequences from EST (expressed  ...[more]

Similar Datasets

| S-EPMC136318 | biostudies-literature
| S-EPMC1781480 | biostudies-literature
| S-EPMC4136327 | biostudies-literature
| S-EPMC8122352 | biostudies-literature
| S-EPMC11319637 | biostudies-literature
| S-EPMC538696 | biostudies-literature
| S-EPMC6344353 | biostudies-literature
| S-EPMC6342650 | biostudies-literature
| S-EPMC8225122 | biostudies-literature
| S-EPMC5361811 | biostudies-literature