Unknown

Dataset Information

0

The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees.


ABSTRACT: BACKGROUND:There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In this paper, we present an approach to search for interaction effects among several variables using boosted regression trees. METHODS:We simulate a continuous outcome from real data on 27 environmental contaminants, some of which are correlated, and test the method's ability to uncover the simulated interactions. The simulated outcome contains one four-way interaction, one non-linear effect and one interaction between a continuous variable and a binary variable. Four scenarios reflecting different strengths of association are simulated. We illustrate the method using real data. RESULTS:The method succeeded in identifying the true interactions in all scenarios except where the association was weakest. Some spurious interactions were also found, however. The method was also capable to identify interactions in the real data set. CONCLUSIONS:We conclude that boosted regression trees can be used to uncover complex interaction effects in epidemiological studies.

SUBMITTER: Lampa E 

PROVIDER: S-EPMC4120739 | biostudies-literature | 2014 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees.

Lampa Erik E   Lind Lars L   Lind P Monica PM   Bornefalk-Hermansson Anna A  

Environmental health : a global access science source 20140704


<h4>Background</h4>There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In this paper, we present an approach to search for interaction effects among several variables using boosted regression trees.<h4>Methods</h4>We simulate a continuous outcome from real data on 27 e  ...[more]

Similar Datasets

| S-EPMC6688581 | biostudies-literature
| S-EPMC9302655 | biostudies-literature
| S-EPMC3633987 | biostudies-literature
| S-EPMC7319248 | biostudies-literature
| S-EPMC6811674 | biostudies-literature
| S-EPMC5673221 | biostudies-literature
| S-EPMC7286386 | biostudies-literature
| S-EPMC8674730 | biostudies-literature
| S-EPMC7763457 | biostudies-literature
| S-EPMC6531441 | biostudies-other