Unknown

Dataset Information

0

In silico prediction of chemical aquatic toxicity for marine crustaceans via machine learning.


ABSTRACT: Aquatic toxicity is a crucial endpoint for evaluating chemically adverse effects on ecosystems. Therefore, we developed in silico methods for the prediction of chemical aquatic toxicity in marine environment. At first, a diverse data set including different crustacean species was constructed. We then built local binary models using Mysidae data and global binary models using Mysidae, Palaemonidae, and Penaeidae data. Molecular fingerprints and descriptors were employed to represent chemical structures separately. All the models were built by six machine learning methods. The AUC (area under the receiver operating characteristic curve) values of the better local and global models were around 0.8 and 0.9 for the test sets, respectively. We also identified several chemicals with selective toxicity on different species. The analysis of selective toxicity would promote to design greener chemicals in a specific environment. Finally, to understand and interpret the models, we explored the relationships between chemical aquatic toxicity and the molecular descriptors. Our study would be helpful in gaining further insights into marine organisms, prediction of chemical aquatic toxicity and prioritization of environmental hazard assessment.

SUBMITTER: Liu L 

PROVIDER: S-EPMC6505403 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

<i>In silico</i> prediction of chemical aquatic toxicity for marine crustaceans <i>via</i> machine learning.

Liu Lin L   Yang Hongbin H   Cai Yingchun Y   Cao Qianqian Q   Sun Lixia L   Wang Zhuang Z   Li Weihua W   Liu Guixia G   Lee Philip W PW   Tang Yun Y  

Toxicology research 20190125 3


Aquatic toxicity is a crucial endpoint for evaluating chemically adverse effects on ecosystems. Therefore, we developed <i>in silico</i> methods for the prediction of chemical aquatic toxicity in marine environment. At first, a diverse data set including different crustacean species was constructed. We then built local binary models using <i>Mysidae</i> data and global binary models using <i>Mysidae</i>, <i>Palaemonidae</i>, and <i>Penaeidae</i> data. Molecular fingerprints and descriptors were  ...[more]

Similar Datasets

| S-EPMC6062408 | biostudies-literature
| S-EPMC7329181 | biostudies-literature
| S-EPMC8582201 | biostudies-literature
| S-EPMC6062245 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC8142502 | biostudies-literature
| S-EPMC10917336 | biostudies-literature
| S-EPMC10531479 | biostudies-literature
| S-EPMC10826801 | biostudies-literature
| S-EPMC4785608 | biostudies-literature