Concepedia

Publication | Open Access

Machine Learning Models for Estrogen Receptor Bioactivity and Endocrine Disruption Prediction

49

Citations

43

References

2020

Year

Abstract

The U.S. Environmental Protection Agency (EPA) periodically releases <i>in vitro</i> data across a variety of targets, including the estrogen receptor (ER). In 2015, the EPA used these data to construct mathematical models of ER agonist and antagonist pathways to prioritize chemicals for endocrine disruption testing. However, mathematical models require <i>in vitro</i> data prior to predicting estrogenic activity, but machine learning methods are capable of prospective prediction from the molecular structure alone. The current study describes the generation and evaluation of Bayesian machine learning models grouped by the EPA's ER agonist pathway model using multiple data types with proprietary software, Assay Central. External predictions with three test sets of <i>in vitro</i> and <i>in vivo</i> reference chemicals with agonist activity classifications were compared to previous mathematical model publications. Training data sets were subjected to additional machine learning algorithms and compared with rank normalized scores of internal five-fold cross-validation statistics. External predictions were found to be comparable or superior to previous studies published by the EPA. When assessing six additional algorithms for the training data sets, Assay Central performed similarly at a reduced computational cost. This study demonstrates that machine learning can prioritize chemicals for future <i>in vitro</i> and <i>in vivo</i> testing of ER agonism.

References

YearCitations

Page 1