Concepedia

Publication | Open Access

QSPR Models for Prediction of Aqueous Solubility: Exploring the Potency of Randić-type Indices

16

Citations

33

References

2020

Year

Abstract

The development of QSPR models to predict aqueous solubility (logS) is presented. A structurally diverse set of over 1600 compounds with experimentally determined solubility values (AqSolDB database) is used for building the data-driven models based on multiple linear regression (MLR) and artificial neural network (ANN) methods to predict aqueous solubility. Molecular structures are encoded by numerous structural descriptors, including the connectivity index developed by Randi in 1975, and many later derived variations. To evaluate the potency of Randi-like descriptors in the structure-property relationship, we developed models based on two sets of descriptors, first using only Randilike descriptors calculated with Dragon, and second using 17 commonly applied descriptors available in the AqSolDB database. All models were validated with external prediction sets, with the RMSE ranging from 0.8 to 1.1. Interestingly, the RMSE of predicted LogS values of models based only on the Randi-like descriptors were in average just 0.1 larger than the models with 17 descriptors preselected as suitable for modelling logS.

References

YearCitations

Page 1