In SAR and QSAR in Environmental Research

Tyrosinase is a key rate-limiting enzyme in melanin synthesis and is closely related to human pigmentation disorders. Tyrosinase inhibitors can down-regulate tyrosinase and thereby reduce melanin synthesis. In this work, we conducted a structure-activity relationship (SAR) study on 1097 diverse mushroom tyrosinase inhibitors. We applied five machine learning methods to develop 15 classification models. Model 5B, built with fully connected neural networks and ECFP4 fingerprints, achieved the highest prediction accuracy (91.36%) and a Matthews correlation coefficient (MCC) of 0.81 on the test set. The applicability domains (AD) of the classification models were defined by the dSTD-PRO method. Moreover, we clustered the 1097 inhibitors into eight subsets by K-Means to identify the inhibitors' structural features. In addition, 10 quantitative structure-activity relationship (QSAR) models were constructed with four machine learning methods based on 813 inhibitors. Model 6J, the best QSAR model, was developed with fully connected neural networks and 50 RDKit descriptors. It achieved a coefficient of determination (r²) of 0.770 and a root mean squared error (RMSE) of 0.482 on the test set. The AD of Model 6J was visualized with a Williams plot. The models built in this study can be obtained from the authors.
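For readers unfamiliar with the reported metrics, the following is a minimal sketch of how accuracy, MCC, RMSE, and a coefficient of determination are commonly computed. It is not the authors' code. The function names are hypothetical, and the r² shown is the 1 - SS_res/SS_tot form, which is an assumption, since the abstract does not state which definition was used.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of correct predictions from confusion-matrix counts."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (a common convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r_squared(y_true, y_pred):
    """Coefficient of determination, assumed here to be 1 - SS_res/SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical counts and values for illustration only, not data from the study.
    print(accuracy(50, 40, 5, 5), mcc(50, 40, 5, 5))
    print(rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0]))
    print(r_squared([1.0, 2.0, 3.0], [1.1, 1.9, 3.2]))
```

Unlike accuracy, MCC uses all four confusion-matrix cells, which makes it less sensitive to class imbalance between active and inactive compounds.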

Wu Y, Huo D, Chen G, Yan A


K-Means, Structure-activity relationship (SAR), machine learning, quantitative structure-activity relationship (QSAR), tyrosinase inhibitors