In BMC bioinformatics
BACKGROUND : A Virtual Screening algorithm has to adapt to the different stages of this process. Early screening needs to ensure that all bioactive compounds are ranked in the first positions despite of the number of false positives, while a second screening round is aimed at increasing the prediction accuracy.
RESULTS : A novel CNN architecture is presented to this aim, which predicts bioactivity of candidate compounds on CDK1 using a combination of molecular fingerprints as their vector representation, and has been trained suitably to achieve good results as regards both enrichment factor and accuracy in different screening modes (98.55% accuracy in active-only selection, and 98.88% in high precision discrimination).
CONCLUSION : The proposed architecture outperforms state-of-the-art ML approaches, and some interesting insights on molecular fingerprints are devised.
Mendolia Isabella, Contino Salvatore, Perricone Ugo, Ardizzone Edoardo, Pirrone Roberto
Bioactivity prediction, Deep learning, Drug design, Molecular fingerprints, Virtual screening