In Bioinformatics (Oxford, England)
MOTIVATION : Recent advances in the areas of bioinformatics and chemogenomics are poised to accelerate the discovery of small-molecule regulators of cell development. Combining large genomics and molecular data sources with powerful deep learning techniques has the potential to revolutionize predictive biology. In this study, we present Deep Compound Profiler (DeepCOP), a deep learning based model that can predict gene regulating effects of low-molecular weight compounds. This model can be used for direct identification of a drug candidate causing a desired gene expression response, without utilizing any information on its interactions with protein target(s).
RESULTS : In this study we successfully combined molecular fingerprint descriptors and gene descriptors (derived from GO terms) to train deep neural networks that predict differential gene regulation endpoints collected in LINCS database. We achieved 10-fold cross validation RAUC scores of and above 0.80, as well as enrichment factors of > 5. We validated our models using an external RNA-Seq dataset generated in-house that described the effect of three potent antiandrogens (with different modes of action) on gene expression in LNCaP prostate cancer cell line. The results of this pilot study demonstrate that deep learning models can effectively synergize molecular and genomic descriptors and can be used to screen for novel drug candidates with the desired effect on gene expression. We anticipate that such models can find a broad use in developing novel cancer therapeutics and can facilitate precision oncology efforts.
SUPPLEMENTARY INFORMATION : Supplementary data are available at Bioinformatics online.
Woo Godwin, Fernandez Michael, Hsing Michael, Lack Nathan A, Cavga Ayse Derya, Cherkasov Artem