In Journal of proteome research
In recent years machine learning has made extensive progress in modeling many aspects of mass spectrometry data. We brought together proteomics data generators, repository managers, and machine learning experts in a workshop with the goals to evaluate and explore machine learning applications for realistic modeling of data from multidimensional mass spectrometry-based proteomics analysis of any sample or organism. Following this sample-to-data roadmap helped identify knowledge gaps and define needs. Being able to generate bespoke and realistic synthetic data has legitimate and important uses in system suitability, method development, and algorithm benchmarking, while also posing critical ethical questions. The interdisciplinary nature of the workshop informed discussions of what is currently possible and future opportunities and challenges. In the following perspective we summarize these discussions in the hope of conveying our excitement about the potential of machine learning in proteomics and to inspire future research.
Neely Benjamin A, Dorfer Viktoria, Martens Lennart, Bludau Isabell, Bouwmeester Robbin, Degroeve Sven, Deutsch Eric W, Gessulat Siegfried, Käll Lukas, Palczynski Pawel, Payne Samuel H, Rehfeldt Tobias Greisager, Schmidt Tobias, Schwämmle Veit, Uszkoreit Julian, Vizcaíno Juan Antonio, Wilhelm Mathias, Palmblad Magnus
2023-Feb-06
artificial intelligence, deep learning, enzymatic digestion, ion mobility, liquid chromatography, machine learning, research integrity, synthetic data, tandem mass spectrometry