Determining the etiologic basis of the mutations that are responsible for cancer is one of the fundamental challenges in modern cancer research. Different mutational processes induce different types of DNA mutations, providing 'mutational signatures' that have led to key insights into cancer etiology. The most widely used signatures for assessing genomic data are based on unsupervised patterns that are then retrospectively correlated with certain features of cancer. We show here that supervised machine-learning techniques can identify signatures, called SuperSigs, that are more predictive than those currently available. Surprisingly, we found that aging yields different SuperSigs in different tissues, and the same is true for environmental exposures. We were able to discover SuperSigs associated with obesity, the most important lifestyle factor contributing to cancer in Western populations.
Afsari Bahman, Kuo Albert, Zhang YiFan, Li Lu, Lahouel Kamel, Danilova Ludmila, Favorov Alexander, Rosenquist Thomas A, Grollman Arthur P, Kinzler Ken W, Cope Leslie, Vogelstein Bert, Tomasetti Cristian
cancer biology, human