In Annals of translational medicine
Background : Multicenter medical research is becoming a new trend. However, interagency data privacy security protection has become a major bottleneck of multicenter research. Therefore, to overcome this privacy protection issue, the aim of the present study was to apply a self-developed privacy-preserving machine learning framework for researchers who can build models on medical data from multiple sources, while providing privacy protection for both sensitive data and the learned model.
Methods : Based on Arya, a novel privacy computing platform developed by Healink, we constructed a privacy-preserving federated learning (FL) model using the fully connected neural network with datasets from 2-3 individual medical institutions. In the dataset, 80% of records were used for joint modeling on acute myocardial infarction (AMI) diagnosis. Modeling efficacy was evaluated with the remaining 20% of records. As the control, 1,500 medical records from 1 medical institution were used for single-center modeling and efficacy evaluation. During the process, the original data were still kept in individual hospital without moving or transferring out of the hospitial. The diagnostic efficacy (sensitivity, positive predictive value, and accuracy) was evaluated.
Results : Our privacy-preserving FL model gives reliable AMI diagnostic efficacy. Three-center modeling (79% sensitivity, 88% positive predictive value, and 82.3% accuracy) and two-center modeling (77.8% or 77.6% sensitivity, 86.7% or 85.30% positive predictive value, and 81% or 79.7% accuracy) achieved relative high diagnostic efficacy; and single-center modeling achieved relative low diagnostic efficacy (76% sensitivity, 84.7% positive predictive value, and 79% accuracy).
Conclusions : The Arya privacy computing platform is efficient and practical for the FL model, which could promote multicenter medical research securely without sacrificing diagnostic efficacy.
Xu Jie, Zhang Yu, Yu Huamin, Lin Bo, Xiang Peng, Lin Te, Lu Huizhe, Zhang Guiying, Wang Dejian, Yuan Hong, Hu Bin, Jiang Jun
Arya privacy computing platform, acute myocardial infarction, federated learning, multicenter modeling