Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

In Advances in therapy

INTRODUCTION : A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of multiple hospitals.

METHODS : We constructed AI models (Bidirectional Encoder Representations from Transformers [BERT], Naïve Bayes, and Longformer) for tumor evaluation using the University of Miyazaki Hospital (UMH) database. This data included both structured and unstructured data from progress notes, radiology reports, and discharge summaries. The BERT model was applied to the Life Data Initiative (LDI) data set of six hospitals. Study outcomes included the performance of AI models and time to progression of disease (TTP) for each line of treatment based on the treatment response extracted by AI models.

RESULTS : For the UMH data set, the BERT model exhibited higher precision accuracy compared to the Naïve Bayes or the Longformer models, respectively (precision [0.42 vs. 0.47 or 0.22], recall [0.63 vs. 0.46 or 0.33] and F1 scores [0.50 vs. 0.46 or 0.27]). When this BERT model was applied to LDI data, prediction accuracy remained quite similar. The Kaplan-Meier plots of TTP (months) showed similar trends for the first (median 14.9 [95% confidence interval 11.5, 21.1] and 16.8 [12.6, 21.8]), the second (7.8 [6.7, 10.7] and 7.8 [6.7, 10.7]), and the later lines of treatment for the predicted data by the BERT model and the manually curated data.

CONCLUSION : We developed AI models to extract treatment responses in patients with lung cancer using a large EHR database; however, the model requires further improvement.

Araki Kenji, Matsumoto Nobuhiro, Togo Kanae, Yonemoto Naohiro, Ohki Emiko, Xu Linghua, Hasegawa Yoshiyuki, Satoh Daisuke, Takemoto Ryota, Miyazaki Taiga

2022-Dec-22

Artificial intelligence, BERT, Electronic health records database, Lung cancer, Real-world data, Retrospective study