Doctor Penguin

Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

General

General

Taming hyperparameter tuning in continuous normalizing flows using the JKO scheme.

In Scientific reports ; h5-index 158.0
A normalizing flow (NF) is a mapping that transforms a chosen probability distribution to a normal distribution. Such flows are a common technique used for data generation and density estimation in machine learning and data science. The density estimate obtained with a NF requires a change of variables formula that involves the computation of the Jacobian determinant of the NF transformation. In order to tractably compute this determinant, continuous normalizing flows (CNF) estimate the mapping and its Jacobian determinant using a neural ODE. Optimal transport (OT) theory has been successfully used to assist in finding CNFs by formulating them as OT problems with a soft penalty for enforcing the standard normal distribution as a target measure. A drawback of OT-based CNFs is the addition of a hyperparameter, [Formula: see text], that controls the strength of the soft penalty and requires significant tuning. We present JKO-Flow, an algorithm to solve OT-based CNF without the need of tuning [Formula: see text]. This is achieved by integrating the OT CNF framework into a Wasserstein gradient flow framework, also known as the JKO scheme. Instead of tuning [Formula: see text], we repeatedly solve the optimization problem for a fixed [Formula: see text] effectively performing a JKO update with a time-step [Formula: see text]. Hence we obtain a "divide and conquer" algorithm by repeatedly solving simpler problems instead of solving a potentially harder problem with large [Formula: see text].
Vidal Alexander, Wu Fung Samy, Tenorio Luis, Osher Stanley, Nurbekyan Levon

2023-Mar-18

General

General

DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4

ArXiv Preprint
The digitization of healthcare has facilitated the sharing and re-using of medical data but has also raised concerns about confidentiality and privacy. HIPAA (Health Insurance Portability and Accountability Act) mandates removing re-identifying information before the dissemination of medical records. Thus, effective and efficient solutions for de-identifying medical data, especially those in free-text forms, are highly needed. While various computer-assisted de-identification methods, including both rule-based and learning-based, have been developed and used in prior practice, such solutions still lack generalizability or need to be fine-tuned according to different scenarios, significantly imposing restrictions in wider use. The advancement of large language models (LLM), such as ChatGPT and GPT-4, have shown great potential in processing text data in the medical domain with zero-shot in-context learning, especially in the task of privacy protection, as these models can identify confidential information by their powerful named entity recognition (NER) capability. In this work, we developed a novel GPT4-enabled de-identification framework ("DeID-GPT") to automatically identify and remove the identifying information. Compared to existing commonly used medical text data de-identification methods, our developed DeID-GPT showed the highest accuracy and remarkable reliability in masking private information from the unstructured medical text while preserving the original structure and meaning of the text. This study is one of the earliest to utilize ChatGPT and GPT-4 for medical text data processing and de-identification, which provides insights for further research and solution development on the use of LLMs such as ChatGPT/GPT-4 in healthcare. Codes and benchmarking data information are available at https://github.com/yhydhx/ChatGPT-API.
Zhengliang Liu, Xiaowei Yu, Lu Zhang, Zihao Wu, Chao Cao, Haixing Dai, Lin Zhao, Wei Liu, Dinggang Shen, Quanzheng Li, Tianming Liu, Dajiang Zhu, Xiang Li

2023-03-20

Radiology

Radiology

Multiparameter prediction model of immune checkpoint inhibitors combined with chemotherapy for non-small cell lung cancer based on support vector machine learning.

In Scientific reports ; h5-index 158.0
The reliable predictive markers to identify which patients with advanced non-small cell lung cancer tumors (NSCLC) will achieve durable clinical benefit (DCB) for chemo-immunotherapy are needed. In this retrospective study, we collected radiomics and clinical signatures from 94 patients with advanced NSCLC treated with anti-PD-1/PD-L1 combined with chemotherapy from January 1, 2018 to May 31, 2022. Radiomics variables were extracted from pretreatment CT and selected by Spearman correlation coefficients and clinical features by Logistics regression analysis. We performed effective diagnostic algorithms principal components analysis (PCA) and support vector machine (SVM) to develop an early classification model among DCB and non-durable benefit (NDB) groups. A total of 26 radiomics features and 6 clinical features were selected, and then principal component analysis was used to obtain 6 principal components for SVM building. RC-SVM achieved prediction accuracy with AUC of 0.91 (95% CI 0.87-0.94) in the training set, 0.73 (95% CI 0.61-0.85) in the cross-validation set, 0.84 (95% CI 0.80-0.89) in the external validation set. The new method of RC-SVM model based on radiomics-clinical signatures provides a significant additive value on response prediction in patients with NSCLC preceding chemo-immunotherapy.
Zhou Zihan, Guo Wenjie, Liu Dingqi, Micha Jose Ramon Nsue, Song Yue, Han Shuhua

2023-Mar-18

General

General

Uncertainty-aware deep learning for digital twin-driven monitoring: Application to fault detection in power lines

ArXiv Preprint
Deep neural networks (DNNs) are often coupled with physics-based models or data-driven surrogate models to perform fault detection and health monitoring of systems in the low data regime. These models serve as digital twins to generate large quantities of data to train DNNs which would otherwise be difficult to obtain from the real-life system. However, such models can exhibit parametric uncertainty that propagates to the generated data. In addition, DNNs exhibit uncertainty in the parameters learnt during training. In such a scenario, the performance of the DNN model will be influenced by the uncertainty in the physics-based model as well as the parameters of the DNN. In this article, we quantify the impact of both these sources of uncertainty on the performance of the DNN. We perform explicit propagation of uncertainty in input data through all layers of the DNN, as well as implicit prediction of output uncertainty to capture the former. Furthermore, we adopt Monte Carlo dropout to capture uncertainty in DNN parameters. We demonstrate the approach for fault detection of power lines with a physics-based model, two types of input data and three different neural network architectures. We compare the performance of such uncertainty-aware probabilistic models with their deterministic counterparts. The results show that the probabilistic models provide important information regarding the confidence of predictions, while also delivering an improvement in performance over deterministic models.
Laya Das, Blazhe Gjorgiev, Giovanni Sansavini

2023-03-20

Public Health

Public Health

Development and validation of a simple machine learning tool to predict mortality in leptospirosis.

In Scientific reports ; h5-index 158.0
Predicting risk factors for death in leptospirosis is challenging, and identifying high-risk patients is crucial as it might expedite the start of life-saving supportive care. Admission data of 295 leptospirosis patients were enrolled, and a machine-learning approach was used to fit models in a derivation cohort. The comparison of accuracy metrics was performed with two previous models-SPIRO score and quick SOFA score. A Lasso regression analysis was the selected model, demonstrating the best accuracy to predict mortality in leptospirosis [area under the curve (AUC-ROC) = 0.776]. A score-based prediction was carried out with the coefficients of this model and named LeptoScore. Then, to simplify the predictive tool, a new score was built by attributing points to the predictors with importance values higher than 1. The simplified score, named QuickLepto, has five variables (age > 40 years; lethargy; pulmonary symptom; mean arterial pressure < 80 mmHg and hematocrit < 30%) and good predictive accuracy (AUC-ROC = 0.788). LeptoScore and QuickLepto had better accuracy to predict mortality in patients with leptospirosis when compared to SPIRO score (AUC-ROC = 0.500) and quick SOFA score (AUC-ROC = 0.782). The main result is a new scoring system, the QuickLepto, that is a simple and useful tool to predict death in leptospirosis patients at hospital admission.
Galdino Gabriela Studart, de Sandes-Freitas Tainá Veras, de Andrade Luis Gustavo Modelli, Adamian Caio Manuel Caetano, Meneses Gdayllon Cavalcante, da Silva Junior Geraldo Bezerra, de Francesco Daher Elizabeth

2023-Mar-18

General

General

k-SALSA: k-anonymous synthetic averaging of retinal images via local style alignment

ArXiv Preprint
The application of modern machine learning to retinal image analyses offers valuable insights into a broad range of human health conditions beyond ophthalmic diseases. Additionally, data sharing is key to fully realizing the potential of machine learning models by providing a rich and diverse collection of training data. However, the personally-identifying nature of retinal images, encompassing the unique vascular structure of each individual, often prevents this data from being shared openly. While prior works have explored image de-identification strategies based on synthetic averaging of images in other domains (e.g. facial images), existing techniques face difficulty in preserving both privacy and clinical utility in retinal images, as we demonstrate in our work. We therefore introduce k-SALSA, a generative adversarial network (GAN)-based framework for synthesizing retinal fundus images that summarize a given private dataset while satisfying the privacy notion of k-anonymity. k-SALSA brings together state-of-the-art techniques for training and inverting GANs to achieve practical performance on retinal images. Furthermore, k-SALSA leverages a new technique, called local style alignment, to generate a synthetic average that maximizes the retention of fine-grain visual patterns in the source images, thus improving the clinical utility of the generated images. On two benchmark datasets of diabetic retinopathy (EyePACS and APTOS), we demonstrate our improvement upon existing methods with respect to image fidelity, classification performance, and mitigation of membership inference attacks. Our work represents a step toward broader sharing of retinal images for scientific collaboration. Code is available at https://github.com/hcholab/k-salsa.
Minkyu Jeon, Hyeonjin Park, Hyunwoo J. Kim, Michael Morley, Hyunghoon Cho

2023-03-20