Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

Pathology Pathology

Paving New Roads Using Allium sativum as a Repurposed Drug and Analyzing its Antiviral Action Using Artificial Intelligence Technology.

In Iranian journal of pharmaceutical research : IJPR

CONTEXT : The whole universe is facing a coronavirus catastrophe, and prompt treatment for the health crisis is primarily significant. The primary way to improve health conditions in this battle is to boost our immunity and alter our diet patterns. A common bulb veggie used to flavor cuisine is garlic. Compounds in the plant that are physiologically active are present, contributing to its pharmacological characteristics. Among several food items with nutritional value and immunity improvement, garlic stood predominant and more resourceful natural antibiotic with a broad spectrum of antiviral potency against diverse viruses. However, earlier reports have depicted its efficacy in the treatment of a variety of viral illnesses. Nonetheless, there is no information on its antiviral activities and underlying molecular mechanisms.

OBJECTIVES : The bioactive compounds in garlic include organosulfur (allicin and alliin) and flavonoid (quercetin) compounds. These compounds have shown immunomodulatory effects and inhibited attachment of coronavirus to the angiotensin-converting enzyme 2 (ACE2) receptor and the Mpro of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). Further, we have discussed the contradictory impacts of garlic used as a preventive measure against the novel coronavirus.

METHOD : The GC/MS analysis revealed 18 active chemicals, including 17 organosulfur compounds in garlic. Using the molecular docking technique, we report for the first time the inhibitory effect of the under-consideration compounds on the host receptor ACE2 protein in the human body, providing a crucial foundation for understanding individual compound coronavirus resistance on the main protease protein of SARS-CoV-2. Allyl disulfide and allyl trisulfide, which make up the majority of the compounds in garlic, exhibit the most potent activity.

RESULTS : Conventional medicine has proven its efficiency from ancient times. Currently, our article's prime spotlight was on the activity of Allium sativum on the relegation of viral load and further highlighted artificial intelligence technology to study the attachment of the allicin compound to the SARS-CoV-2 receptor to reveal its efficacy.

CONCLUSIONS : The COVID-19 pandemic has triggered interest among researchers to conduct future research on molecular docking with clinical trials before releasing salutary remedies against the deadly malady.

Atoum Manar Fayiz, Padma Kanchi Ravi, Don Kanchi Ravi

2022-Dec

Allium sativum, Flavonoid, Immunomodulatory, SARS-CoV-2

General General

A Cross-institutional Evaluation on Breast Cancer Phenotyping NLP Algorithms on Electronic Health Records

ArXiv Preprint

Objective: The generalizability of clinical large language models is usually ignored during the model development process. This study evaluated the generalizability of BERT-based clinical NLP models across different clinical settings through a breast cancer phenotype extraction task. Materials and Methods: Two clinical corpora of breast cancer patients were collected from the electronic health records from the University of Minnesota and the Mayo Clinic, and annotated following the same guideline. We developed three types of NLP models (i.e., conditional random field, bi-directional long short-term memory and CancerBERT) to extract cancer phenotypes from clinical texts. The models were evaluated for their generalizability on different test sets with different learning strategies (model transfer vs. locally trained). The entity coverage score was assessed with their association with the model performances. Results: We manually annotated 200 and 161 clinical documents at UMN and MC, respectively. The corpora of the two institutes were found to have higher similarity between the target entities than the overall corpora. The CancerBERT models obtained the best performances among the independent test sets from two clinical institutes and the permutation test set. The CancerBERT model developed in one institute and further fine-tuned in another institute achieved reasonable performance compared to the model developed on local data (micro-F1: 0.925 vs 0.932). Conclusions: The results indicate the CancerBERT model has the best learning ability and generalizability among the three types of clinical NLP models. The generalizability of the models was found to be correlated with the similarity of the target entities between the corpora.

Sicheng Zhou, Nan Wang, Liwei Wang, Ju Sun, Anne Blaes, Hongfang Liu, Rui Zhang

2023-03-15

General General

MAHOMES II: A webserver for predicting if a metal binding site is enzymatic.

In Protein science : a publication of the Protein Society

Recent advances have enabled high-quality computationally generated structures for proteins with no solved crystal structures. However, protein function data remains largely limited to experimental methods and homology mapping. Since structure determines function, it is natural that methods capable of using computationally generated structures for functional annotations need to be advanced. Our laboratory recently developed a method to distinguish between metalloenzyme and non-enzyme sites. Here we report improvements to this method by upgrading our physicochemical features to alleviate the need for structures with sub-angstrom precision and using machine learning to reduce training data labeling error. Our improved classifier identifies protein bound metal sites as enzymatic or non-enzymatic with 94% precision and 92% recall. We demonstrate that both adjustments increased predictive performance and reliability on sites with sub-angstrom variations. We constructed a set of predicted metalloprotein structures with no solved crystal structures and no detectable homology to our training data. Our model had an accuracy of 90-97.5% depending on the quality of the predicted structures included in our test. Finally, we found the physicochemical trends that drove this model's successful performance were local protein density, second shell ionizable residue burial, and the pocket's accessibility to the site. We anticipate that our model's ability to correctly identify catalytic metal sites could enable identification of new enzymatic mechanisms and improve de novo metalloenzyme design success rates. This article is protected by copyright. All rights reserved.

Feehan Ryan, Copeland Matthew, Franklin Meghan W, Slusky Joanna S G

2023-Mar-14

Enzymes, Machine Learning, Metalloenzymes, Metalloproteins

Radiology Radiology

Early Diagnosis of Pancreatic Cancer: Clinical Premonitions, Timely Precursor Detection and Increased Curative-Intent Surgery.

In Cancer control : journal of the Moffitt Cancer Center

BACKGROUND : The overall poor prognosis in pancreatic cancer is related to late clinical detection. Early diagnosis remains a considerable challenge in pancreatic cancer. Unfortunately, the onset of clinical symptoms in patients usually indicate advanced disease or presence of metastasis.

ANALYSIS AND RESULTS : Currently, there are no designated diagnostic or screening tests for pancreatic cancer in clinical use. Thus, identifying risk groups, preclinical risk factors or surveillance strategies to facilitate early detection is a target for ongoing research. Hereditary genetic syndromes are a obvious, but small group at risk, and warrants close surveillance as suggested by society guidelines. Screening for pancreatic cancer in asymptomatic individuals is currently associated with the risk of false positive tests and, thus, risk of harms that outweigh benefits. The promise of cancer biomarkers and use of 'omics' technology (genomic, transcriptomics, metabolomics etc.) has yet to see a clinical breakthrough. Several proposed biomarker studies for early cancer detection lack external validation or, when externally validated, have shown considerably lower accuracy than in the original data. Biopsies or tissues are often taken at the time of diagnosis in research studies, hence invalidating the value of a time-dependent lag of the biomarker to detect a pre-clinical, asymptomatic yet operable cancer. New technologies will be essential for early diagnosis, with emerging data from image-based radiomics approaches, artificial intelligence and machine learning suggesting avenues for improved detection.

CONCLUSIONS : Early detection may come from analytics of various body fluids (eg 'liquid biopsies' from blood or urine). In this review we present some the technological platforms that are explored for their ability to detect pancreatic cancer, some of which may eventually change the prospects and outcomes of patients with pancreatic cancer.

Søreide Kjetil, Ismail Warsan, Roalsø Marcus, Ghotbi Jacob, Zaharia Claudia

2023

biomarker, curative surgery, diagnosis, early detection, early diagnosis, liquid biopsy, prevention, radiology, screening

General General

Deep Learning Model for Efficient Protein-Ligand Docking with Implicit Side-Chain Flexibility.

In Journal of chemical information and modeling

Protein-ligand docking is an essential tool in structure-based drug design with applications ranging from virtual high-throughput screening to pose prediction for lead optimization. Most docking programs for pose prediction are optimized for redocking to an existing cocrystallized protein structure, ignoring protein flexibility. In real-world drug design applications, however, protein flexibility is an essential feature of the ligand-binding process. Flexible protein-ligand docking still remains a significant challenge to computational drug design. To target this challenge, we present a deep learning (DL) model for flexible protein-ligand docking based on the prediction of an intermolecular Euclidean distance matrix (EDM), making the typical use of iterative search algorithms obsolete. The model was trained on a large-scale data set of protein-ligand complexes and evaluated on independent test sets. Our model generates high quality poses for a diverse set of protein and ligand structures and outperforms comparable docking methods.

Masters Matthew R, Mahmoud Amr H, Wei Yao, Lill Markus A

2023-Mar-14

General General

Integrated gene profiling of fine-needle aspiration sample improves lymph node metastasis risk stratification for thyroid cancer.

In Cancer medicine

BACKGROUND : Lymph node metastasis risk stratification is crucial for the surgical decision-making of thyroid cancer. This study investigated whether the integrated gene profiling (combining expression, SNV, fusion) of Fine-Needle Aspiration (FNA) samples can improve the prediction of lymph node metastasis in patients with papillary thyroid cancer.

METHODS : In this retrospective cohort study, patients with papillary thyroid cancer who went through thyroidectomy and central lymph node dissection were included. Multi-omics data of FNA samples were assessed by an integrated array. To predict lymph node metastasis, we built models using gene expressions or mutations (SNV and fusion) only and an Integrated Risk Stratification (IRS) model combining genetic and clinical information. Blinded histopathology served as the reference standard. ROC curve and decision curve analysis was applied to evaluate the predictive models.

RESULTS : One hundred and thirty two patients with pathologically confirmed papillary thyroid cancer were included between 2016-2017. The IRS model demonstrated greater performance [AUC = 0.87 (0.80-0.94)] than either expression classifier [AUC = 0.67 (0.61-0.74)], mutation classifier [AUC = 0.61 (0.55-0.67)] or TIRADS score [AUC = 0.68 (0.62-0.74)] with statistical significance (p < 0.001), and the IRS model had similar predictive performance in large nodule [>1 cm, AUC = 0.88 (0.79-0.97)] and small nodule [≤1 cm, AUC = 0.84 (0.74-0.93)] subgroups. The genetic risk factor showed independent predictive value (OR = 10.3, 95% CI:1.1-105.3) of lymph node metastasis in addition to the preoperative clinical information, including TIRADS grade, age, and nodule size.

CONCLUSION : The integrated gene profiling of FNA samples and the IRS model developed by the machine-learning method significantly improve the risk stratification of thyroid cancer, thus helping make wise decisions and reducing unnecessary extensive surgeries.

Zhang Weituo, Yun Xinwei, Xu Tianyu, Wang Xiaoqing, Li Qiang, Zhang Tiantian, Xie Li, Wang Suna, Li Dapeng, Wei Xi, Yu Yang, Qian Biyun

2023-Mar-14

biomarker, fine-needle aspiration, machine learning, multi-omics, risk stratification, thyroid cancer