In Journal of pathology informatics ; h5-index 23.0
Exa-scale volumes of medical data have been produced for decades. In most cases, the diagnosis is reported in free text, encoding medical knowledge that is still largely unexploited. In order to allow decoding medical knowledge included in reports, we propose an unsupervised knowledge extraction system combining a rule-based expert system with pre-trained Machine Learning (ML) models, namely the Semantic Knowledge Extractor Tool (SKET). Combining rule-based techniques and pre-trained ML models provides high accuracy results for knowledge extraction. This work demonstrates the viability of unsupervised Natural Language Processing (NLP) techniques to extract critical information from cancer reports, opening opportunities such as data mining for knowledge extraction purposes, precision medicine applications, structured report creation, and multimodal learning. SKET is a practical and unsupervised approach to extracting knowledge from pathology reports, which opens up unprecedented opportunities to exploit textual and multimodal medical information in clinical practice. We also propose SKET eXplained (SKET X), a web-based system providing visual explanations about the algorithmic decisions taken by SKET. SKET X is designed/developed to support pathologists and domain experts in understanding SKET predictions, possibly driving further improvements to the system.
Marchesin Stefano, Giachelle Fabio, Marini Niccolò, Atzori Manfredo, Boytcheva Svetla, Buttafuoco Genziana, Ciompi Francesco, Di Nunzio Giorgio Maria, Fraggetta Filippo, Irrera Ornella, Müller Henning, Primov Todor, Vatrano Simona, Silvello Gianmaria
Clinical practice, Digital pathology, Expert systems, Explainable AI, Knowledge extraction, Machine learning