Deformable phrase level attention: A flexible approach for improving AI based medical coding
Journal Article
·
· Artificial Intelligence in Medicine
- Univ. of Tennessee, Knoxville, TN (United States). Bredesen Center for Interdisciplinary Research and Graduate Education; Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Thomson Reuters, Eagan, MN (United States)
- Elsevier, Philadelphia, PA (United States)
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Objective: Improving the AI-driven automated medical encoding of clinical text plays a vital role in gathering information on the occurrence of diseases to improve population-level health. This work presents a novel attention mechanism designed to enhance text classification models and ensure appropriate classification of medical concepts in unstructured electronic health records. Materials and Methods: We developed a deformable, phrase-level attention mechanism to identify important lexical word-level and contextual phrase-level information from clinical text documents. We evaluated conventional and transformer-based deep learning models that we extended with our attention mechanism on the extraction of critical cancer information (e.g., site, subsite, laterality, histology, behavior) from 629,908 electronic pathology reports and on the automated medical encoding of 52,722 hospital discharge summaries. Results: Transformer-based models with the deformable, phrase-level attention mechanism achieved the best performance on the extraction of critical cancer information from pathology reports. Conventional- and transformer-based models show similar or better performance than their baseline counterparts on the automated medical encoding of clinical documents. Discussion: The addition of phrase-level information allowed models extended with our proposed method to outperform standard word-level attention. Our method showed favorable properties for the real-world application in terms of model robustness and phenotyping. These results indicate that our method is promising for automated data harmonization for common data models. Conclusion: This work proposes a novel deformable, phrase-level attention mechanism that enhances text classification models in the extraction of medical concepts from clinical text documents. We demonstrate strong performances on two clinical text datasets and showcase real-world deployability of our method.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- National Institutes of Health (NIH); USDOE
- Grant/Contract Number:
- AC02-06CH11357; AC05-00OR22725; AC52-06NA25396; AC52-07NA27344
- OSTI ID:
- 3020940
- Journal Information:
- Artificial Intelligence in Medicine, Journal Name: Artificial Intelligence in Medicine Vol. 171; ISSN 0933-3657
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types
FrESCO: Framework for Exploring Scalable Computational Oncology
Journal Article
·
Sun Nov 21 19:00:00 EST 2021
· Journal of Biomedical Informatics
·
OSTI ID:1884003
FrESCO: Framework for Exploring Scalable Computational Oncology
Journal Article
·
Sun Sep 10 20:00:00 EDT 2023
· Journal of Open Source Software
·
OSTI ID:2000393