Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Improving Performance of Clinical Text Classification Models with Attention Mechanisms and Rationales
Details

Improving Performance of Clinical Text Classification Models with Attention Mechanisms and Rationales

Date Issued
December 1, 2024
Author(s)
Metzner, Christoph
Advisor(s)
Heidi A. Hanson
Additional Advisor(s)
Shang Gao
Drahomira Herrmannova
Russell Zaretzki
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/19583
Abstract

AI is revolutionizing the technological landscape in medicine. A key application is the AI-driven summarization of clinical text, which facilitates the harmonization and curation of clinical data elements for common data models leading to improved understanding of population-level health. Population-level health is derived from aggregating patient-level information stored in unstructured electronic health records, often in the form of free-text clinical notes. As clinical text documents are information dense and written in highly complex clinical language, a model’s ability to discern signal from noise becomes exceedingly more crucial. To enable models to identify relevant information in text documents, previous research has shown attention mechanisms and non-medical human-based rationales to be effective. Building on this foundation, this dissertation evaluated methods to optimize attention mechanisms and to effectively use human-based clinical rationales as additional supervision to improve the performance and interpretability of clinical text classification models. In particular, this dissertation shows the following: (i) the effective utilization of the reference information—initialization with external text code descriptions or encoding of code hierarchy of medical coding systems—contained by an attention mechanism’s query matrix can improve model performance; (ii) extending an atten- tion mechanism’s receptive field with a flexible context window at the phrase-level leads to improved understanding of local linguistic information in clinical text; and (iii) utilizing human-based clinical rationales as additional supplementary training data can improve model performance and positively impacts model interpretability.

Subjects

Attention Mechanisms

text classification

rationales

natural language proc...

cancer pathology repo...

deep learning

Disciplines
Artificial Intelligence and Robotics
Data Science
Degree
Doctor of Philosophy
Major
Data Science and Engineering
File(s)
Thumbnail Image
Name

Dissertation_Metzner_absolute_final_draft_20241122.pdf

Size

3.04 MB

Format

Adobe PDF

Checksum (MD5)

3e6c8a770471322b4028260e6568a297

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify