School of Information Sciences -- Faculty Publications and Other Works

Source Publication (e.g., journal title)

Proceedings of the International Conference on Dublin Core Metadata Applications

Author ORCID Identifier

https://orcid.org/0000-0002-2448-3495

Document Type

Conference Proceeding

Publication Date

2022

Abstract

Collections of digitized, historical images serve as rich primary sources for digital humanities research, though access to these resources has been hindered by inadequate subject metadata. In this study, researchers explored the feasibility of performing subject analysis for a collection of historical images of persons through an automated procedure. Building on previous work that developed a faceted system for representing the identities of persons depicted in 19th century visual images, the present work attempted to automate the process of person and facet parsing for images from the A.S. Williams III Collection at the University of Alabama. A case-based model was built and used to analyze image titles. Compared to a manual control process, the automated model achieved a 95% success rate in parsing persons and an 85% success rate in parsing facets. Errors in parsing were more likely to occur for images of multiple persons, as well as those labeled with incomplete or uncertain names. Findings offer further support for faceted analysis of personal identity in historical materials, and reveal the potentials of automated, text-based methods of enhancing subject access for large visual image collections.

Submission Type

Pre-print

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Share

COinS