Masters Theses

Date of Award

12-2016

Degree Type

Thesis

Degree Name

Master of Science

Major

Computer Science

Major Professor

Mark E. Dean

Committee Members

Bradley T. Vander Zanden, Chad A. Steed

Abstract

The University of Tennessee, Knoxville (UTK) Library possesses thousands of unlabeled gray-scale photographs from the Smoky Mountains circa the 1920s - 1940s. Their current method of identifying and labeling attributes of the photographs is to do so manually. This is problematic both because of the scale of the collection as well as the reliance on an individual's limited knowledge of the area's numerous landmarks.

In the past few years, similar dilemmas have been tackled via an approach known as crowd computing. Some examples include Floating Forests, in which users are asked to identify and mark kelp forests in satellite images, and Ancient Lives, which enlists users help in transcribing 2000-year-old manuscripts that Oxford University researchers had struggled to efficiently translate for over a century.

For this particular problem, we propose releasing the image collections to the public through a web application. The application would target outdoor enthusiasts, conservationists, or professionals such as geologists, rangers, or historians who are familiar with the region and would find interest in helping to label the more recognizable photos. Users would "tag" landmarks using a hierarchically sorted data set of landmark names accessible via an incremental search.

With sufficient participation, the image collection could be efficiently categorized and labeled beyond what is currently feasible using the librarys limited number of personnel. Furthermore, this application could easily be adapted to categorize other unlabeled image collections if provided with the proper data set for tagging.

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Share

COinS