Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Deep Machine Learning with Spatio-Temporal Inference
Details

Deep Machine Learning with Spatio-Temporal Inference

Date Issued
May 1, 2012
Author(s)
Karnowski, Thomas Paul
Advisor(s)
Itamar Arel
Additional Advisor(s)
Hairong Qi, Xiaobeng Feng, Husheng Li, Michael J. Roberts
Abstract

Deep Machine Learning (DML) refers to methods which utilize hierarchies of more than one or two layers of computational elements to achieve learning. DML may draw upon biomemetic models, or may be simply biologically-inspired. Regardless, these architectures seek to employ hierarchical processing as means of mimicking the ability of the human brain to process a myriad of sensory data and make meaningful decisions based on this data. In this dissertation we present a novel DML architecture which is biologically-inspired in that (1) all processing is performed hierarchically; (2) all processing units are identical; and (3) processing captures both spatial and temporal dependencies in the observations to organize and extract features suitable for supervised learning. We call this architecture Deep Spatio-Temporal Inference Network (DeSTIN). In this framework, patterns observed in pixel data at the lowest layer of the hierarchy are organized and fit to generalizations using decomposition algorithms. Subsequent spatial layers draw upon previous layers, their own temporal observations and beliefs, and the observations and beliefs of parent nodes to extract features suitable for supervised learning using standard classifiers such as feedforward neural networks. Hence, DeSTIN is viewed as an unsupervised feature extraction scheme in the sense that rather than relying on human engineering to determine features for a particular problem, DeSTIN naturally constructs features of interest by representing salient regularities in the patterns observed. Detailed discussion and analysis of the DeSTIN framework is provided, including focus on its key components of generalization through online clustering and temporal inference. We present a variety of implementation details, including static and dynamic learning formulations, and function approximation methods. Results on standardized datasets of handwritten digits as well as face and optic nerve detection are presented, illustrating the efficacy of the proposed approach.

Subjects

Machine Learning

Image Processing

Disciplines
Other Computer Engineering
Degree
Doctor of Philosophy
Major
Electrical Engineering
Link to full text
ftp://ftp.ornl.gov/outgoing/uP27nb9S2n7Z/ThomasKarnowskiDissertationDraft.pdf
Embargo Date
January 18, 2012
File(s)
Thumbnail Image
Name

KarnowskiDissertationFINALUpload.pdf

Size

4.7 MB

Format

Adobe PDF

Checksum (MD5)

c193fc9311cc3bc631669cc26bc6cb6f

Learn more about how TRACE supports reserach impact and open access here.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify