Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Conditional Computation in Deep and Recurrent Neural Networks
Details

Conditional Computation in Deep and Recurrent Neural Networks

Date Issued
August 1, 2016
Author(s)
Davis, Andrew Scott  
Advisor(s)
Itamar Arel
Additional Advisor(s)
Jamie Coble, Jens Gregor, Hairong Qi
Abstract

Recently, deep learning models such as convolutional and recurrent neural networks have displaced state-of-the-art techniques in a variety of application domains. While the computationally heavy process of training is usually conducted on powerful graphics processing units (GPUs) distributed in large computing clusters, the resulting models can still be somewhat heavy, making deployment in resource- constrained environments potentially problematic. In this work, we build upon the idea of conditional computation, where the model is given the capability to learn how to avoid computing parts of the graph. This allows for models where the number of parameters (and in a sense, the model’s capacity to learn) can grow at a faster rate than the computation that is required to propagate information through the graph. In this work, we apply conditional computation to feed forward and recurrent neural networks. In the feed forward case, we demonstrate a technique that trades off accuracy for potential computational benefits, and in the recurrent case, we demonstrate techniques that yield practical speed benefits on a language modeling task. Given the rapidly expanding domain of problems where deep learning proves useful, the work presented here can help enable the future scalability requirements of deploying trained models.

Subjects

neural networks

conditional computati...

deep learning

Disciplines
Artificial Intelligence and Robotics
Degree
Doctor of Philosophy
Major
Computer Engineering
Embargo Date
January 1, 2011
File(s)
Thumbnail Image
Name

my_dissertation.pdf

Size

842.59 KB

Format

Adobe PDF

Checksum (MD5)

ffb7ca18c00d2133b11046d09a195185

Learn more about how TRACE supports reserach impact and open access here.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify