Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Neuron Clustering for Mitigating Catastrophic Forgetting in Supervised and Reinforcement Learning
Details

Neuron Clustering for Mitigating Catastrophic Forgetting in Supervised and Reinforcement Learning

Date Issued
December 1, 2015
Author(s)
Goodrich, Benjamin Frederick  
Advisor(s)
Itamar Arel
Additional Advisor(s)
Jamie Coble, Jeremy Holleman, Jens Gregor
Abstract

Neural networks have had many great successes in recent years, particularly with the advent of deep learning and many novel training techniques. One issue that has affected neural networks and prevented them from performing well in more realistic online environments is that of catastrophic forgetting. Catastrophic forgetting affects supervised learning systems when input samples are temporally correlated or are non-stationary. However, most real-world problems are non-stationary in nature, resulting in prolonged periods of time separating inputs drawn from different regions of the input space.


Reinforcement learning represents a worst-case scenario when it comes to precipitating catastrophic forgetting in neural networks. Meaningful training examples are acquired as the agent explores different regions of its state/action space. When the agent is in one such region, only highly correlated samples from that region are typically acquired. Moreover, the regions that the agent is likely to visit will depend on its current policy, suggesting that an agent that has a good policy may avoid exploring particular regions. The confluence of these factors means that without some mitigation techniques, supervised neural networks as function approximation in temporal-difference learning will be restricted to the simplest test cases.

This work explores catastrophic forgetting in neural networks in terms of supervised and reinforcement learning. A simple mathematical model is introduced to argue that catastrophic forgetting is a result of overlapping representations in the hidden layers in which updates to the weights can affect multiple unrelated regions of the input space. A novel neural network architecture, dubbed "cluster-select," is introduced which utilizes online clustering for the selection of a subset of hidden neurons to be activated in the feedforward and backpropagation stages. Clusterselect is demonstrated to outperform leading techniques in both classification nd regression. In the context of reinforcement learning, cluster-select is studied for both fully and partially observable Markov decision processes and is demonstrated to converge faster and behave in a more stable manner when compared to other state-of-the-art algorithms.

Subjects

Neural Networks

Catastrophic Forgetti...

Clustering

Recurrent Clustering

Machine Learning

Reinforcement Learnin...

Disciplines
Artificial Intelligence and Robotics
Controls and Control Theory
Robotics
Degree
Doctor of Philosophy
Major
Computer Engineering
Embargo Date
January 1, 2011
File(s)
Thumbnail Image
Name

ben_goodrich_dissertation.pdf

Size

2.19 MB

Format

Adobe PDF

Checksum (MD5)

72687a9c98c517be648bdfd793203689

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify