Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Mitigation of Catastrophic Interference in Neural Networks and Ensembles using a Fixed Expansion Layer
Details

Mitigation of Catastrophic Interference in Neural Networks and Ensembles using a Fixed Expansion Layer

Date Issued
August 1, 2013
Author(s)
Coop, Robert Austin  
Advisor(s)
Itamar Arel
Additional Advisor(s)
Gregory Peterson, Jeremy Holleman, Joe Wilck
Abstract

Catastrophic forgetting (also known in the literature as catastrophic interference) is the phenomenon by which learning systems exhibit a severe exponential loss of learned information when exposed to relatively small amounts of new training data. This loss of information is not caused by constraints due to the lack of resources available to the learning system, but rather is caused by representational overlap within the learning system and by side-effects of the training methods used. Catastrophic forgetting in auto-associative pattern recognition is a well-studied attribute of most parameterized supervised learning systems. A variation of this phenomenon, in the context of feedforward neural networks, arises when non-stationary inputs lead to loss of previously learned mappings. The majority of the schemes proposed in the literature for mitigating catastrophic forgetting are not data-driven, but rather rely on storage of prior representations of the learning system. We introduce the Fixed Expansion Layer (FEL) feedforward neural network that embeds an expansion layer which sparsely encodes the information contained within the hidden layer, in order to help mitigate forgetting of prior learned representations. The fixed expansion layer approach is generally applicable to feedforward neural networks, as demonstrated by the application of the FEL technique to a recurrent neural network algorithm built on top of a standard feedforward neural network. Additionally, we investigate a novel framework for training ensembles of FEL networks, based on exploiting an information-theoretic measure of diversity between FEL learners, to further control undesired plasticity. The proposed methodology is demonstrated on a several tasks, clearly emphasizing its advantages over existing techniques. The architecture proposed can be applied to address a range of computational intelligence tasks, including classification problems, regression problems and system control.

Subjects

machine learning

neural network

catastrophic interfer...

non-stationary

concept drift

Disciplines
Computer Engineering
Degree
Doctor of Philosophy
Major
Computer Engineering
File(s)
Thumbnail Image
Name

coop_dissertation.pdf

Size

3.9 MB

Format

Adobe PDF

Checksum (MD5)

6a108b14e01a61d6e30ff72aaf65aa1d

Learn more about how TRACE supports reserach impact and open access here.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify