Doctoral Dissertations
Date of Award
12-2020
Degree Type
Dissertation
Degree Name
Doctor of Philosophy
Major Professor
Jacob D. Hinkle
Committee Members
Jacob Hinkle, Georgia Tourassi, Vasileios Maroulas, Hairong Qi
Abstract
Deep learning (DL) has emerged as the leading paradigm for predictive modeling in a variety of domains, especially those involving large volumes of high-dimensional spatio-temporal data such as images and text. With the rise of big data in scientific and engineering problems, there is now considerable interest in the research and development of DL for scientific applications. The scientific domain, however, poses unique challenges for DL, including special emphasis on interpretability and robustness. In particular, a priority of the Department of Energy (DOE) is the research and development of probabilistic ML methods that are robust to overfitting and offer reliable uncertainty quantification (UQ) on high-dimensional noisy data that is limited in size relative to its complexity. Gaussian processes (GPs) are nonparametric Bayesian models that are naturally robust to overfitting and offer UQ out of the box. Unfortunately, traditional GP methods lack the balance of expressivity and domain-specific inductive bias that is key to the success of DL. Recently, however, a number of approaches have emerged to incorporate the DL paradigm into GP methods, including deep kernel learning (DKL), deep Gaussian processes (DGPs), and neural network Gaussian processes (NNGPs). In this work, we investigate DKL, DGPs, and NNGPs as paradigms for developing robust models for scientific applications. First, we develop DKL for text classification, and apply both DKL and Bayesian neural networks (BNNs) to the problem of classifying cancer pathology reports, with BNNs attaining new state-of-the-art results. Next, we introduce the deep ensemble kernel learning (DEKL) method, which is just as powerful as DKL while admitting easier model parallelism. Finally, we derive a new model called a "bottleneck NNGP" by unifying the DGP and NNGP paradigms, thus laying the groundwork for a new class of methods for future applications.
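As a rough illustration of what uncertainty quantification from a GP looks like in practice, the sketch below computes the posterior mean and variance of a zero-mean GP regressor with a squared-exponential kernel in NumPy. It is not taken from the dissertation: the kernel choice, the hyperparameter values, the toy sine data, and the function names `rbf_kernel` and `gp_posterior` are all assumptions made for the example.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel between 1-D input arrays a and b (assumed kernel)."""
    sq_dist = (a[:, None] - b[None, :]) ** 2
    return variance * np.exp(-0.5 * sq_dist / lengthscale**2)

def gp_posterior(x_train, y_train, x_test, noise=1e-2):
    """Posterior mean and pointwise variance of a zero-mean GP at x_test."""
    # Training covariance with observation noise added on the diagonal.
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_ts = rbf_kernel(x_train, x_test)
    k_ss = rbf_kernel(x_test, x_test)
    # Cholesky factorization for a numerically stable solve.
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_ts.T @ alpha
    v = np.linalg.solve(chol, k_ts)
    var = np.diag(k_ss) - np.sum(v**2, axis=0)
    return mean, var

# Toy data (hypothetical): noiseless samples of sin(x) on a small grid.
x_train = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
y_train = np.sin(x_train)

# One test point inside the training range, one far outside it.
x_test = np.array([0.5, 6.0])
mean, var = gp_posterior(x_train, y_train, x_test)
print("posterior mean:", mean)
print("posterior variance:", var)
```

The predictive variance is small near the training inputs and returns to the prior variance far from them, which is the built-in UQ behavior the abstract refers to. Deep kernel learning, as commonly described, would replace the raw inputs passed to the kernel with the outputs of a neural network feature extractor; that step is omitted here.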
Recommended Citation
Agrawal, Devanshu, "Nonparametric Bayesian Deep Learning for Scientific Data Analysis." PhD diss., University of Tennessee, 2020.
https://trace.tennessee.edu/utk_graddiss/6055
Included in
Applied Mathematics Commons, Artificial Intelligence and Robotics Commons, Data Science Commons, Probability Commons