Theses and Dissertations

FROM POINT ESTIMATES TO PREDICTIVE DISTRIBUTIONS IN MACHINE LEARNING MODELS - A STATISTICAL IMPORTANCE SAMPLING FRAMEWORK

Giuseppina Carannante, Rowan UniversityFollow

Date Approved

4-17-2023

Embargo Period

4-17-2025

Document Type

Dissertation

Degree Name

Ph.D. Doctor of Philosophy

Department

Electrical and Computer Engineering

College

Henry M. Rowan College of Engineering

Advisor

Nidhal C. Bouaynaya, Ph.D.

Committee Member 1

Gregory Ditzler, Ph.D.

Committee Member 2

Lyudmila S. Mihaylova, Ph.D.

Committee Member 3

Ghulam Rasool, Ph.D.

Committee Member 4

Umashanger Thayasivam, Ph.D.

Keywords

Bayesian Deep Learning, Machine Learning, Reliablility, Trustworthiness, Uncertainty Quantification

Subject(s)

Machine learning; Statistics

Disciplines

Artificial Intelligence and Robotics | Electrical and Computer Engineering

Abstract

In this thesis, we leverage powerful statistical frameworks for optimal sequential estimation and tracking in non-linear and non-Gaussian dynamical models, which enjoy proven (asymptotic) optimality properties. Initially, we build upon our previous work, which employed first-order Taylor series approximation to propagate the first two predictive moments, to derive Bayesian encoder-decoder networks. This work introduced the notion of dense, pixel-level uncertainty map that is crucial in fields, such as autonomous vehicles and medical segmentation. We then extended the Bayesian framework to an ensembling scheme based on ensemble Kalman Filtering (EnKF). While EnKF represents the predictive distribution with an ensemble of draws, it implicitly assumes a Gaussian model. Finally, we derived an advanced sequential importance sampling (SIS) technique to sequentially propagate random samples from the density through the model layers. The proposed importance sampling framework is able to approximate any density, not only Gaussians, has established performance guarantees and can be implemented in parallel. We demonstrate that propagating full distributions delivers significantly increased robustness to noise and adversarial attacks. Moreover, the second moment is a reliable metric to measure the model's confidence under various distributional shifts. Another notable characteristic of the model is its ability to self-assess its decisions, especially in situations where it generates inaccurate predictions. This feature represents a step towards the development of self-aware machines.

Recommended Citation

Carannante, Giuseppina, "FROM POINT ESTIMATES TO PREDICTIVE DISTRIBUTIONS IN MACHINE LEARNING MODELS - A STATISTICAL IMPORTANCE SAMPLING FRAMEWORK" (2023). Theses and Dissertations. 3091.
https://rdw.rowan.edu/etd/3091

Download

Included in

Artificial Intelligence and Robotics Commons, Electrical and Computer Engineering Commons

COinS

Rowan Digital Works

Theses and Dissertations

FROM POINT ESTIMATES TO PREDICTIVE DISTRIBUTIONS IN MACHINE LEARNING MODELS - A STATISTICAL IMPORTANCE SAMPLING FRAMEWORK

Date Approved

Embargo Period

Document Type

Degree Name

Department

College

Advisor

Committee Member 1

Committee Member 2

Committee Member 3

Committee Member 4

Keywords

Subject(s)

Disciplines

Abstract

Recommended Citation

Included in

Search

Browse

Author Corner

Rowan Digital Works

Theses and Dissertations

FROM POINT ESTIMATES TO PREDICTIVE DISTRIBUTIONS IN MACHINE LEARNING MODELS - A STATISTICAL IMPORTANCE SAMPLING FRAMEWORK

Author(s)

Date Approved

Embargo Period

Document Type

Degree Name

Department

College

Funder

Advisor

Committee Member 1

Committee Member 2

Committee Member 3

Committee Member 4

Keywords

Subject(s)

Disciplines

Abstract

Recommended Citation

Included in

Share

Search

Browse

Author Corner