Date Approved

6-2-2025

Embargo Period

6-2-2025

Document Type

Thesis

Degree Name

M.S. Data Science

Department

Computer Science

College

College of Science & Mathematics

Advisor

Silvija Kokalj-Filipovic, Ph.D.

Committee Member 1

Anthony Breitzman, Ph.D.

Committee Member 2

Ho Shen Shyang, Ph.D.

Keywords

Data Science; Deep Learning; Machine Learning

Disciplines

Computer Sciences | Physical Sciences and Mathematics

Abstract

Modern signal processing AI applications face increasing demands for diverse training data while operating under computational constraints. State-of-the-art generative models, though effective, often require prohibitive resources, limiting their deployment in real-time or embedded systems. This thesis proposes a computationally efficient framework for synthetic signal generation using a two-stage architecture that combines a Vector Quantized Variational Autoencoder (VQ-VAE) with either a decoder-only transformer or a discrete diffusion model. The VQ-VAE encodes high-dimensional signals into discrete latent tokens, significantly reducing model complexity while enabling symbolic sequence modeling. These discrete token sequences are then modeled using either transformer-based autoregressive models or Score Entropy Discrete Diffusion (SEDD) models. We validate this approach on two datasets: TorchSig for radio-frequency (RF) signals and AudioMNIST for spoken digits. Our work introduces the first discrete-diffusion-based generative models for both audio and RF data and presents the first transformer-based generative model for RF signals trained entirely in discrete latent space. We also improve and extend an existing discrete-space transformer-based speech synthesis pipeline and perform a comprehensive comparative analysis of these generative models across domains. The results demonstrate that these methods maintain high fidelity, generate diverse and realistic signals, and offer substantial computational advantages. This work establishes a scalable foundation for efficient data augmentation in signal-driven machine learning systems and opens new directions for generative modeling in low-resource environments.
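As a rough illustration of the first stage described in the abstract, the sketch below shows the nearest-neighbor codebook lookup that a VQ-VAE uses to turn continuous latent vectors into discrete tokens. It is not the thesis code: the function names (`quantize`, `dequantize`), the three-entry codebook, and the toy latent vectors are all assumptions chosen for readability, and a real model would learn the codebook jointly with an encoder and decoder.

```python
# Illustrative sketch only, not the thesis implementation.
# The codebook and latent vectors are made-up toy values.

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def quantize(latents, codebook):
    """Map each latent vector to the index of its nearest codebook entry."""
    return [
        min(range(len(codebook)), key=lambda k: squared_distance(z, codebook[k]))
        for z in latents
    ]

def dequantize(tokens, codebook):
    """Replace each token index with its codebook vector (the decoder's input)."""
    return [codebook[t] for t in tokens]

# Hypothetical 3-entry codebook of 2-dimensional code vectors.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]

# Hypothetical encoder outputs for four time steps of a signal.
latents = [[0.1, -0.2], [0.9, 1.2], [-0.8, 0.7], [0.2, 0.1]]

tokens = quantize(latents, codebook)
print(tokens)  # [0, 1, 2, 0]
```

The resulting integer sequence is the kind of symbolic input that a second-stage sequence model (autoregressive transformer or discrete diffusion) would be trained on; generated token sequences would then be passed through `dequantize` and a decoder to obtain a signal.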

Recommended Citation

Kaasaragadda, Yagna Veera Narayan, "EFFICIENT SIGNAL SYNTHESIS FOR DATA AUGMENTATION USING GENERATIVE AI" (2025). Theses and Dissertations. 3376.
https://rdw.rowan.edu/etd/3376

Included in

Computer Sciences Commons

Rowan Digital Works

Theses and Dissertations

EFFICIENT SIGNAL SYNTHESIS FOR DATA AUGMENTATION USING GENERATIVE AI
