Theses and Dissertations

A MACHINE LEARNING FRAMEWORK FOR AUTOMATIC SPEECH RECOGNITION IN AIR TRAFFIC CONTROL USING WORD LEVEL BINARY CLASSIFICATION AND TRANSCRIPTION

Fowad Shahid Sohail, Rowan UniversityFollow

Date Approved

9-23-2022

Embargo Period

9-27-2022

Document Type

Thesis

Degree Name

M.S. Electrical and Computer Engineering

Department

Electrical and Computer Engineering

College

Henry M. Rowan College of Engineering

Advisor

Ravi P. Ramachandran, Ph.D.

Committee Member 1

Parth Bhavsar, Ph.D.

Committee Member 2

Yusuf Mehta, Ph.D., P.E.

Committee Member 3

John Schmalzel, Ph.D., P.E.

Subject(s)

Automatic speech recognition; Air traffic control; Machine learning

Disciplines

Artificial Intelligence and Robotics | Electrical and Computer Engineering | Multi-Vehicle Systems and Air Traffic Control

Abstract

Advances in Artificial Intelligence and Machine learning have enabled a variety of new technologies. One such technology is Automatic Speech Recognition (ASR), where a machine is given audio and transcribes the words that were spoken. ASR can be applied in a variety of domains to improve general usability and safety. One such domain is Air Traffic Control (ATC). ASR in ATC promises to improve safety in a mission critical environment. ASR models have historically required a large amount of clean training data. ATC environments are noisy and acquiring labeled data is a difficult, expertise dependent task. This thesis attempts to solve these problems by presenting a machine learning framework which uses word-by-word audio samples to transcribe ATC speech. Instead of transcribing an entire speech sample, this framework transcribes every word individually. Then, overall transcription is pieced together based on the word sequence. Each stage of the framework is trained and tested independently of one another, and the overall performance is gauged. The overall framework was gauged to be a feasible approach to ASR in ATC.

Recommended Citation

Sohail, Fowad Shahid, "A MACHINE LEARNING FRAMEWORK FOR AUTOMATIC SPEECH RECOGNITION IN AIR TRAFFIC CONTROL USING WORD LEVEL BINARY CLASSIFICATION AND TRANSCRIPTION" (2022). Theses and Dissertations. 3057.
https://rdw.rowan.edu/etd/3057

Download

Included in

Artificial Intelligence and Robotics Commons, Electrical and Computer Engineering Commons, Multi-Vehicle Systems and Air Traffic Control Commons

COinS

Rowan Digital Works

Theses and Dissertations

A MACHINE LEARNING FRAMEWORK FOR AUTOMATIC SPEECH RECOGNITION IN AIR TRAFFIC CONTROL USING WORD LEVEL BINARY CLASSIFICATION AND TRANSCRIPTION

Date Approved

Embargo Period

Document Type

Degree Name

Department

College

Advisor

Committee Member 1

Committee Member 2

Committee Member 3

Subject(s)

Disciplines

Abstract

Recommended Citation

Included in

Search

Browse

Author Corner

Rowan Digital Works

Theses and Dissertations

A MACHINE LEARNING FRAMEWORK FOR AUTOMATIC SPEECH RECOGNITION IN AIR TRAFFIC CONTROL USING WORD LEVEL BINARY CLASSIFICATION AND TRANSCRIPTION

Author(s)

Date Approved

Embargo Period

Document Type

Degree Name

Department

College

Advisor

Committee Member 1

Committee Member 2

Committee Member 3

Subject(s)

Disciplines

Abstract

Recommended Citation

Included in

Share

Search

Browse

Author Corner