Encoding the Sensor Allocation Problem for Reinforcement Learning

dc.contributor.authorPenn, Dylan R.en
dc.contributor.committeechairBlack, Jonathan T.en
dc.contributor.committeememberSchroeder, Kevin Kenten
dc.contributor.committeememberSmith, Leonard Allenen
dc.contributor.committeememberFowler, Michael Chrispatricken
dc.contributor.departmentAerospace and Ocean Engineeringen
dc.date.accessioned2024-05-17T08:00:45Zen
dc.date.available2024-05-17T08:00:45Zen
dc.date.issued2024-05-16en
dc.description.abstractTraditionally, space situational awareness (SSA) sensor networks have relied on dynamic programming theory to generate tasking plans which govern how sensors are allocated to observe resident space objects. Deep reinforcement learning (DRL) techniques, with their ability to be trained on simulated environments, which are readily available for the SSA sensor allocation problem, and demonstrated performance in other fields, have potential to exceed performance of deterministic methods. The research presented in this dissertation develops techniques for encoding an SSA environment model to apply DRL to the sensor allocation problem. This dissertation is the compilation of two separate but related studies. The first study compares two alternative invalid action handling techniques, penalization and masking. The second study examines the performance of policies that have forecast state knowledge incorporated in the observation space.en
dc.description.abstractgeneralResident space objects (RSOs) are typically tracked by ground-based sensors (telescopes and radar). Determining how to allocate sensors to RSOs is a complex problem traditionally performed by dynamic programming techniques. Deep reinforcement learning (DRL), a subset of machine learning, has demonstrated performance in other fields, and has the potential to exceed performance of traditional techniques. The research presented in this dissertation develops techniques for encoding a space situational awareness environment model to apply DRL to the sensor allocation problem. This dissertation is the compilation of two separate but related studies. The first study compares two alternative invalid action handling techniques, penalization and masking. The second study examines the performance of policies that have forecast state knowledge incorporated in the observation space.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:40264en
dc.identifier.urihttps://hdl.handle.net/10919/119007en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectspace traffic managementen
dc.subjectresource allocationen
dc.subjectreinforcement learningen
dc.titleEncoding the Sensor Allocation Problem for Reinforcement Learningen
dc.typeDissertationen
thesis.degree.disciplineAerospace Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 1 of 1
Name:
Penn_DR_D_2024.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format