Galactic Flood Fill Segmentation and Machine Learning Redshift Estimation

dc.contributor.authorFerguson, Matthew Chaseen
dc.contributor.committeechairJones, Creed Farrisen
dc.contributor.committeememberTalty, Timothy Josephen
dc.contributor.committeememberPlassmann, Paul E.en
dc.contributor.departmentElectrical and Computer Engineeringen
dc.date.accessioned2025-01-22T09:00:15Zen
dc.date.available2025-01-22T09:00:15Zen
dc.date.issued2025-01-21en
dc.description.abstractThis thesis explores the use of machine learning redshift estimation models trained on segmented galactic images. Segmentation of galaxies from the background is accomplished using a flood fill segmentation method which is novel to the field of galactic segmentation. Astronomy datasets are so large due to high volume modern surveys that automated analysis techniques are now required. Redshift is a prime example of an expensive measurement that is a candidate for automation. The Sloan Digital Sky Survey alone imaged more than 1 billion objects in 9 years, but only produced 4 million spectra over more than 20 years. Machine learning is an automation technology that promises to efficiently analyze imaging data alone such that redshift can be estimated with a high degree of accuracy. Ground truth redshift and multi-band galactic images were obtained for 200,000 galaxies from the Sloan Digital Sky Survey. Two model architectures were experimented with, a fully connected artificial neural network, and a convolutional neural network. Experiments were conducted on flood fill parameters, crop sizes, color spaces, and thresholding. We demonstrated that model performance on flood fill segments is higher than on unsegmented images across many crop sizes. The best achieved model performances for artificial neural networks, and convolutional neural networks are median absolute dispersions of 0.024 and 0.031, respectively.en
dc.description.abstractgeneralAstronomy has progressed to the point where there is too much data for humans to analyze manually. Every year more data is created. Machine learning is a technology that can sort through all this data. We used machine learning to create a method to predict redshift of galaxies. Redshift is a measure of the object's speed and tells us about the history of the universe. We also created a novel way of separating galaxies in images from the background using flood fill. Machine learning creates a function that connects two datasets. Our first dataset is 200,000 galactic images from the Sloan Digital Sky Survey, and our second dataset is redshift measurements of these galaxies. We connected images to their redshift using a machine learning model. This lets us take images of galaxies and estimate their redshift. Our best models exceeded the performance of some other redshift estimation techniques.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:42354en
dc.identifier.urihttps://hdl.handle.net/10919/124287en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectRedshiften
dc.subjectFlood Fillen
dc.subjectSegmentationen
dc.subjectMachine Learningen
dc.subjectEstimationen
dc.subjectRegressionen
dc.subjectGalaxyen
dc.subjectDeep Learningen
dc.titleGalactic Flood Fill Segmentation and Machine Learning Redshift Estimationen
dc.typeThesisen
thesis.degree.disciplineComputer Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ferguson_MC_T_2025.pdf
Size:
1.78 MB
Format:
Adobe Portable Document Format

Collections