Extraction of Blood Volume Pulse Morphology from Facial Videos Using an LSTM-Based Temporal Encoder-Decoder Model
dc.contributor.author | Tyler, Jonathan David | en |
dc.contributor.committeechair | Abbott, Amos L. | en |
dc.contributor.committeemember | Sarkar, Abhijit | en |
dc.contributor.committeemember | Jones, Creed Farris | en |
dc.contributor.department | Electrical and Computer Engineering | en |
dc.date.accessioned | 2025-03-29T08:00:37Z | en |
dc.date.available | 2025-03-29T08:00:37Z | en |
dc.date.issued | 2025-03-28 | en |
dc.description.abstract | This thesis introduces a method for extracting blood volume pulse (BVP) signals from facial videos, moving beyond basic heart rate estimation to capture full pulse waveforms. Our approach adapts techniques from audio signal separation and applies them to video, using a machine learning model capable of processing complex time-based data. By incorporating both regular RGB (red, green, blue) and infrared (850nm, 940nm) video, we enhance the quality of the extracted signals, making signal extraction more reliable under different lighting conditions. This method not only improves accuracy in measuring real-time heart rate but also captures unique heart patterns that could support biometric identification. Our findings show that this approach effectively recovers detailed BVP shapes from video, paving the way for advancements in health monitoring and identity verification technologies. | en |
dc.description.abstractgeneral | This thesis focuses on how to measure heart signals from facial videos in a way that captures more detail than just average heart rate. We use a machine learning model designed for an audio separation task, adapting it to separate blood flow signals from noise in signals extracted from video of the face. By adding infrared video data along with regular color channels, our method becomes more accurate, especially in low-light situations. This allows us to not only calculate a person's heart rate more precisely but also to create unique patterns from their heartbeat, which could help in personal identification. Through testing, we show that our method can successfully extract clear heart signals from video, opening up new uses for health monitoring and security. | en |
dc.description.degree | Master of Science | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:42552 | en |
dc.identifier.uri | https://hdl.handle.net/10919/125112 | en |
dc.language.iso | en | en |
dc.publisher | Virginia Tech | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | Temporal Encoder-Decoder | en |
dc.subject | Machine Learning | en |
dc.subject | Remote Photoplethysmography | en |
dc.subject | iPPG | en |
dc.subject | BVP | en |
dc.subject | Instantaneous Heart Rate | en |
dc.title | Extraction of Blood Volume Pulse Morphology from Facial Videos Using an LSTM-Based Temporal Encoder-Decoder Model | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer Engineering | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | masters | en |
thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1