A Temporal Encoder-Decoder Approach to Extracting Blood Volume Pulse Signal Morphology from Face Videos

dc.contributor.authorLi, Fulanen
dc.contributor.committeechairAbbott, Amos L.en
dc.contributor.committeememberSarkar, Abhijiten
dc.contributor.committeememberXuan, Jianhuaen
dc.contributor.departmentElectrical and Computer Engineeringen
dc.date.accessioned2023-07-06T08:00:18Zen
dc.date.available2023-07-06T08:00:18Zen
dc.date.issued2023-07-05en
dc.description.abstractThis thesis considers methods for extracting blood volume pulse (BVP) representations from video of the human face. Whereas most previous systems have been concerned with estimating vital signs such as average heart rate, this thesis addresses the more difficult problem of recovering BVP signal morphology. We present a new approach that is inspired by temporal encoder-decoder architectures that have been used for audio signal separation. As input, this system accepts a temporal sequence of RGB (red, green, blue) values that have been spatially averaged over a small portion of the face. The output of the system is a temporal sequence that approximates a BVP signal. In order to reduce noise in the recovered signal, a separate processing step extracts individual pulses and performs normalization and outlier removal. After these steps, individual pulse shapes have been extracted that are sufficiently distinct to support biometric authentication. Our findings demonstrate the effectiveness of our approach in extracting BVP signal morphology from facial videos, which presents exciting opportunities for further research in this area. The source code is available at https://github.com/Adleof/CVPM-2023-Temporal-Encoder-Decoder-iPPGen
dc.description.abstractgeneralThis thesis considers methods for extracting blood volume pulse (BVP) representations from video of the human face. We present a new approach that is inspired by the method that has been used for audio signal separation. The output of our system is an approximation of the BVP signal of the person in the video. Our method can extract a signal that is sufficiently distinct to support biometric authentication. Our findings demonstrate the effectiveness of our approach in extracting BVP signal morphology from facial videos, which presents exciting opportunities for further research in this area.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:37593en
dc.identifier.urihttp://hdl.handle.net/10919/115649en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectremote photoplethysmographyen
dc.subjectiPPGen
dc.subjectBVPen
dc.subjectTemporal Encoder-Decoderen
dc.subjectMachine Learningen
dc.titleA Temporal Encoder-Decoder Approach to Extracting Blood Volume Pulse Signal Morphology from Face Videosen
dc.typeThesisen
thesis.degree.disciplineComputer Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Li_F_T_2023.pdf
Size:
3.51 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Li_F_T_2023_support_1.pdf
Size:
100.98 KB
Format:
Adobe Portable Document Format
Description:
Supporting documents

Collections