Generating Synthetic Healthcare Records Using Convolutional Generative Adversarial Networks

dc.contributor.authorTorfi, Amirsinaen
dc.contributor.authorBeyki, Mohammadrezaen
dc.date.accessioned2019-12-21T00:48:48Zen
dc.date.available2019-12-21T00:48:48Zen
dc.date.issued2019-12-20en
dc.description.abstractDeep learning models have demonstrated high-quality performance in several areas such as image classification and speech processing. However, creating a deep learning model using electronic health record (EHR) data requires addressing particular privacy challenges that make this issue unique to researchers in this domain. This matter focuses attention on generating realistic synthetic data to amplify privacy. Existing methods for artificial data generation suffer from different limitations such as being bound to particular use cases. Furthermore, their generalizability to real-world problems is controversial regarding the uncertainties in defining and measuring key realistic characteristics. Henceforth, there is a need to establish insightful metrics (and to measure the validity of synthetic data), as well as quantitative criteria regarding privacy restrictions. We propose the use of Generative Adversarial Networks to help satisfy requirements for realistic characteristics and acceptable values of privacy metrics simultaneously. The present study makes several unique contributions to synthetic data generation in the healthcare domain. First, utilizing 1-D Convolutional Neural Networks (CNNs), we devise a new approach to capturing the correlation between adjacent diagnosis records. Second, we employ convolutional autoencoders to map the discrete-continuous values. Finally, we devise a new approach to measure the similarity between real and synthetic data, and a means to measure the fidelity of the synthetic data and its associated privacy risks.en
dc.description.notesGeneratingSyntheticEHRdataPresentation.zip: The presentation source files, GeneratingSyntheticEHRdataPresentation.pdf: The presentation PDF, GeneratingSyntheticEHRdataReport.zip: The report source files, GeneratingSyntheticEHRdataReport.pdf: The PDF reporten
dc.description.sponsorshipNewWaveen
dc.identifier.urihttp://hdl.handle.net/10919/96186en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/en
dc.subjectsynthetic dataen
dc.subjectgenerative adversarial networksen
dc.subjecthealthcareen
dc.subjectconvolutional neural networksen
dc.titleGenerating Synthetic Healthcare Records Using Convolutional Generative Adversarial Networksen
dc.typePresentationen
dc.typeReporten

Files

Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
GeneratingSyntheticEHRdataPresentation.pdf
Size:
7.61 MB
Format:
Adobe Portable Document Format
Name:
GeneratingSyntheticEHRdataPresentation.zip
Size:
21.88 MB
Format:
Loading...
Thumbnail Image
Name:
GeneratingSyntheticEHRdataReport.pdf
Size:
555.78 KB
Format:
Adobe Portable Document Format
Name:
GeneratingSyntheticEHRdataReport.zip
Size:
612.25 KB
Format:
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: