Differentially private synthetic medical data generation using convolutional gans

dc.contributor.authorTorfi, Amirsinaen
dc.contributor.authorFox, Edward A.en
dc.contributor.authorReddy, Chandan K.en
dc.date.accessioned2024-01-22T13:05:33Zen
dc.date.available2024-01-22T13:05:33Zen
dc.date.issued2022en
dc.description.abstractDeep learning models have demonstrated superior performance in several real-world application problems such as image classification and speech processing. However, creating these models in sensitive domains like healthcare typically requires addressing certain privacy challenges that bring unique concerns. One effective way to handle such private data concerns is to generate realistic synthetic data that can provide practically acceptable data quality as well as be used to improve model performance. To tackle this challenge, we develop a differentially private framework for synthetic data generation using Rényi differential privacy. Our approach builds on convolutional autoencoders and convolutional generative adversarial networks to preserve critical characteristics of the generated synthetic data. In addition, our model can capture the temporal information and feature correlations present in the original data. We demonstrate that our model outperforms existing state-of-the-art models under the same privacy budget using several publicly available benchmark medical datasets in both supervised and unsupervised settings. The source code of this work is available at https://github.com/astorfi/differentially-private-cgan.en
dc.description.versionSubmitted versionen
dc.format.extentPages 485-500en
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1016/j.ins.2021.12.018en
dc.identifier.eissn1872-6291en
dc.identifier.issn0020-0255en
dc.identifier.orcidReddy, Chandan [0000-0003-2839-3662]en
dc.identifier.orcidFox, Edward [0000-0003-1447-6870]en
dc.identifier.urihttps://hdl.handle.net/10919/117430en
dc.identifier.volume586en
dc.language.isoenen
dc.publisherElsevieren
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectDeep learningen
dc.subjectdifferential privacyen
dc.subjectsynthetic data generationen
dc.subjectgenerative adversarial networksen
dc.titleDifferentially private synthetic medical data generation using convolutional gansen
dc.title.serialInformation Sciencesen
dc.typeArticleen
dc.type.dcmitypeTexten
dc.type.otherarticleen
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Engineeringen
pubs.organisational-group/Virginia Tech/Engineering/Computer Scienceen
pubs.organisational-group/Virginia Tech/Faculty of Health Sciencesen
pubs.organisational-group/Virginia Tech/All T&R Facultyen
pubs.organisational-group/Virginia Tech/Engineering/COE T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DifferentiallyPrivate-arXiv.pdf
Size:
5.32 MB
Format:
Adobe Portable Document Format
Description:
Submitted version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Plain Text
Description: