Differentially private synthetic medical data generation using convolutional gans

Files

TR Number

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Abstract

Deep learning models have demonstrated superior performance in several real-world application problems such as image classification and speech processing. However, creating these models in sensitive domains like healthcare typically requires addressing certain privacy challenges that bring unique concerns. One effective way to handle such private data concerns is to generate realistic synthetic data that can provide practically acceptable data quality as well as be used to improve model performance. To tackle this challenge, we develop a differentially private framework for synthetic data generation using Rényi differential privacy. Our approach builds on convolutional autoencoders and convolutional generative adversarial networks to preserve critical characteristics of the generated synthetic data. In addition, our model can capture the temporal information and feature correlations present in the original data. We demonstrate that our model outperforms existing state-of-the-art models under the same privacy budget using several publicly available benchmark medical datasets in both supervised and unsupervised settings. The source code of this work is available at https://github.com/astorfi/differentially-private-cgan.

Description

Keywords

Deep learning, differential privacy, synthetic data generation, generative adversarial networks

Citation