Synthetic and Private Smart Health Care Data Generation using GANs
2021 (English)In: 30th International Conference on Computer Communications and Networks (ICCCN 2021), Institute of Electrical and Electronics Engineers (IEEE) , 2021Conference paper, Published paper (Refereed)
Abstract [en]
With the rapid advancements in machine learning, the health care paradigm is shifting from treatment towards prevention. The smart health care industry relies on the availability of large-scale health datasets in order to benefit from machine learning-based services. As a consequence, preserving the individuals' privacy becomes vital for sharing sensitive personal information. Synthetic datasets with generative models are considered to be one of the most promising solutions for privacy-preserving data sharing. Among the generative models, generative adversarial networks (GANs) have emerged as the most impressive models for synthetic data generation in recent times. However, smart health care data is attributed with unique challenges such as volume, velocity, and various data types and distributions. We propose a GAN coupled with differential privacy mechanisms for generating a realistic and private smart health care dataset. The proposed approach is not only able to generate realistic synthetic data samples but also the differentially private data samples under different settings: learning from a noisy distribution or noising the learned distribution. We tested and evaluated our proposed approach using a real-world Fitbit dataset. Our results indicate that our proposed approach is able to generate quality synthetic and differentially private dataset that preserves the statistical properties of the original dataset.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2021.
Series
IEEE International Conference on Computer Communications and Networks, ISSN 1095-2055
Keywords [en]
Generative adversarial networks, differential privacy, synthetic data generation, smart health care, fitness trackers
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-304189DOI: 10.1109/ICCCN52240.2021.9522203ISI: 000701532600035Scopus ID: 2-s2.0-85114964507OAI: oai:DiVA.org:kth-304189DiVA, id: diva2:1609002
Conference
30th International Conference on Computer Communications and Networks (ICCCN), JUL 19-22, 2021, ELECTR NETWORK
Note
Part of proceedings: ISBN 978-1-6654-1278-0, QC 20230117
2021-11-052021-11-052023-03-06Bibliographically approved