The Impact of Synthetic Data on Membership Inference Attacks
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Theoretical Computer Science, TCS. ORCID iD: 0000-0001-6934-0378
KTH, School of Electrical Engineering and Computer Science (EECS), Computer Science, Theoretical Computer Science, TCS. ORCID iD: 0000-0001-5742-5462
2023 (English). In: Security and Privacy in Social Networks and Big Data: 9th International Symposium, SocialSec 2023, Proceedings, Springer Nature, 2023, p. 93-108. Conference paper, Published paper (Refereed)
Abstract [en]

Privacy of machine learning on Big Data has become a prominent issue in recent years due to the increased availability and use of sensitive personal data to train models. Membership inference attacks have been identified as a major privacy threat against machine learning models. Several techniques, including differential privacy, have been advocated to reduce the effectiveness of inference attacks; however, they come at the cost of reduced utility/accuracy. Synthetic data has recently been widely studied as a tool for privacy preservation, but not yet much in the context of membership inference attacks. In this work, we aim to deepen the understanding of the impact of synthetic data on membership inference attacks. We compare models trained on original versus synthetic data, evaluate different synthetic data generation methods, and study the effect of overfitting on membership inference attacks. Our investigation reveals that training on synthetic data can significantly reduce the effectiveness of membership inference attacks compared to models trained directly on the original data. This also holds for highly overfitted models, which have been shown to increase the success rate of membership inference attacks. We also find that different synthetic data generation methods do not differ much in terms of membership inference attack accuracy, but they do differ in terms of utility (measured by train/test accuracy). Since synthetic data shows promising results against the binary classification-based membership inference attacks on classification models explored in this work, exploring its impact on other attack types, other models, and attribute inference attacks would be worthwhile.

Place, publisher, year, edition, pages
Springer Nature, 2023. p. 93-108
Keywords [en]
Accuracy, Machine Learning, Membership Inference Attack, Synthetic Data
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-337995
DOI: 10.1007/978-981-99-5177-2_6
Scopus ID: 2-s2.0-85172225507
OAI: oai:DiVA.org:kth-337995
DiVA, id: diva2:1804340
Conference
Security and Privacy in Social Networks and Big Data - 9th International Symposium, SocialSec 2023, Proceedings, Canterbury, United Kingdom of Great Britain and Northern Ireland, Aug 14 2023 - Aug 16 2023
Note

Part of ISBN 9789819951765

QC 20231012

Available from: 2023-10-12 Created: 2023-10-12 Last updated: 2023-10-12 Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Khan, Md Sakib Nizam; Buchegger, Sonja
