Privacy Preserving Survival PredictionShow others and affiliations
2021 (English)In: 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) / [ed] Chen, Y Ludwig, H Tu, Y Fayyad, U Zhu, X Hu, X Byna, S Liu, X Zhang, J Pan, S Papalexakis, V Wang, J Cuzzocrea, A Ordonez, C, Institute of Electrical and Electronics Engineers (IEEE) , 2021, p. 4600-4608Conference paper, Published paper (Refereed)
Abstract [en]
Predictive modeling has the potential to improve risk stratification of cancer patients and thereby contribute to optimized treatment strategies and better outcomes for patients in clinical practice. To develop robust predictive models for decision-making in healthcare, sensitive patient-level data is often required when developing the training models. Consequently, data privacy is an important aspect to consider when building these predictive models and in subsequent communication of the results. In this study we have used Graph Neural Networks for survival prediction, and compared the accuracy to state-of-the-art prediction models after applying Differential Privacy and k-Anonymity, i.e. two privacy-preservation solutions. By using two different data sources we demonstrated that Graph Neural Networks and Survival Forests are the two most well-performing survival prediction methods when used in combination with privacy preservation solutions. Furthermore, when the predictive model was built using clinical expertise in the specific area of interest, the prediction accuracy of the proposed knowledge based graph model drops by at most 10% when used with privacy preservation solutions. Our proposed knowledge based graph is therefore more suitable to be used in combination with privacy preservation solutions as compared to other graph models.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2021. p. 4600-4608
Series
IEEE International Conference on Big Data, ISSN 2639-1589
Keywords [en]
knowledge graph, survival prediction, privacy preservation, differential privacy, anonymization, clinical data, national registry, graph neural network, survival forest
National Category
Computer Sciences Cancer and Oncology
Identifiers
URN: urn:nbn:se:kth:diva-315412DOI: 10.1109/BigData52589.2021.9672036ISI: 000800559504103Scopus ID: 2-s2.0-85125348391OAI: oai:DiVA.org:kth-315412DiVA, id: diva2:1681656
Conference
9th IEEE International Conference on Big Data (IEEE BigData), 15-18 December, 2021, Virtual
Note
Part of proceedings: ISBN 978-1-6654-3902-2
QC 20220707
2022-07-072022-07-072024-03-18Bibliographically approved