Improving Fairness for Data Valuation in Horizontal Federated LearningShow others and affiliations
2022 (English)In: 38th IEEE International Conference on Data Engineering, ICDE 2022, Institute of Electrical and Electronics Engineers (IEEE) , 2022, p. 2440-2453Conference paper, Published paper (Refereed)
Abstract [en]
Federated learning is an emerging decentralized machine learning scheme that allows multiple data owners to work collaboratively while ensuring data privacy. The success of federated learning depends largely on the participation of data owners. To sustain and encourage data owners' participation, it is crucial to fairly evaluate the quality of the data provided by the data owners as well as their contribution to the final model and reward them correspondingly. Federated Shapley value, recently proposed by Wang et al. [Federated Learning, 2020], is a measure for data value under the framework of federated learning that satisfies many desired properties for data valuation. However, there are still factors of potential unfairness in the design of federated Shapley value because two data owners with the same local data may not receive the same evaluation. We propose a new measure called completed federated Shapley value to improve the fairness of federated Shapley value. The design depends on completing a matrix consisting of all the possible contributions by different subsets of the data owners. It is shown under mild conditions that this matrix is approximately low-rank by leveraging concepts and tools from optimization. Both theoretical analysis and empirical evaluation verify that the proposed measure does improve fairness in many circumstances.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2022. p. 2440-2453
Series
IEEE International Conference on Data Engineering, ISSN 1084-4627
Keywords [en]
contribution evaluation, fairness, federated learning
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-321044DOI: 10.1109/ICDE53745.2022.00228ISI: 000855078402037Scopus ID: 2-s2.0-85136393416OAI: oai:DiVA.org:kth-321044DiVA, id: diva2:1708567
Conference
38th IEEE International Conference on Data Engineering (ICDE), MAY 09-11, 2022, ELECTR NETWORK
Note
Part of proceedings: ISBN 978-1-6654-0883-7
QC 20221104
2022-11-042022-11-042022-11-04Bibliographically approved