Open this publication in new window or tab >>Show others...
2021 (English)In: 2021 IEEE International Conference on Data Mining (ICDM) / [ed] Bailey, J Miettinen, P Koh, YS Tao, D Wu, X, Institute of Electrical and Electronics Engineers (IEEE) , 2021, p. 200-209Conference paper, Published paper (Refereed)
Abstract [en]
Semi-supervised learning aims to generate a model with a better performance using plenty of unlabeled data. However, most existing methods treat unlabeled data equally without considering whether it is safe or not, which may lead to the degradation of prediction performance. In this paper, towards reliable semi-supervised learning, we propose a data-driven algorithm, called Reliability Propagation (RP), to learn the reliability of each unlabeled instance. The basic idea is to take local label regularity as a prior, and then perform reliability propagation on an adaptive graph. As a result, the most reliable unlabeled instances could be selected to construct a safer classifier. Beyond, the distributed RP algorithm is introduced to scale up to large volumes of data. In contrast to existing approaches, RP exploits the structural information and shed light on the soft instance selection for unlabeled data in a classifier-independent way. Experiments on both synthetic and real-world data have demonstrated that RP allows extracting most reliable unlabeled instances and supports a gained prediction performance compared to other algorithms.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2021
Series
IEEE International Conference on Data Mining, ISSN 1550-4786
Keywords
Semi-supervised learning, classification, reliability propagation
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-311677 (URN)10.1109/ICDM51629.2021.00030 (DOI)000780454100021 ()2-s2.0-85125192455 (Scopus ID)
Conference
21st IEEE International Conference on Data Mining, ICDM 2021, Virtual, Online, 7 December 2021 through 10 December 2021
Note
Part of proceedings: ISBN 978-1-6654-2398-4
QC 20220502
2022-05-022022-05-022022-06-25Bibliographically approved