CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of CancerShow others and affiliations
2021 (English)In: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, Neural Information Processing Systems Foundation , 2021Conference paper, Published paper (Refereed)
Abstract [en]
Interval and large invasive breast cancers, which are associated with worse prognosis than other cancers, are usually detected at a late stage due to false negative assessments of screening mammograms. The missed screening-time detection is commonly caused by the tumor being obscured by its surrounding breast tissues, a phenomenon called masking. To study and benchmark mammographic masking of cancer, in this work we introduce CSAW-M, the largest public mammographic dataset, collected from over 10,000 individuals and annotated with potential masking. In contrast to the previous approaches which measure breast image density as a proxy, our dataset directly provides annotations of masking potential assessments from five specialists. We also trained deep learning models on CSAW-M to estimate the masking level and showed that the estimated masking is significantly more predictive of screening participants diagnosed with interval and large invasive cancers – without being explicitly trained for these tasks – than its breast density counterparts.
Place, publisher, year, edition, pages
Neural Information Processing Systems Foundation , 2021.
National Category
Cancer and Oncology Radiology and Medical Imaging
Identifiers
URN: urn:nbn:se:kth:diva-361967Scopus ID: 2-s2.0-105000231004OAI: oai:DiVA.org:kth-361967DiVA, id: diva2:1949640
Conference
35th Conference on Neural Information Processing Systems - Track on Datasets and Benchmarks, NeurIPS Datasets and Benchmarks 2021, Virtual, Online, NA, Dec 6 2021 - Dec 14 2021
Note
QC 20250404
2025-04-032025-04-032025-04-04Bibliographically approved