dc.contributor.author | López Pérez, Miguel | |
dc.contributor.author | Morales Álvarez, Pablo | |
dc.contributor.author | Molina Soriano, Rafael | |
dc.date.accessioned | 2023-03-10T09:05:15Z | |
dc.date.available | 2023-03-10T09:05:15Z | |
dc.date.issued | 2023-01-18 | |
dc.identifier.citation | M. López-Pérez... [et al.]. "Deep Gaussian Processes for Classification With Multiple Noisy Annotators. Application to Breast Cancer Tissue Classification," in IEEE Access, vol. 11, pp. 6922-6934, 2023, doi: [10.1109/ACCESS.2023.3237990] | es_ES |
dc.identifier.uri | https://hdl.handle.net/10481/80508 | |
dc.description.abstract | Machine learning (ML) methods often require large volumes of labeled data to achieve
meaningful performance. The expertise necessary for labeling data in medical applications like pathology
presents a significant challenge in developing clinical-grade tools. Crowdsourcing approaches address this
challenge by collecting labels from multiple annotators with varying degrees of expertise. In recent years,
multiple methods have been adapted to learn from noisy crowdsourced labels. Among them, Gaussian
Processes (GPs) have achieved excellent performance due to their ability to model uncertainty. Deep
Gaussian Processes (DGPs) address the limitations of GPs using multiple layers to enable the learning
of more complex representations. In this work, we develop Deep Gaussian Processes for Crowdsourcing
(DGPCR) to model the crowdsourcing problem with DGPs for the first time. DGPCR models the (unknown)
underlying true labels, and the behavior of each annotator is modeled with a confusion matrix among classes.
We use end-to-end variational inference to estimate both DGPCR parameters and annotator biases. Using
annotations from 25 pathologists and medical trainees, we show that DGPCR is competitive or superior
to Scalable Gaussian Processes for Crowdsourcing (SVGPCR) and other state-of-the-art deep-learning
crowdsourcing methods for breast cancer classification. Also, we observe that DGPCR with noisy labels
obtains better results (F1 = 81.91%) than GPs (F1 = 81.57%) and deep learning methods (F1 = 80.88%)
with true labels curated by experts. Finally, we show an improved estimation of annotators’ behavior. | es_ES |
dc.description.sponsorship | Ministry of Science and Innovation, Spain (MICINN) PID2019-105142RB-C22 | es_ES |
dc.description.sponsorship | Fondo Europeo de Desarrollo Regional (FEDER)/Junta de Andalucia-Consejeria deTransformacion Economica, Industria, Conocimiento y Universidades P20_00286 | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | IEEE | es_ES |
dc.rights | Atribución 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Crowdsourcing | es_ES |
dc.subject | Deep Gaussian Processes | es_ES |
dc.subject | Digital pathology | es_ES |
dc.subject | Breast cancer | es_ES |
dc.title | Deep Gaussian Processes for Classification With Multiple Noisy Annotators. Application to Breast Cancer Tissue Classification | es_ES |
dc.type | journal article | es_ES |
dc.rights.accessRights | open access | es_ES |
dc.identifier.doi | 10.1109/ACCESS.2023.3237990 | |
dc.type.hasVersion | VoR | es_ES |