Title: The CrowdGleason dataset: Learning the Gleason grade from crowds and experts

Authors: López Pérez, Miguel; Morquecho, Alba; Schmidt, Arne; Pérez Bueno, Fernando; Martín Castro, Aurelio; Mateos Delgado, Javier; Molina Soriano, Rafael

Keywords: Computational pathology; Crowdsourcing; Prostate cancer

Abstract:

Background: Currently, prostate cancer (PCa) diagnosis relies on the human analysis of prostate biopsy Whole Slide Images (WSIs) using the Gleason score. Since this process is error-prone and time-consuming, recent advances in machine learning have promoted the use of automated systems to assist pathologists. Unfortunately, labeled datasets for training and validation are scarce due to the need for expert pathologists to provide ground-truth labels.

Methods: This work introduces a new prostate histopathological dataset named CrowdGleason, which consists of 19,077 patches from 1045 WSIs with various Gleason grades. The dataset was annotated using a crowdsourcing protocol involving seven pathologists-in-training to distribute the labeling effort. To provide a baseline analysis, two crowdsourcing methods based on Gaussian Processes (GPs) were evaluated for Gleason grade prediction: SVGPCR, which learns a model from the CrowdGleason dataset, and SVGPMIX, which combines data from the public dataset SICAPv2 and the CrowdGleason dataset. The performance of these methods was compared with other crowdsourcing and expert label-based methods through comprehensive experiments.

Results: The results demonstrate that our GP-based crowdsourcing approach outperforms other methods for aggregating crowdsourced labels (κ = 0.7048 ± 0.0207 for SVGPCR vs. κ = 0.6576 ± 0.0086 for SVGP with majority voting). SVGPCR trained with crowdsourced labels performs better than GP trained with expert labels from SICAPv2 (κ = 0.6583 ± 0.0220) and outperforms most individual pathologists-in-training (mean κ = 0.5432). Additionally, SVGPMIX trained with a combination of SICAPv2 and CrowdGleason achieves the highest performance on both datasets (κ = 0.7814 ± 0.0083 and κ = 0.7276 ± 0.0260).

Conclusion: The experiments show that the CrowdGleason dataset can be successfully used for training and validating supervised and crowdsourcing methods. Furthermore, the crowdsourcing methods trained on this dataset obtain competitive results against those using expert labels. Interestingly, the combination of expert and non-expert labels opens the door to a future of massive labeling by incorporating both expert and non-expert pathologist annotators.

Citation: López Pérez, M. et al. Computer Methods and Programs in Biomedicine 257 (2024) 108472. https://doi.org/10.1016/j.cmpb.2024.108472
Handle: https://hdl.handle.net/10481/97072
DOI: 10.1016/j.cmpb.2024.108472
Type: journal article
Publication date: 2024-11-01 (deposited 2024-11-19T09:50:37Z)
Language: English
License: Attribution-NonCommercial 4.0 International (http://creativecommons.org/licenses/by-nc/4.0/)
Access: open access
Publisher: Elsevier
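To make the baseline comparison in the Results concrete, below is a minimal sketch of the majority-voting pipeline against which SVGPCR is compared: per-patch labels from several annotators are aggregated by majority vote and a GP classifier is then trained on the aggregated labels. The feature array, annotation matrix, class encoding (0 = non-cancerous, 1-3 = Gleason grades), and the use of scikit-learn's GaussianProcessClassifier as a stand-in for the sparse variational GP of the paper are all assumptions for illustration, not the authors' implementation.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF


def majority_vote(crowd_labels, num_classes=4, missing=-1):
    """Aggregate per-patch annotations from several annotators by majority vote.

    crowd_labels: (n_patches, n_annotators) integer array where `missing`
    marks patches an annotator did not label. Ties are broken by the lowest
    class index.
    """
    aggregated = np.empty(crowd_labels.shape[0], dtype=int)
    for i, row in enumerate(crowd_labels):
        votes = row[row != missing]
        aggregated[i] = np.bincount(votes, minlength=num_classes).argmax()
    return aggregated


# Placeholder data standing in for extracted patch features and the seven
# pathologists-in-training's annotations (hypothetical shapes and encoding).
X = np.random.rand(200, 128)                # patch feature vectors
crowd = np.random.randint(0, 4, (200, 7))   # crowd annotations, one column per annotator

y_mv = majority_vote(crowd)
clf = GaussianProcessClassifier(kernel=1.0 * RBF(length_scale=1.0))
clf.fit(X, y_mv)
print(clf.predict(X[:5]))
```

Crowdsourcing models such as SVGPCR instead keep the individual annotations and learn per-annotator reliability jointly with the classifier, which is what the reported gap (κ = 0.7048 vs. 0.6576) measures against this simpler aggregation.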