A fusocelular skin dataset with whole slide images for deep learning models
Metadatos
Mostrar el registro completo del ítemAutor
del Amor, Rocío; López-Pérez, Miguel; Meseguer, Pablo; Morales, Sandra; Terradez, Liria; Aneiros Fernández, José; Mateos Delgado, Javier; Molina Soriano, Rafael; Naranjo, ValeryEditorial
Springer Nature
Fecha
2025-05-14Referencia bibliográfica
del Amor, R., López-Pérez, M., Meseguer, P. et al. A fusocelular skin dataset with whole slide images for deep learning models. Sci Data 12, 788 (2025). [DOI: 10.1038/s41597-025-05108-3]
Patrocinador
Ministerio de Economía, Comercio y Empresa; Ministerio de Universidades (FPU20/05263); valgrAI - Valencian Graduate School and Research Network of Artifcial Intelligence; Universitat Politécnica de Valéncia (PAID-10-20); MICIU/AEI/10.13039/501100011033; European Union - NextGenerationEU/PRTRResumen
Cutaneous spindle cell (CSC) lesions encompass a spectrum from benign to malignant neoplasms, often posing significant diagnostic challenges. Computer-aided diagnosis systems offer a promising solution to make pathologists’ decisions objective and faster. These systems usually require large-scale datasets with curated labels for effective training; however, manual annotation is time-consuming and expensive. To overcome this challenge, crowdsourcing has emerged as a popular and valuable strategy to scale up the labeling process by distributing the effort among different non-expert annotators. This work introduces AI4SkIN, the first public dataset Whole Slide Images (WSIs) for CSC neoplasms, annotated using an innovative crowdsourcing protocol. AI4SkIN dataset contains 641 Hematoxylin and Eosin stained WSIs with multiclass labels from both expert and trainee pathologists. The dataset improves CSC neoplasm diagnosis using advanced machine learning and crowdsourcing based on Gaussian Processes, showing that models trained on non-expert labels perform comparably to those using expert labels. In conclusion, we illustrate that AI4SkIN provides a good resource for developing and validating methods for multiclass CSC neoplasm classification.