Using Variational Autoencoders for Out of Distribution Detection in Histological Multiple Instance Learning
Metadatos
Mostrar el registro completo del ítemAutor
Sáez-Maldonado, Francisco Javier; García Martínez, María Luz; Cooper, Lee A.D.; Goldstein, Jeffery A.; Molina Soriano, Rafael; Katsaggelos, Aggelos K.Editorial
IEEE
Materia
Out-of-distribution detection Multiple instance learning Variational Autoencoder
Fecha
2025-07-28Referencia bibliográfica
Javier Sáez-Maldonado, F., García, L., Cooper, L. A. D., Goldstein, J. A., Molina, R., & Katsaggelos, A. K. (2025). Using variational autoencoders for out of distribution detection in histological multiple instance learning. IEEE Access: Practical Innovations, Open Solutions, 13, 133351–133369. https://doi.org/10.1109/access.2025.3593420
Patrocinador
MICIU/AEI/10.13039/501100011033 and by NextGenerationEU/PRTR (PID2022-140189OB-C22, TED2021-132178B-I00); U.S. National Institutes of Health National Library of Medicine Award (Grant R01LM013523); U.S. National Institutes of Health (Grant K08EB030120)Resumen
In the context of histological image classification, Multiple Instance Learning (MIL) methods
only require labels at Whole Slide Image (WSI) level, effectively reducing the annotation bottleneck.
However, for their deployment in real scenarios, they must be able to detect the presence of previously
unseen tissues or artifacts, the so-called Out-of-Distribution (OOD) samples. This would allow Computer
Assisted Diagnosis systems to flag samples for additional quality or content control. In this work, we propose
an OOD-aware probabilistic deep MIL model that combines the latent representation from a variational
autoencoder and an attention mechanism. At test time, the latent representations of the instances are used
in the classification and OOD detection tasks. We also propose a deterministic version of the model that
uses the reconstruction error as OOD score. Panda (prostate tissue) and Camelyon16 (lymph node tissue) are
used as train/test in-distribution datasets, obtaining bag classification results competitive with current stateof-the-art models. OOD detection is evaluated performing two experiments for each in-distribution dataset.
For Panda, Camelyon16 and ARTIF (prostate tissue contaminated with artifacts) are used as OOD datasets,
obtaining 100% AUC in both cases. For Camelyon16, Panda and BCELL (lymph node tissue diagnosed with
diffuse large B-cell lymphoma) are used as OOD datasets, obtaining AUCs of 100% and 97%, respectively.
Experimental validation demonstrates the models’ strong classification performance and effective OOD slide
detection, highlighting their clinical potential.





