On the disagreement problem in Human-in-the-Loop federated machine learning

Huelser, Matthias; Mueller, Heimo; Díaz Rodríguez, Natalia Ana; Holzinger, Andreas

doi:10.1016/j.jii.2025.100827

1-s2.0-S2452414X25000512-main.pdf (2.486Mb)

Identificadores

URI: https://hdl.handle.net/10481/104484

DOI: 10.1016/j.jii.2025.100827

Exportar

Editorial

Elsevier

Materia

Human-Centric

AI Human-in-the-Loop

Human–machine interaction

Fecha

2025-03-19

Referencia bibliográfica

M. Huelser et al. Journal of Industrial Information Integration 45 (2025) 100827 [https://doi.org/10.1016/j.jii.2025.100827]

Patrocinador

Austrian Science Fund (FWF), Project: P-32554; European Union’s Horizon 2020 research and innovation programme under grant agreement No. 826078 (Feature Cloud); European Union’s Horizon 2020 research and innovation programme under grant agreement No. 101079183 (BioMedAI TWINNING); Knowledge Generation Project PID2023-149128NB-I00, funded by the Ministry of Science, Innovation and Universities of Spain (Proyectos de generación de conocimiento n◦ PID2023-149128NB-I00)

Resumen

The popularity of Artificial Intelligence (AI) has risen sharply in recent years, revolutionizing applications in most sectors with unprecedented functionalities. Milestones and achievements like ChatGPT demonstrate not only the impressive capabilities of AI, but also how accessible such technologies have become in recent times. However, the success of AI applications depends heavily on the underlying information integration processes. Among the most important processes are the training of the AI model at the core of the application and the collection and pre-processing of training data. In particular, the task of collecting high-quality training data can be very costly and resource-intensive, as in many cases large amounts of data have to be annotated manually. Human annotators must have extensive expertise for certain tasks in order to provide high-quality training data. In this paper, we present a framework to maximize the efficiency of human experts in a Machine Learning (ML) scenario, with the aim of optimizing the use of human expertise in active learning. This is done by constantly measuring the quality of human experts’ input, as well as by involving human annotators only when needed. We showcase the benefits of our proposed framework by applying it to a problem in image classification, proving its usefulness to reduce the cost of annotating training data. The source code of the framework is publicly available at https://github.com/human-centered-ai-lab/app-HITL-annotator.

Colecciones

OpenAIRE (Open Access Infrastructure for Research in Europe)

Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional