On the influence of class noise in medical data classification: Treatment using noise filtering methods
Metadatos
Afficher la notice complèteEditorial
Taylor & Francis
Date
2016Referencia bibliográfica
José A. Sáez; Bartosz Krawczyk; Michal Wozniak. On the influence of class noise in medical data classification: Treatment using noise filtering methods. Applied Artificial Intelligence, 30(6), 590-609. 2016. doi: 10.1080/08839514.2016.1193719
Résumé
Classification systems play an important role in medical decision support, because they allow automatizing and accelerating the data analysis process. However, their quality is based on that of the training dataset upon which the classification models are built. The labeling process of each training example is usually performed by domain experts or automatic systems. When a wrong assignment of class labels to examples is performed, the training process and, therefore, the classification performance, might be negatively affected. This problem is formally known as class label noise. One of the most used techniques to reduce the harmful consequences of mislabeled objects is noise filtering, which removes noisy examples from the training data. This article analyzes the usefulness of such methods in the context of medical data classification. The experiments carried out on several real-world datasets show the importance of noise filtering when class noise affects the data.