ANCES: A novel method to repair attribute noise in classification problems
Metadatos
Mostrar el registro completo del ítemEditorial
Elsevier
Materia
attribute noise noise correction noise filtering noisy data classification
Fecha
2022Referencia bibliográfica
José A. Sáez; Emilio Corchado. ANCES: A novel method to repair attribute noise in classification problems. Pattern Recognition, 121, 108198. 2022. doi: 10.1016/j.patcog.2021.108198
Resumen
Noise negatively affects the complexity and performance of models built in classification problems. The most common approach to mitigate its consequences is the usage of preprocessing techniques, known as noise filters, which are designed to remove noisy samples from the training data. Nevertheless, they are specifically oriented to deal with errors affecting class labels. Their employment may not always result in an improvement when noise affects attribute values. In these cases, correcting the errors is an interesting alternative to traditional noise filtering that has not been enough studied so far in the specialized literature. This research proposes an attribute noise correction method with the final aim of increasing the performance of the classification algorithms used later. The identification of noisy data is based on an error score assigned to each one of the attribute values in the dataset, which are then passed through an optimization process to correct their potential noise. The validity of the proposed method is studied in an exhaustive experimental study, in which it is compared to several well-known preprocessing methods to deal with noisy datasets. The results obtained show the suitability of attribute noise correction with respect to the other alternatives when data suffer from attribute noise.