Managing Borderline and Noisy Examples in Imbalanced Classification by Combining SMOTE with Ensemble Filtering
Identificadores
URI: https://hdl.handle.net/10481/77962Metadata
Show full item recordEditorial
Springer
Materia
Classification Imbalanced data SMOTE Class noise Noise filters
Date
2014Referencia bibliográfica
Published version: Sáez, J.A... [et al.] (2014). Managing Borderline and Noisy Examples in Imbalanced Classification by Combining SMOTE with Ensemble Filtering. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2014. IDEAL 2014. Lecture Notes in Computer Science, vol 8669. Springer, Cham. [https://doi.org/10.1007/978-3-319-10840-7_8]
Sponsorship
Regional Projects P1O-TIC-06858 P11-TIC-9704 P12-TIC-2958 NCN-2013/11/B/5T6/00963; National Project TIN2011-28488; Spanish GovernmentAbstract
Imbalance data constitutes a great difficulty for most algorithms
learning classifiers. However, as recent works claim, class imbalance
is not a problem in itself and performance degradation is also associated
with other factors related to the distribution of the data as the presence of
noisy and borderline examples in the areas surrounding class boundaries.
This contribution proposes to extend SMOTE with a noise filter called
Iterative-Partitioning Filter (IPF), which can overcome these problems.
The properties of this proposal are discussed in a controlled experimental
study against SMOTE and its most well-known generalizations. The
results show that the new proposal performs better than exiting SMOTE
generalizations for all these different scenarios.