Missing Value Imputation Designs and Methods of Nature-Inspired Metaheuristic Techniques: A Systematic Review
Metadatos
Afficher la notice complèteEditorial
IEEE
Materia
Missing value Missing data Imputation Imputation Incomplete dataset Metaheuristic Systematic review
Date
2022-05-09Referencia bibliográfica
P. C. Chiu... [et al.]. "Missing Value Imputation Designs and Methods of Nature-Inspired Metaheuristic Techniques: A Systematic Review," in IEEE Access, vol. 10, pp. 61544-61566, 2022, doi: [10.1109/ACCESS.2022.3172319]
Patrocinador
Ministry of Education, Malaysia FRGS/1/2018/ICT04/UTM/01/1; Malaysia Research University Network (MRUN) 4L876; SPEV Project through the Faculty of Informatics and Management, University of Hradec Kralove, Czech Republic, "Smart Solutions in Ubiquitous Computing Environments'' 2102-2022Résumé
Missing values are highly undesirable in real-world datasets. The missing values should be
estimated and treated during the preprocessing stage. With the expansion of nature-inspired metaheuristic
techniques, interest in missing value imputation (MVI) has increased. The main goal of this literature is to
identify and review the existing research on missing value imputation (MVI) in terms of nature-inspired
metaheuristic approaches, dataset designs, missingness mechanisms, and missing rates, as well as the most
used evaluation metrics between 2011 and 2021. This study ultimately gives insight into how the MVI plan
can be incorporated into the experimental design. Using the systematic literature review (SLR) guidelines
designed by Kitchenham, this study utilizes renowned scienti c databases to retrieve and analyze all relevant
articles during the search process. A total of 48 related articles from 2011 to 2021 were selected to assess
the review questions. This review indicated that the synthetic missing dataset is the most popular baseline
test dataset to evaluate the effectiveness of the imputation strategy. The study revealed that missing at
random (MAR) is the most common proposed missing mechanism in the datasets. This review also indicated
that the hybridizations of metaheuristics with clustering or neural networks are popular among researchers.
The superior performance of the hybrid approaches is signi cantly attributed to the power of optimized
learning in MVI models. In addition, perspectives, challenges, and opportunities in MVI are also addressed
in this literature. The outcome of this review serves as a toolkit for the researchers to develop effective MVI
models.