A novel methodology to classify test cases using natural language processing and imbalanced learning
Metadata
Publisher
Elsevier
Subject
Software testing; Artificial intelligence; Imbalanced classification; Natural language processing; Optimization; IFROWANN; Doc2Vec
Date
2020-08-14
Bibliographic reference
Tahvili, S., Hatvani, L., Ramentol, E., Pimentel, R., Afzal, W., & Herrera, F. (2020). A novel methodology to classify test cases using natural language processing and imbalanced learning. Engineering Applications of Artificial Intelligence, 95, 103878. [https://doi.org/10.1016/j.engappai.2020.103878]
Sponsor
Vinnova; European Union's Horizon 2020 research and innovation program 871319; ERCIM "Alain Bensoussan" Fellowship Programme; Spanish Government TIN2017-89517-P
Abstract
Detecting the dependency between integration test cases plays a vital role in the area of software test
optimization. Classifying test cases into two main classes – dependent and independent – can be employed
for several test optimization purposes such as parallel test execution, test automation, test case selection and
prioritization, and test suite reduction. This task can be seen as an imbalanced classification problem due to
the test cases’ distribution. Often the number of dependent and independent test cases is uneven, which is
related to the testing level, testing environment and complexity of the system under test. In this study, we
propose a novel methodology that consists of two main steps. Firstly, by using natural language processing we
analyze the test cases’ specifications and turn them into a numeric vector. Secondly, by using the obtained data
vectors, we classify each test case into a dependent or an independent class. We carry out a supervised learning
approach using different methods for handling imbalanced datasets. The feasibility and possible generalization
of the proposed methodology are evaluated in two industrial projects at Bombardier Transportation, Sweden,
with promising results.
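
The two steps described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. Plain bag-of-words counts stand in for the Doc2Vec embeddings, and a k-nearest-neighbour vote weighted by inverse class frequency stands in for the imbalanced-learning methods the study compares. The test specifications, labels, and function names (`build_vocab`, `vectorize`, `classify`) are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

# Hypothetical toy specifications: 4 independent vs. 2 dependent (imbalanced).
train_texts = [
    "start the brake system and check pressure signal",
    "open the doors and verify door status lamp",
    "check battery voltage at power up",
    "measure cabin temperature sensor reading",
    "after brake system test passes release brake and verify signal",
    "requires door test result then lock doors",
]
train_labels = ["independent"] * 4 + ["dependent"] * 2


def build_vocab(texts):
    """Map every token seen in the training texts to a vector index."""
    tokens = sorted({tok for t in texts for tok in t.lower().split()})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(text, vocab):
    """Step 1 (stand-in for Doc2Vec): text -> numeric count vector."""
    vec = [0.0] * len(vocab)
    for tok in text.lower().split():
        if tok in vocab:  # unseen tokens are ignored
            vec[vocab[tok]] += 1.0
    return vec


def cosine(a, b):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def classify(text, texts, labels, vocab, k=3):
    """Step 2 (stand-in for imbalanced learning): weighted k-NN vote.

    Each neighbour's vote is its similarity divided by the size of its
    class, so the minority class is not drowned out by the majority.
    """
    class_sizes = Counter(labels)
    query = vectorize(text, vocab)
    neighbours = sorted(
        ((cosine(query, vectorize(t, vocab)), y) for t, y in zip(texts, labels)),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    votes = Counter()
    for sim, label in neighbours:
        votes[label] += sim / class_sizes[label]
    return max(votes, key=votes.get)


vocab = build_vocab(train_texts)
prediction = classify(
    "release brake after brake system test passes",
    train_texts, train_labels, vocab,
)
print(prediction)
```

In practice, the count vectors would be replaced by learned document embeddings and the weighted vote by a dedicated imbalanced classifier. The sketch only shows how the output of the first step feeds the second.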





