Prediction of sentiment polarity in restaurant reviews using an ordinal regression approach based on evolutionary XGBoost
Metadatos
Mostrar el registro completo del ítemAutor
Al-Qudah, Dana A.; Al-Zoubi, Ala’ M.; Cristea, Alexandra I.; Merelo Guervos, Juan Julián; Castillo Valdivieso, Pedro Ángel; Faris, HossamEditorial
PeerJ, Ltd.
Materia
Ordinal regression Sentiment polarity Evolutionary Particle swarm optimisation XGBoost
Fecha
2025-01-09Referencia bibliográfica
Al-Qudah DA, Al-Zoubi AM, Cristea AI, Merelo-Guervós JJ, Castillo PA, Faris H. 2025. Prediction of sentiment polarity in restaurant reviews using an ordinal regression approach based on evolutionary XGBoost. PeerJ Comput. Sci. 11:e2370 [DOI: 10.7717/peerj-cs.2370]
Patrocinador
Ministerio Español de Ciencia e Innovación (MCIN/AEI/10.13039/501100011033)Resumen
As the business world shifts to the web and tremendous amounts of data become available on multilingual mobile applications, new business and research challenges and opportunities have been explored. This research aims to intensify the usage of data analytics, machine learning, and sentiment analysis of textual data to classify customers’ reviews, feedback, and ratings of businesses in Jordan’s food and restaurant industry. The main methods used in this research were sentiment polarity (to address the challenges posed by businesses to automatically apply text analysis) and bio-metric techniques (to systematically identify users’ emotional states, so reviews can be thoroughly understood). The research was extended to deal with reviews in Arabic, dialectic Arabic, and English, with the main focus on the Arabic language, as the application examined (Talabat) is based in Jordan. Arabic and English reviews were collected from the application, and a new model was proposed to sentimentally analyze reviews. The proposed model has four main stages: data collection, data preparation, model building, and model evaluation. The main purpose of this research is to study the problem expressed above using a model of ordinal regression to overcome issues related to misclassification. Additionally, an automatic multi-language prediction approach for online restaurant reviews was proposed by combining the eXtreme gradient boosting (XGBoost) and particle swarm optimization (PSO) techniques for the ordinal regression of these reviews. The proposed PSO-XGB algorithm showed superior results when compared to support vector machine (SVM) and other optimization methods in terms of root mean square error (RMSE) for the English and Arabic datasets. Specifically, for the Arabic dataset, PSO-XGB achieved an RMSE value of 0.7722, whereas PSO-SVM achieved an RSME value of 0.9988.