A hybrid TwinSVM-HHO model for multilingual spam review detection using sentiment features and pre-trained embeddings

Al-Zoubi, Ala’ M.; Mora García, Antonio Miguel; Faris, Hossam; Qaddoura, Raneem

DOI: https://doi.org/10.1016/j.eswa.2025.128160

dc.contributor.author	Al-Zoubi, Ala’ M.
dc.contributor.author	Mora García, Antonio Miguel
dc.contributor.author	Faris, Hossam
dc.contributor.author	Qaddoura, Raneem
dc.date.accessioned	2026-01-28T12:03:26Z
dc.date.available	2026-01-28T12:03:26Z
dc.date.issued	2025-08-25
dc.identifier.citation	Ala’ M. Al-Zoubi, Antonio M. Mora, Hossam Faris, Raneem Qaddoura, A hybrid TwinSVM-HHO model for multilingual spam review detection using sentiment features and pre-trained embeddings, Expert Systems with Applications, Volume 287, 2025, 128160, ISSN 0957-4174, https://doi.org/10.1016/j.eswa.2025.128160. (https://www.sciencedirect.com/science/article/pii/S0957417425017804)	es_ES
dc.identifier.uri	https://hdl.handle.net/10481/110424
dc.description.abstract	The detection of spam reviews in multilingual environments remains a challenging task due to linguistic diversity, data imbalance, and semantic complexity. This paper proposes a novel hybrid model that integrates Twin Support Vector Machine (TwinSVM) with Harris Hawks Optimization (HHO) for simultaneous parameter optimization and feature selection. To enhance semantic understanding, sentiment-based features are incorporated alongside pre-trained word embedding models—BERT, FastText, and MUSE—across English, Arabic, and Spanish datasets. Our approach generates 24 high-quality datasets using embeddings with 100 and 400 dimensions, including a combined multilingual set. Experimental results demonstrate that our proposed HHO-TwinSVM model consistently outperforms conventional classifiers and metaheuristic-enhanced SVMs, achieving accuracy improvements of up to 9.44 % and enhanced robustness in low-resource languages. This integrated framework represents a scalable and adaptable solution for multilingual spam detection. Four detailed experiments were conducted in this study, each designed to address and demonstrate a specific aspect of the proposed approach. Across all experiments, the method outperformed existing algorithms, achieving impressive accuracy rates of 92.9741 %, 89.0314 %, 80.3580 %, and 85.0859 % on Arabic, English, Spanish, and multilingual datasets, respectively. Subsequently, sentiment analysis features were incorporated to further enhance detection performance, resulting in improvements of 1.0994 %, 2.6674 %, 9.4430 %, and 8.7448 %, respectively. A comprehensive analysis of the experimental results, including the influence of reviews and sentiment features, is also presented.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Elsevier	es_ES
dc.rights	Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/	es_ES
dc.subject	Multilingual analysis	es_ES
dc.subject	SPAM detection	es_ES
dc.subject	SPAM Review	es_ES
dc.subject	Sentiment Analysis	es_ES
dc.subject	Support Vector Machines	es_ES
dc.subject	SVM	es_ES
dc.subject	Harris Hawk Optimization	es_ES
dc.subject	HHO	es_ES
dc.subject	Embedding	es_ES
dc.title	A hybrid TwinSVM-HHO model for multilingual spam review detection using sentiment features and pre-trained embeddings	es_ES
dc.type	journal article	es_ES
dc.rights.accessRights	embargoed access	es_ES
dc.identifier.doi	https://doi.org/10.1016/j.eswa.2025.128160
dc.type.hasVersion	AM	es_ES

Files in this item

Name: ESWA - SPAM Review TwinSVM-HHO ...
Size: 3.382Mb
Format: PDF
Description: Published article

This item appears in the following collection(s)

DTSTC - Articles

Except where otherwise noted, this item's license is described as Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License