Fine-Tuning BERT Models for Intent Recognition Using a Frequency Cut-Off Strategy for Domain-Specific Vocabulary Extension

Fernández Martínez, Fernando; Griol Barres, David; Callejas Carrión, Zoraida

doi:10.3390/app12031610

dc.contributor.author	Fernández Martínez, Fernando
dc.contributor.author	Griol Barres, David
dc.contributor.author	Callejas Carrión, Zoraida
dc.date.accessioned	2022-03-08T07:22:40Z
dc.date.available	2022-03-08T07:22:40Z
dc.date.issued	2022-02-03
dc.identifier.citation	Fernández-Martínez, F... [et al.]. Fine-Tuning BERT Models for Intent Recognition Using a Frequency Cut-Off Strategy for Domain-Specific Vocabulary Extension. Appl. Sci. 2022, 12, 1610. [https://doi.org/10.3390/app12031610]	es_ES
dc.identifier.uri	http://hdl.handle.net/10481/73197
dc.description	The work leading to these results was supported by the Spanish Ministry of Science and Innovation through the R& D&i projects GOMINOLA (PID2020-118112RB-C21 and PID2020118112RB-C22, funded by MCIN/AEI/10.13039/501100011033), CAVIAR (TEC2017-84593-C2-1-R, funded by MCIN/ AEI/10.13039/501100011033/FEDER "Una manera de hacer Europa"), and AMICPoC (PDC2021-120846-C42, funded by MCIN/AEI/10.13039/501100011033 and by "the European Union "NextGenerationEU/PRTR"). This research also received funding from the European Union's Horizon2020 research and innovation program under grant agreement No 823907 (http://menhirproject.eu, accessed on 2 February 2022). Furthermore, R.K.'s research was supported by the Spanish Ministry of Education (FPI grant PRE2018-083225).	es_ES
dc.description.abstract	Intent recognition is a key component of any task-oriented conversational system. The intent recognizer can be used first to classify the user’s utterance into one of several predefined classes (intents) that help to understand the user’s current goal. Then, the most adequate response can be provided accordingly. Intent recognizers also often appear as a form of joint models for performing the natural language understanding and dialog management tasks together as a single process, thus simplifying the set of problems that a conversational system must solve. This happens to be especially true for frequently asked question (FAQ) conversational systems. In this work, we first present an exploratory analysis in which different deep learning (DL) models for intent detection and classification were evaluated. In particular, we experimentally compare and analyze conventional recurrent neural networks (RNN) and state-of-the-art transformer models. Our experiments confirmed that best performance is achieved by using transformers. Specifically, best performance was achieved by fine-tuning the so-called BETO model (a Spanish pretrained bidirectional encoder representations from transformers (BERT) model from the Universidad de Chile) in our intent detection task. Then, as the main contribution of the paper, we analyze the effect of inserting unseen domain words to extend the vocabulary of the model as part of the fine-tuning or domain-adaptation process. Particularly, a very simple word frequency cut-off strategy is experimentally shown to be a suitable method for driving the vocabulary learning decisions over unseen words. The results of our analysis show that the proposed method helps to effectively extend the original vocabulary of the pretrained models. We validated our approach with a selection of the corpus acquired with the Hispabot-Covid19 system obtaining satisfactory results.	es_ES
dc.description.sponsorship	Spanish Ministry of Science and Innovation (MCIN/AEI) PID2020-118112RB-C21 PID2020118112RB-C22 PDC2021-120846-C42	es_ES
dc.description.sponsorship	Spanish Ministry of Science and Innovation (MCIN/AEI/FEDER "Una manera de hacer Europa") TEC2017-84593-C2-1-R	es_ES
dc.description.sponsorship	Spanish Ministry of Science and Innovation (European Union "NextGenerationEU/PRTR") PDC2021-120846-C42	es_ES
dc.description.sponsorship	European Commission 823907	es_ES
dc.description.sponsorship	German Research Foundation (DFG) PRE2018-083225	es_ES
dc.language.iso	eng	es_ES
dc.publisher	MDPI	es_ES
dc.rights	Atribución 3.0 España	*
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/es/	*
dc.subject	Topic classification	es_ES
dc.subject	Intent detection	es_ES
dc.subject	Conversational systems	es_ES
dc.subject	Recurrent networks	es_ES
dc.subject	Attentive RNN	es_ES
dc.subject	Attentive LSTM	es_ES
dc.subject	Transformer models	es_ES
dc.subject	Transfer learning	es_ES
dc.title	Fine-Tuning BERT Models for Intent Recognition Using a Frequency Cut-Off Strategy for Domain-Specific Vocabulary Extension	es_ES
dc.type	journal article	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/823907	es_ES
dc.rights.accessRights	open access	es_ES
dc.identifier.doi	10.3390/app12031610
dc.type.hasVersion	VoR	es_ES

Ficheros en el ítem

Nombre:: applsci-12-01610-v2.pdf
Tamaño:: 1.307Mb
Formato:: PDF

Este ítem aparece en la(s) siguiente(s) colección(ones)

OpenAIRE (Open Access Infrastructure for Research in Europe)
Publicaciones financiadas por Framework Programme 7, Horizonte 2020, Horizonte Europa... del European Research Council de la Unión Europea en el marco del Proyecto OpenAIRE que promueve el acceso abierto a Europa.

Mostrar el registro sencillo del ítem

Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 3.0 España