Show simple item record

dc.contributor.author: Minutolo, Aniello
dc.contributor.author: Fujita, Hamido
dc.date.accessioned: 2022-10-17T12:41:11Z
dc.date.available: 2022-10-17T12:41:11Z
dc.date.issued: 2022-09-19
dc.identifier.citation: Minutolo, A... [et al.]. A multi-level methodology for the automated translation of a coreference resolution dataset: an application to the Italian language. Neural Comput & Applic (2022). [https://doi.org/10.1007/s00521-022-07641-3] [es_ES]
dc.identifier.uri: https://hdl.handle.net/10481/77364
dc.description.abstract: In the last decade, the demand for readily accessible corpora has touched all areas of natural language processing, including coreference resolution. However, it is one of the least considered sub-fields in recent developments. Moreover, almost all existing resources are only available for the English language. To overcome this lack, this work proposes a methodology to create a corpus for coreference resolution in Italian exploiting knowledge of annotated resources in other languages. Starting from OntoNotes, the methodology translates and refines English utterances to obtain utterances respecting Italian grammar, dealing with language-specific phenomena and preserving coreference and mentions. A quantitative and qualitative evaluation is performed to assess the well-formedness of generated utterances, considering readability, grammaticality, and acceptability indexes. The results have confirmed the effectiveness of the methodology in generating a good dataset for coreference resolution starting from an existing one. The goodness of the dataset is also assessed by training a coreference resolution model based on the BERT language model, achieving promising results. Even if the methodology has been tailored for the English and Italian languages, it has a general basis easily extendable to other languages, adapting a small number of language-dependent rules to generalize most of the linguistic phenomena of the language under examination. [es_ES]
dc.language.iso: eng [es_ES]
dc.publisher: Springer [es_ES]
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [*]
dc.subject: Coreference resolution [es_ES]
dc.subject: Corpus creation [es_ES]
dc.subject: Automated translation [es_ES]
dc.subject: Cross-language [es_ES]
dc.subject: Natural language processing [es_ES]
dc.subject: Linguistic phenomena [es_ES]
dc.title: A multi-level methodology for the automated translation of a coreference resolution dataset: an application to the Italian language [es_ES]
dc.type: info:eu-repo/semantics/article [es_ES]
dc.rights.accessRights: info:eu-repo/semantics/openAccess [es_ES]
dc.identifier.doi: 10.1007/s00521-022-07641-3
dc.type.hasVersion: info:eu-repo/semantics/publishedVersion [es_ES]


Files in this item

[PDF]

This item appears in the following collection(s)


Except where otherwise noted, this item's license is described as Attribution 4.0 International