IV-Nlp: A Methodology to Understand the Behavior of DL Models and Its Application from a Causal Approach

Guzman-Monteza, Yudi; Fernández Luna, Juan Manuel; Ribadas Pena, Francisco J.

doi:10.3390/electronics14081676

electronics-14-01676-v2.pdf (3.426Mb)

Identificadores

URI: https://hdl.handle.net/10481/104164

DOI: 10.3390/electronics14081676

Exportar

Editorial

MDPI

Materia

Natural language processing (NLP)

Synthetic data generation

Causal inference

Fecha

2025-04-21

Referencia bibliográfica

Guzman-Monteza, Y.; Fernandez-Luna, J.M.; Ribadas-Pena, F.J. IV-Nlp: A Methodology to Understand the Behavior of DL Models and Its Application from a Causal Approach. Electronics 2025, 14, 1676. [https://doi.org/10.3390/electronics14081676]

Resumen

Integrating causal inference and estimation methods, especially in Natural Language Processsing (NLP), is essential to improve interpretability and robustness in deep learning (DL) models. The objectives are to present the IV-NLP methodology and its application. IV-NLP integrates two approaches. The first defines the process of the inference and estimation of the causal effect in original, predicted, and synthetic data. The second one includes a validation method of the results obtained by the selected Large- Language Model (LLM). IV-NLP proposes to use synthetic data in predictive tasks only if the causal effect pattern of the synthetic data is aligned with the causal effect pattern of the original data. DL models, the Instrumental Variable (IV) method, statistical methods, and GPT-3.5-turbo-0125 were used for its application, including an intervention method using a variation of the Retrieval-Augmented Generation (RAG) technique. Our findings reveal notable discrepancies between the original and synthetic data, highlighting that the synthetic data do not fully capture the underlying causal effect patterns of the original data, evidencing homogeneity and low diversity in the synthetic data. Interestingly, when evaluating the causal effect in the predictions made by our three best DL models, it was verified that the model with the lowest accuracy (84.50%) was fully aligned with the overall causal effect pattern. These results demonstrate the potential of integrating DL and LLM models with causal inference methods.

Colecciones

DCCIA - Artículos

Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional