Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings

Khanday, Owais Mujtaba; Rodríguez San Esteban, Pablo; Ahmad, Zubair; Ouellet, Marc; González López, José Andrés

doi:10.21437/Interspeech.2025-1400

dc.contributor.author	Khanday, Owais Mujtaba
dc.contributor.author	Rodríguez San Esteban, Pablo
dc.contributor.author	Ahmad, Zubair
dc.contributor.author	Ouellet, Marc
dc.contributor.author	González López, José Andrés
dc.date.accessioned	2025-09-02T07:09:47Z
dc.date.available	2025-09-02T07:09:47Z
dc.date.issued	2025-08-17
dc.identifier.citation	Khanday, O.M., Esteban, P.R.S., Lone, Z.A., Ouellet, M., Gonzalez-Lopez, J.A. (2025) Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings. Proc. Interspeech 2025, 5553-5557, doi: 10.21437/Interspeech.2025-1400	es_ES
dc.identifier.uri	https://hdl.handle.net/10481/105970
dc.description.abstract	Understanding how neural activity encodes speech and language production is a fundamental challenge in neuroscience and artificial intelligence. This study investigates whether embeddings from large-scale, self-supervised language and speech models can effectively reconstruct high-gamma neural activity characteristics, key indicators of cortical processing, recorded during speech production. We use pre-trained embeddings from deep learning models on linguistic and acoustic data to map high-level speech features onto high-gamma signals. We analyze the extent to which these embeddings preserve the spatio-temporal dynamics of brain activity. Reconstructed neural signals are evaluated against high-gamma ground-truth activity using correlation metrics and signal reconstruction quality assessments. The results indicate High-gamma activity was effectively reconstructed using language and speech model embeddings, yielding Pearson correlation coefficients of 0.79–0.99 across all participants.	es_ES
dc.description.sponsorship	This work was supported by the grant PID2022-141378OB-C22 funded by MICIU/AEI/10.13039/501100011033 and ERDF/EU.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	ISCA	es_ES
dc.rights	Atribución-NoComercial-CompartirIgual 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	*
dc.title	Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings	es_ES
dc.type	conference output	es_ES
dc.rights.accessRights	open access	es_ES
dc.identifier.doi	10.21437/Interspeech.2025-1400
dc.type.hasVersion	AO	es_ES

Files in this item

Name:: main.pdf
Size:: 314.9Kb
Format:: PDF

This item appears in the following Collection(s)

TIC234 - Comunicación Congresos, Conferencias...

Show simple item record

Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 4.0 Internacional