dc.contributor.author: Khanday, Owais Mujtaba
dc.contributor.author: Rodríguez San Esteban, Pablo
dc.contributor.author: Ahmad, Zubair
dc.contributor.author: Ouellet, Marc
dc.contributor.author: González López, José Andrés
dc.date.accessioned: 2025-09-02T07:09:47Z
dc.date.available: 2025-09-02T07:09:47Z
dc.date.issued: 2025-08-17
dc.identifier.citation: Khanday, O.M., Esteban, P.R.S., Lone, Z.A., Ouellet, M., Gonzalez-Lopez, J.A. (2025) Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings. Proc. Interspeech 2025, 5553-5557, doi: 10.21437/Interspeech.2025-1400 [es_ES]
dc.identifier.uri: https://hdl.handle.net/10481/105970
dc.description.abstract: Understanding how neural activity encodes speech and language production is a fundamental challenge in neuroscience and artificial intelligence. This study investigates whether embeddings from large-scale, self-supervised language and speech models can effectively reconstruct high-gamma neural activity characteristics, key indicators of cortical processing, recorded during speech production. We use pre-trained embeddings from deep learning models on linguistic and acoustic data to map high-level speech features onto high-gamma signals. We analyze the extent to which these embeddings preserve the spatio-temporal dynamics of brain activity. Reconstructed neural signals are evaluated against high-gamma ground-truth activity using correlation metrics and signal reconstruction quality assessments. The results indicate that high-gamma activity was effectively reconstructed using language and speech model embeddings, yielding Pearson correlation coefficients of 0.79–0.99 across all participants. [es_ES]
dc.description.sponsorship: This work was supported by the grant PID2022-141378OB-C22 funded by MICIU/AEI/10.13039/501100011033 and ERDF/EU. [es_ES]
dc.language.iso: eng [es_ES]
dc.publisher: ISCA [es_ES]
dc.rights: Attribution-NonCommercial-ShareAlike 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [*]
dc.title: Recreating Neural Activity During Speech Production with Language and Speech Model Embeddings [es_ES]
dc.type: conference output [es_ES]
dc.rights.accessRights: open access [es_ES]
dc.identifier.doi: 10.21437/Interspeech.2025-1400
dc.type.hasVersion: AO [es_ES]


Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 4.0 International