Mostrar el registro sencillo del ítem

dc.contributor.authorGarcía Ruíz, Celia
dc.contributor.authorMartín Doñas, Juan M.
dc.contributor.authorGómez García, Ángel Manuel 
dc.date.accessioned2023-03-16T07:08:53Z
dc.date.available2023-03-16T07:08:53Z
dc.date.issued2022-11
dc.identifier.urihttps://hdl.handle.net/10481/80607
dc.description.abstractDeep learning techniques have widely been applied to speech enhancement as they show outstanding modeling capa- bilities that are needed for proper speech-noise separation. In contrast to other end-to-end approaches, masking-based meth- ods consider speech spectra as input to the deep neural network, providing spectral masks for noise removal or attenuation. In these approaches, the Short-Time Fourier Transform (STFT) and, particularly, the parameters used for the analysis/synthesis window, plays an important role which is often neglected. In this paper, we analyze the effects of window length and shift on a complex-domain convolutional-recurrent neural network (DCCRN) which is able to provide, separately, magnitude and phase corrections. Different perceptual quality and intelligibil- ity objective metrics are used to assess its performance. As a re- sult, we have observed that phase corrections have an increased impact with shorter window sizes. Similarly, as window overlap increases, phase takes more relevance than magnitude spectrum in speech enhancement.es_ES
dc.description.sponsorshipProject PID2019-104206GB-I00 funded by MCIN/AEI/10.13039/501100011033.es_ES
dc.language.isoenges_ES
dc.publisherISCA - Iberspeech 2022es_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectSpeech enhancementes_ES
dc.subjectDeep neural networkes_ES
dc.subjectShort Time Fourier Transformes_ES
dc.subjectComplex spectral maskinges_ES
dc.titleThe role of window length and shift in complex-domain DNN-based speech enhancementes_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.identifier.doi10.21437/IberSPEECH.2022-30
dc.type.hasVersioninfo:eu-repo/semantics/submittedVersiones_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional