Mostrar el registro sencillo del ítem

dc.contributor.authorMartín Doñas, Juan M.
dc.contributor.authorPeinado Herreros, Antonio Miguel 
dc.contributor.authorLópez Espejo, Iván
dc.contributor.authorGómez García, Ángel Manuel 
dc.date.accessioned2023-03-14T08:07:47Z
dc.date.available2023-03-14T08:07:47Z
dc.date.issued2021-03
dc.identifier.urihttps://hdl.handle.net/10481/80569
dc.description.abstractThis paper presents a dual-channel speech enhance- ment framework that effectively integrates deep neural net- work (DNN) mask estimators. Our framework follows a beamforming-plus-postfiltering approach intended for noise reduction on dual-microphone smartphones. An extended Kalman filter is used for the estimation of the relative acous- tic channel between microphones, while the noise estimation is performed using a speech presence probability estimator. We propose the use of a DNN estimator to improve the prediction of the speech presence probabilities without making any assump- tion about the statistics of the signals. We evaluate and compare different dual-channel features to improve the accuracy of this estimator, including the power and phase difference between the speech signals at the two microphones. The proposed in- tegrated scheme is evaluated in different reverberant and noisy environments when the smartphone is used in both close- and far-talk positions. The experimental results show that our ap- proach achieves significant improvements in terms of speech quality, intelligibility, and distortion when compared to other approaches based only on statistical signal processing.es_ES
dc.description.sponsorshipSpanish Ministry of Science and Innovation Project No. PID2019-104206GB- I00/AEI/10.13039/501100011033es_ES
dc.description.sponsorshipSpanish Ministry of Uni- versities through the National Program FPU (grant reference FPU15/04161)es_ES
dc.language.isoenges_ES
dc.publisherProceedings of IBERSPEECH 2021es_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectDual-microphone smartphonees_ES
dc.subjectBeamforminges_ES
dc.subjectExtended Kalman filteres_ES
dc.subjectSpeech presence probabilityes_ES
dc.subjectDeep Neural Networkes_ES
dc.titleDual-channel eKF-RTF framework for speech enhancement with DNN-based speech presence estimationes_ES
dc.typeconference outputes_ES
dc.rights.accessRightsopen accesses_ES
dc.identifier.doi10.21437/IberSPEECH.2021-7
dc.type.hasVersionSMURes_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional