Dual-channel eKF-RTF framework for speech enhancement with DNN-based speech presence estimation Martín Doñas, Juan M. Peinado Herreros, Antonio Miguel López Espejo, Iván Gómez García, Ángel Manuel Dual-microphone smartphone Beamforming Extended Kalman filter Speech presence probability Deep Neural Network This paper presents a dual-channel speech enhance- ment framework that effectively integrates deep neural net- work (DNN) mask estimators. Our framework follows a beamforming-plus-postfiltering approach intended for noise reduction on dual-microphone smartphones. An extended Kalman filter is used for the estimation of the relative acous- tic channel between microphones, while the noise estimation is performed using a speech presence probability estimator. We propose the use of a DNN estimator to improve the prediction of the speech presence probabilities without making any assump- tion about the statistics of the signals. We evaluate and compare different dual-channel features to improve the accuracy of this estimator, including the power and phase difference between the speech signals at the two microphones. The proposed in- tegrated scheme is evaluated in different reverberant and noisy environments when the smartphone is used in both close- and far-talk positions. The experimental results show that our ap- proach achieves significant improvements in terms of speech quality, intelligibility, and distortion when compared to other approaches based only on statistical signal processing. 2023-03-14T08:07:47Z 2023-03-14T08:07:47Z 2021-03 info:eu-repo/semantics/conferenceObject https://hdl.handle.net/10481/80569 10.21437/IberSPEECH.2021-7 eng http://creativecommons.org/licenses/by-nc-nd/4.0/ info:eu-repo/semantics/openAccess Attribution-NonCommercial-NoDerivatives 4.0 Internacional Proceedings of IBERSPEECH 2021