Mostrar el registro sencillo del ítem

dc.contributor.authorHernández-Manrique, Pablo
dc.contributor.authorPeinado Herreros, Antonio Miguel 
dc.contributor.authorGómez García, Ángel Manuel 
dc.date.accessioned2024-12-17T09:42:44Z
dc.date.available2024-12-17T09:42:44Z
dc.date.issued2024-11
dc.identifier.citation"Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking", Proceedings of IberSPEECH 2024, Aveiro, Portugal, 11-13 Nov 2024es_ES
dc.identifier.urihttps://hdl.handle.net/10481/98117
dc.description.abstractSpeech and audio watermarking has been an active research topic during the last thirty years. However, unlike other signal processing techniques, implementations based on deep neural networks (DNN) are relatively recent and many issues remain unexplored. In this paper, we focus on speech watermarking and a key requirement such as the imperceptibility of the watermark. In particular, we explore the application the Perceptual Metric for Speech Quality Evaluation (PMSQE) loss function, originally proposed in the context of speech enhancement, for achieving this goal. In particular, we examine the training trade-offs associated to the watermarking system training procedure and look for a suitable way of incorporating the PMSQE loss. Our experimental results show that the PMSQE loss can, not only meaningfully improve the perceptual quality of the watermarked speech, but also keep, or even improve, other audio quality measures and the bit error rates yielded by attacked signals.es_ES
dc.description.sponsorshipSignal Processing, Multimedia Transmission and Speech/Audio Technologies (TIC234)es_ES
dc.language.isoenges_ES
dc.publisherISCA Archivees_ES
dc.titleIntegrating the Perceptual PMSQE Loss into DNN-based Speech Watermarkinges_ES
dc.typeconference outputes_ES
dc.rights.accessRightsopen accesses_ES
dc.identifier.doi10.21437/IberSPEECH.2024-3
dc.type.hasVersionAMes_ES


Ficheros en el ítem

[PDF]

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem