Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking

Hernández-Manrique, Pablo; Peinado Herreros, Antonio Miguel; Gómez García, Ángel Manuel

doi:10.21437/IberSPEECH.2024-3

dc.contributor.author	Hernández-Manrique, Pablo
dc.contributor.author	Peinado Herreros, Antonio Miguel
dc.contributor.author	Gómez García, Ángel Manuel
dc.date.accessioned	2024-12-17T09:42:44Z
dc.date.available	2024-12-17T09:42:44Z
dc.date.issued	2024-11
dc.identifier.citation	"Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking", Proceedings of IberSPEECH 2024, Aveiro, Portugal, 11-13 Nov 2024	es_ES
dc.identifier.uri	https://hdl.handle.net/10481/98117
dc.description.abstract	Speech and audio watermarking has been an active research topic during the last thirty years. However, unlike other signal processing techniques, implementations based on deep neural networks (DNN) are relatively recent and many issues remain unexplored. In this paper, we focus on speech watermarking and a key requirement such as the imperceptibility of the watermark. In particular, we explore the application the Perceptual Metric for Speech Quality Evaluation (PMSQE) loss function, originally proposed in the context of speech enhancement, for achieving this goal. In particular, we examine the training trade-offs associated to the watermarking system training procedure and look for a suitable way of incorporating the PMSQE loss. Our experimental results show that the PMSQE loss can, not only meaningfully improve the perceptual quality of the watermarked speech, but also keep, or even improve, other audio quality measures and the bit error rates yielded by attacked signals.	es_ES
dc.description.sponsorship	Signal Processing, Multimedia Transmission and Speech/Audio Technologies (TIC234)	es_ES
dc.language.iso	eng	es_ES
dc.publisher	ISCA Archive	es_ES
dc.title	Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking	es_ES
dc.type	conference output	es_ES
dc.rights.accessRights	open access	es_ES
dc.identifier.doi	10.21437/IberSPEECH.2024-3
dc.type.hasVersion	AM	es_ES

Files in this item

Name:: paper.pdf
Size:: 202.9Kb
Format:: PDF
Description:: Artículo pdf

This item appears in the following Collection(s)

TIC234 - Comunicación Congresos, Conferencias...

Show simple item record