• français 
    • español
    • English
    • français
  • FacebookPinterestTwitter
  • español
  • English
  • français
Voir le document 
  •   Accueil de DIGIBUG
  • 1.-Investigación
  • Departamentos, Grupos de Investigación e Institutos
  • Grupo: Signal Processing, Multimedia Transmission and Speech/Audio Technologies (TIC234)
  • TIC234 - Comunicación Congresos, Conferencias...
  • Voir le document
  •   Accueil de DIGIBUG
  • 1.-Investigación
  • Departamentos, Grupos de Investigación e Institutos
  • Grupo: Signal Processing, Multimedia Transmission and Speech/Audio Technologies (TIC234)
  • TIC234 - Comunicación Congresos, Conferencias...
  • Voir le document
JavaScript is disabled for your browser. Some features of this site may not work without it.

Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking

[PDF] Artículo pdf (202.9Ko)
Identificadores
URI: https://hdl.handle.net/10481/98117
DOI: 10.21437/IberSPEECH.2024-3
Exportar
RISRefworksMendeleyBibtex
Estadísticas
Statistiques d'usage de visualisation
Metadatos
Afficher la notice complète
Auteur
Hernández-Manrique, Pablo; Peinado Herreros, Antonio Miguel; Gómez García, Ángel Manuel
Editorial
ISCA Archive
Date
2024-11
Referencia bibliográfica
"Integrating the Perceptual PMSQE Loss into DNN-based Speech Watermarking", Proceedings of IberSPEECH 2024, Aveiro, Portugal, 11-13 Nov 2024
Patrocinador
Signal Processing, Multimedia Transmission and Speech/Audio Technologies (TIC234)
Résumé
Speech and audio watermarking has been an active research topic during the last thirty years. However, unlike other signal processing techniques, implementations based on deep neural networks (DNN) are relatively recent and many issues remain unexplored. In this paper, we focus on speech watermarking and a key requirement such as the imperceptibility of the watermark. In particular, we explore the application the Perceptual Metric for Speech Quality Evaluation (PMSQE) loss function, originally proposed in the context of speech enhancement, for achieving this goal. In particular, we examine the training trade-offs associated to the watermarking system training procedure and look for a suitable way of incorporating the PMSQE loss. Our experimental results show that the PMSQE loss can, not only meaningfully improve the perceptual quality of the watermarked speech, but also keep, or even improve, other audio quality measures and the bit error rates yielded by attacked signals.
Colecciones
  • TIC234 - Comunicación Congresos, Conferencias...

Mon compte

Ouvrir une sessionS'inscrire

Parcourir

Tout DIGIBUGCommunautés et CollectionsPar date de publicationAuteursTitresSujetsFinanciaciónPerfil de autor UGRCette collectionPar date de publicationAuteursTitresSujetsFinanciación

Statistiques

Statistiques d'usage de visualisation

Servicios

Pasos para autoarchivoAyudaLicencias Creative CommonsSHERPA/RoMEODulcinea Biblioteca UniversitariaNos puedes encontrar a través deCondiciones legales

Contactez-nous | Faire parvenir un commentaire