ASASVIcomtech: The Vicomtech-UGR Speech Deepfake Detection and SASV Systems for the ASVspoof5 Challenge
Metadata
Show full item recordAuthor
Martín Doñas, Juan Manuel; Roselló Casado, Eros; Gómez García, Ángel Manuel; Álvarez, Aitor; López Espejo, Iván; Peinado Herreros, Antonio MiguelMateria
ASVspoof5 Challenge anti-spoofing Deepfakes Deep learning
Date
2024-08Sponsorship
Project PID2022-138711OB-I00 funded by MICIU/AEI/10.13039/501100011033 and by ERDF/EU; FPI grant PRE2022-000363; European Union’s Horizon Europe re- search and innovation programme in the context of project EITHOS under Grant Agreement No. 101073928Abstract
This paper presents the work carried out by the ASASVI- comtech team, made up of researchers from Vicomtech and University of Granada, for the ASVspoof5 Challenge. The team has participated in both Track 1 (speech deepfake detection) and Track 2 (spoofing-aware speaker verification). This work started with an analysis of the challenge available data, which was regarded as an essential step to avoid later potential biases of the trained models, and whose main conclusions are presented here. With respect to the proposed approaches, a closed- condition system employing a deep complex convolutional recurrent architecture was developed for Track 1, although, un- fortunately, no noteworthy results were achieved. On the other hand, different possibilities of open-condition systems, based on leveraging self-supervised models, augmented training data from previous challenges, and novel vocoders, were explored for both tracks, finally achieving very competitive results with an ensemble system.