Assessing the impact of speaker identity in speech spoofing detection

Dao, Anh-Tuan; Matrouf, Driss; Evans, Nicholas

ICASSP 2026, IEEE International Conference on Acoustics, Speech, and Signal Processing, 4-8 May 2026, Barcelona, Spain

Spoofing detection systems are typically trained using diverse recordings from multiple speakers, often assuming that the resulting embeddings are independent of speaker identity. However, this assumption remains unverified. In this paper, we investigate the impact of speaker information on spoofing detection systems. We propose two approaches within our Speaker-Invariant Multi-Task framework, one that models speaker identity within the embeddings and another that removes it. SInMT integrates multi-task learning for joint speaker recognition and spoofing detection, incorporating a gradient reversal layer. Evaluated using four datasets, our speaker-invariant model reduces the average equal error rate by 17% compared to the baseline, with up to 48% reduction for the most challenging attacks (e.g., A11).

Detail

ARXIV

BIBTEX

Type:

Conference

City:

Barcelona

Date:

2026-05-04

Department:

Digital Security

Eurecom Ref:

8652

© 2026 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.