Rom, Italien  /  03. Juli 2025

International Workshop on Media Verification and Integrity @ IJCNN 2025

VERIMEDIA

Der International Workshop on Media Verification and Integrity VERIMEDIA  findet am 3. Juli 2025 in Rom, Italien, im Rahmen der IJCNN-Konferenz statt und befasst sich mit den wachsenden Herausforderungen durch KI-generierte Inhalte und Deepfakes.

Mit einem Beitrag zur Erkennung von synthetischer Sprach stellt das Fraunhofer IDMT dort aktuelle Forschungsaktivitäten im Bereich der Medienforensik vor.

Towards Explainable Person-of-Interest-based Audio Synthesis Detection

Pianese, Alessandro; Cuccoviello, Luca; Poggi, Giovanni; Le Roux, Thomas; Aichroth, Patrick

Generalization and explainability are two key challenges in synthetic audio detection. Effective detectors should not only reliably classify unseen data from unknown synthesis algorithms, but also provide insight into their decision-making process and explain why a given input was classified as real or fake. To promote generalization we use the Person-of-Interest approach, which allows us to detect synthetic audio using a model trained only on real data, provided that some pristine audio of the putative speaker is provided. To support explainability, we instead use an encoder-decoder backbone such that the bottleneck features ensure syntactic and semantic fidelity to the input, as well as enable reliable decisions. Experiments show that our approach outperforms both state-of-the-art models based on supervised learning and methods based on speaker verification.