ODSS: An Open Dataset of Synthetic Speech

Reference

A. Yaroshchuk, C. Papastergiopoulos, L. Cuccovillo, P. Aichroth, K. Votis and D. Tzovaras

An Open Dataset of Synthetic Speech, 2023 IEEE International Workshop on Information Forensics and Security (WIFS), Nürnberg, Germany, 2023, pp. 1-6

DOI: 10.1109/WIFS58808.2023.10374863

Dataset Overview

ODSS is a multilingual, multispeaker dataset of synthetic and natural speech, designed to foster research and benchmarking of novel studies on synthetic speech detection. 

ODSS comprises audio utterances generated from text by state-of-the-art synthesis methods, paired with their corresponding natural counterparts. The synthetic audio data includes several languages, with an equal representation of genders.

More audio datasets

Fraunhofer IDMT has compiled further audio data sets for various research areas.

Research topic

Media Forensics

Trustworthy media content