Environmental Sound Analysis

AI-based analysis of complex acoustic scenes and sounds

Using cutting-edge AI technologies, we are exploring the untapped potential of environmental sounds for applications in the fields of bioacoustics, noise monitoring, logistics and traffic monitoring, as well as security surveillance at construction sites and public events.

News and upcoming events

Podcast

Follow us into the lab

Our colleague Jakob Abesser and his work were featured in the DLG podcast agriculture about the BioMonitor4CAP project.

Listen to podcast

Event / 17.3.2025

DAS I DAGA 2025

We are presenting our research at the 51st annual conference of the German Acoustical Society (DEGA). Visit us at booth B4-34.

DAS I DAGA 2025

Project / 18.2.2025

BioMonitor4CAP Annual Meeting

Project partners from all over Europe and Peru met in Warsaw for the 2nd Annual Meeting of the BioMonitor4CAP project.

BioMonitor4CAP Meeting

Tabbed contents

Expand all Close all

Research

Capturing information from environmental sounds

Sounds and noises surround us everywhere in our daily lives – as disturbing noise, as the soothing rustle of leaves or as the warning sound of sirens on the street. Humans possess not only the ability to distinguish between important and unimportant sounds but also to derive crucial information about their surroundings through sound interpretation based on their experiences.

"Machine listening" is a subfield of artificial intelligence that aims to replicate this human capability by automatically capturing and interpreting information from environmental sounds. This involves combining signal processing techniques and machine learning and developing algorithms for the analysis, source separation, and classification of music, speech, and environmental sounds. Source separation allows for the decomposition of complex acoustic scenes into their components, i.e., individual sound sources, while classification identifies sounds and assigns them to predefined sound sources or classes.

The developed technologies and solutions find applications in various areas:

Bioacoustics: Identifying animal species, studying behavioral patterns, or monitoring environmental impacts based on acoustic characteristics
Noise monitoring: recording noise data, identifying noise sources and planning noise protection measures
Logistics and traffic monitoring: Counting and classifying vehicles, analyzing traffic flows to improve emergency response planning, and implement traffic management measures
Safety surveillance (construction sites, public events): Detecting hazardous situations, vandalism, or break-ins acoustically

Robust recognition, energy-efficient implementation

General challenges in the analysis of environmental sounds include robust recognition of individual sounds despite high acoustic variability within and between different sound classes. In simple terms, the algorithm must be able to recognize a Dachshund labrador and a Great Dane German shepherd as dogs based on their barking. The strong overlap of multiple static and moving sound sources in complex scenarios further complicates reliable recognition.

When deploying AI algorithms in acoustic sensors, various microphone characteristics and room acoustics effects such as reverberation and reflections can make classification challenging.

Our research also addresses the question of how compact AI models can be trained with minimal training data for deployment on resource-constrained hardware. This is necessary because many deployment locations often lack sufficient or consistent power supply, and long-term analyses may span several days or weeks. Therefore, the models must not be overly large and complex to function for real-time analysis on the devices.

Learning to understand sounds

Our aim is to use the technology for practically relevant issues such as the measurement and investigation of noise pollution, bio- and eco-acoustics as well as construction site and logistics monitoring.

The basic research in the areas of efficient AI models, explainable AI, training with little data and domain adaptation also has the potential to be used across domains in other audio research areas such as speech processing or music signal processing.

Additionally, we conduct research in the context of listening tests and citizen science applications involving participants to explore the subjective perception of noise and other perceptual sound attributes. The aim is to gain a better understanding of which sound sources in everyday situations have a particularly disruptive impact on our perception of noise (and, by extension, our health).

How we proceed

The following methods and procedures are used to analyze environmental noise:

Audio signal processing
Deep learning
Perception of sound signals

Projects and activities

Research project

"StadtLärm" (CityNoise)

Development of a noise monitoring system to support urban noise protection tasks

StadtLärm

Field test project

Open Innovation Lab

Noise monitoring field test project as part of the City of Gelsenkirchen's "Open Innovation Lab"

Open Innovation Lab

Research project

BioMonitor4CAP

Acoustic animal species recognition and classification for improved biodiversity monitoring in agriculture

BioMonitor4CAP

Research project

Construction-sAIt

Multi-modal AI-driven technologies for automatic construction site monitoring

Construction-sAIt

Research project

ISAD 2

Development of explainable and comprehensible deep-learning models to enable a better understanding of the structural and acoustic properties of sound sources (music or environmental sounds)

ISAD 2

Research project

vera.ai

Sound event detection and acoustic scene recognition for the development of trustworthy AI solutions for detecting advanced disinformation techniques in the media sector

vera.ai

Research project

news-polygraph

Sound event detection and acoustic landmark detection for the development of a multi-modal, crowd-supported technology platform for disinformation analysis

news-polygraph

Research project

NeuroSensEar

Sound event detection and acoustic scene recognition for bio-inspired acoustic sensor technology for highly efficient hearing aids

NeuroSensEar

Research project

Sound Surv:AI:llance

Acoustic Burglary Monitoring

SoundSurvAIllance

Publications

use cases

With our technical solutions and services, we provide companies and institutions with concrete support and real added value for their use cases. Contact us to discuss your application!

Automated biodiversity monitoring of animal species

use case

Interested in further use cases?

Here you will find an overview of our use cases.

Overview use cases

Publications

Jahr Year	Titel/Autor:in Title/Author	Publikationstyp Publication Type
2025	Sound recurrence analysis for acoustic scene classification Abeßer, Jakob; Liang, Zhiwei; Seeber, Bernhard	Zeitschriftenaufsatz Journal Article
2025	Automatic Retrieval of Indicator Sounds for Acoustic Geo-Tagging Abeßer, Jakob	Poster
2024	Towards Measuring and Forecasting Noise Exposure at the VELTINS-Arena in Gelsenkirchen, Germany Ngamthipwatthana, Pitchapa; Götze, Marco; Kátai, András; Abeßer, Jakob	Konferenzbeitrag Conference Paper
2024	Selbstüberwachtes Vortraining zur Verbesserung automatischer Audioklassifikationsalgorithmen Grollmisch, Sascha; Abeßer, Jakob; Bös, Joachim	Konferenzbeitrag Conference Paper
2024	Aktuelle Forschungsschwerpunkte in der akustischen Ereignisdetektion Abeßer, Jakob; Grollmisch, Sascha; Bös, Joachim	Konferenzbeitrag Conference Paper
2024	Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol Apostolidis, Konstantinos; Abeßer, Jakob; Cuccovillo, Luca; Vasileios, Mezaris	Konferenzbeitrag Conference Paper
2024	Towards Domain Shift in Location-Mismatch Scenarios for Bird Activity Detection Latifi Bidarouni, Amir; Abeßer, Jakob	Konferenzbeitrag Conference Paper
2023	Investigations on the Implementation of an Acoustic Rain Sensor System Hock, Kevin; Götz, Julian; Seideneck, Mario; Sladeczek, Christoph	Konferenzbeitrag Conference Paper
2023	Human and Machine Performance in Counting Sound Classes in Single-Channel Soundscapes Abeßer, Jakob; Ullah, Asad; Ziegler, Sebastian; Grollmisch, Sascha	Zeitschriftenaufsatz Journal Article
2023	How Robust are Audio Embeddings for Polyphonic Sound Event Tagging? Abeßer, Jakob; Grollmisch, Sascha; Müller, Meinard	Zeitschriftenaufsatz Journal Article
2022	Analyzing Bird and Bat Activity in Agricultural Environments using AI-driven Audio Monitoring Abeßer, Jakob; Wang, Xiaoyi; Bänsch, Svenja; Scherber, Christoph; Lukashevich, Hanna	Konferenzbeitrag Conference Paper
2022	Classifying Sounds in Polyphonic Urban Sound Scenes Abeßer, Jakob	Zeitschriftenaufsatz Journal Article
2022	Construction-sAIt: Multi-modal AI-driven technologies for construction site monitoring Abeßer, Jakob; Loos, Alexander; Sharma, Prachi	Konferenzbeitrag Conference Paper
2021	Improving Semi-Supervised Learning for Audio Classification with FixMatch Grollmisch, Sascha; Cano, Estefanía	Zeitschriftenaufsatz Journal Article
2021	DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection Johnson, David S.; Lorenz, Wolfgang; Taenzer, Michael; Grollmisch, Sascha; Abeßer, Jakob; Lukashevich, Hanna; Mimilakis, Stylianos	Paper
2021	Investigating the influence of microphone mismatch for acoustic traffic monitoring Gourishetti, Saichand; Abeßer, Jakob; Grollmisch, Sascha; Kátai, András; Liebetrau, Judith	Konferenzbeitrag Conference Paper
2021	DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection Johnson, David S.; Lorenz, Wolfgang; Taenzer, Michael; Mimilakis, Stylianos Ioannis; Grollmisch, Sascha; Abeßer, Jakob; Lukashevich, Hanna	Konferenzbeitrag Conference Paper
2021	IDMT-Traffic: An Open Benchmark Dataset for Acoustic Traffic Monitoring Research Abeßer, Jakob; Gourishetti, Saichand; Kátai, András; Clauß, Tobias; Sharma, Prachi; Liebetrau, Judith	Konferenzbeitrag Conference Paper
2020	Sound Event Detection with Depthwise Separable and Dilated Convolutions Drossos, Konstantinos; Mimilakis, Stylianos I.; Gharib, Shayan; Li, Yanxiong; Virtanen, Tuomas	Paper
2020	A Review of Deep Learning Based Methods for Acoustic Scene Classification Abeßer, Jakob	Zeitschriftenaufsatz Journal Article
2020	Identifikation urbaner Geräuschquellen mittels maschineller Lernverfahren Clauß, T.; Abeßer, Jakob	Zeitschriftenaufsatz Journal Article
2020	Analyzing the potential of pre-trained embeddings for audio classification tasks Grollmisch, Sascha; Kehling, Christian; Taenzer, Michael; Cano, E.	Konferenzbeitrag Conference Paper
2019	Smart Solutions to Cope with Urban Noise Pollution Abeßer, Jakob; Kepplinger, Sara	Zeitschriftenaufsatz Journal Article
2018	Stadtlärm - a distributed system for noise level measurement and noise source identification in a smart city environment Clauß, Tobias; Abeßer, Jakob; Lukashevich, Hanna; Gräfe, Robert; Häuser, Franz; Kühn, Christian; Sporer, Thomas	Konferenzbeitrag Conference Paper

Diese Liste ist ein Auszug aus der Publikationsplattform Fraunhofer-Publica

This list has been generated from the publication platform Fraunhofer-Publica

Environmental Sound Analysis

AI-based analysis of complex acoustic scenes and sounds

News and upcoming events

Follow us into the lab

DAS I DAGA 2025

BioMonitor4CAP Annual Meeting

Tabbed contents

Research

Capturing information from environmental sounds

Robust recognition, energy-efficient implementation

Learning to understand sounds

How we proceed

Projects and activities

"StadtLärm" (CityNoise)

Open Innovation Lab

BioMonitor4CAP

Construction-sAIt

ISAD 2

vera.ai

news-polygraph

NeuroSensEar

Sound Surv:AI:llance

Publications

use cases

Automated biodiversity monitoring of animal species

Interested in further use cases?

Publications

Datasets

Contact Press / Media

Dr.-Ing. Jakob Abeßer

Contact Press / Media

Hanna Lukashevich